Wikistats: Pageview complete dumps

Maintained by the Wikimedia Analytics team Link to the dumps

Pageview complete is our best effort to provide a comprehensive timeseries of per-article pageview data for Wikimedia projects. Data spans from December 2007 to the present with a uniform format and compression.

Features of the dataset

Details on data segments

Sets of daily files are derived from the best data available at the time:

Data format

Compression of dataset is similar to that of pagecounts-ez: bzip files with hourly data embedded on each row, following this format:

Hourly counts can be deciphered as follows:

Hour:
from 0 to 23, written as 0 = A, 1 = B ... 22 = W, 23 = X


All Analytics datasets are available under the Creative Commons CC0 dedication.