knwiki dump progress on 20200901
This is the Wikimedia dump service.
Please read the copyrights information.
See Meta:Data dumps
for documentation on the provided data formats.
The 7zip decoder on Windows is known to have
problems with some bz2-format
files for larger wikis; we recommend the use of bzip2 for Windows for these cases.
Please report problems with these dumps on Phabricator and add the
Dumps-generation tag.
See all databases list.
Last dumped on 2020-08-20
For a machine-readable version of the information on this page,
see the json status file.
Dump complete
Verify downloaded files against the (md5), (sha1) checksums
to check for corrupted files.
- 2020-09-02 04:40:20 done Articles, templates, media/file descriptions, and primary meta-pages, in multiple bz2 streams, 100 pages per stream
- 2020-09-03 04:34:39 done All pages with complete edit history (.7z)
- 2020-09-03 04:27:36 done All pages with complete page edit history (.bz2)
b'2020-09-03 04:27:17: knwiki (ID 63110) 118740 pages (44.3|298884.7/sec all|curr), 959138 revs (358.1|347.4/sec all|curr), 99.5%|99.5% prefetched (all|curr), ETA 2020-09-03 04:29:33 [max 1007764]'
- 2020-09-03 03:42:33 done Log events to all pages and users.
- 2020-09-02 21:51:22 done All pages, current versions only.
- 2020-09-01 22:52:32 done Articles, templates, media/file descriptions, and primary meta-pages.
- 2020-09-01 09:49:17 done First-pass for page XML data dumps
- 2020-09-03 03:42:04 done Extracted page abstracts for Yahoo
b'2020-09-03 03:42:01: knwiki (ID 61100) 183 pages (76.5|76.5/sec all|curr), 182 revs (76.1|76.1/sec all|curr), ETA 2020-09-03 04:10:47 [max 131427]'
- 2020-09-03 03:31:52 done List of all page titles
- 2020-09-03 03:31:47 done List of page titles in main namespace
- 2020-09-03 03:31:42 done Namespaces, namespace aliases, magic words.
- 2020-09-01 13:32:03 done User group assignments.
- 2020-09-01 13:31:51 done List of annotations (tags) for revisions and log entries
- 2020-09-01 13:30:21 done Wiki media/files usage records.
- 2020-09-01 13:31:57 done Nonexistent pages that have been protected.
- 2020-09-01 13:32:16 done This contains the SiteMatrix information from meta.wikimedia.org provided as a table.
- 2020-09-01 13:32:09 done Newer per-page restrictions table.
- 2020-09-01 13:31:02 done Language proficiency information per user.
- 2020-09-01 13:30:27 done Wiki category membership link records.
- 2020-09-01 13:31:07 done A few statistics such as the page count.
- 2020-09-01 13:32:21 done Past user group assignments.
- 2020-09-01 13:32:27 done Name/value pairs for pages.
- 2020-09-01 13:32:38 done Annotation (tag) names and ids.
- 2020-09-01 13:31:27 done Base per-page data (id, title, old restrictions, etc).
- 2020-09-01 13:32:33 done Category information.
- 2020-09-01 13:31:33 done Metadata on current versions of uploaded media/files.
- 2020-09-01 13:30:37 done Wiki page-to-page link records.
- 2020-09-01 13:31:21 done Redirect list
- 2020-09-01 13:31:46 done Wiki external URL link records.
- 2020-09-01 13:30:50 done Wiki template inclusion link records.
- 2020-09-01 13:31:14 done Wiki interlanguage link records.
- 2020-09-01 13:30:57 done Tracks which pages use which Wikidata items or properties and what aspect (e.g. item label) is used.
- 2020-09-01 13:31:39 done Interwiki link tracking records
- 2020-09-01 13:30:43 done List of pages' geographical coordinates