https://bugzilla.wikimedia.org/show_bug.cgi?id=72348

            Bug ID: 72348
           Summary: Wikidata dumps contain old-style serialization.
           Product: Datasets
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: General/Unknown
          Assignee: ar...@wikimedia.org
          Reporter: daniel.kinz...@wikimedia.de
                CC: gsv...@gmail.com
       Web browser: ---
   Mobile Platform: ---

Some time ago, we changed the serialization format of wikidata items. For
consistency, we implemented on-the-fly conversion to the new format in the
exporter (using the ContentHandler::exportTransform facility). 

This seems to work fine with Special:Export, and when I try it with
dumpBackup.php locally. However, the dumps like
wikidatawiki-20141009-pages-articles.xml.bz2 still contain revisions with the
old style format, both . 

Is this because new revisions get stitched into old dumps? That's the only
explanation I currently have. If this is the case, how do we reset this, so all
revisions get re-exported? If this is not the case, how can we investigate what
is going wrong?

One alternative explanation would be if the host that generates the dump was
running an old version of wikibase, I suppose.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to