daniel closed this task as "Invalid".
daniel added a comment.
The grep run did not turn up any old style serialization in the dump, so I'm
closing this as "invalid". For good measure, I'm now double-checking by looking
for the other pattern I suggested above, `"label":{`:
bzgrep '"label":{' wikidatawiki-20150330-pages-meta-history.xml.bz2
Note that you may encounter the following when importing XML dumps:
- redirects encoded as JSON
- broken serialization of empty maps as lists ([] instead of {}).
- entity serializations missing fields (e.g. no badges in sitelinks).
Generally, code processing old revisions should be robust, since fields may be
serialized in a different order, fields may become optional, or fields can be
added. But the overall structure should always be the same. You should however
no longer encounter "old style serialization", which has a completely different
structure.
TASK DETAIL
https://phabricator.wikimedia.org/T74348
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: hoo, daniel
Cc: JanZerebecki, Jimkont, Liuxinyu970226, Wikidata-bugs, Tobi_WMDE_SW, jayvdb,
Svick, ArielGlenn, Ricordisamoa, mark, Lydia_Pintscher, jeremyb-phone, daniel,
Manybubbles, hoo, RobH, aude, faidon, fgiunchedi, Dzahn, jeremyb, chasemp
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs