Hi Lukas! That really shouldn't happen...
Can you tell me on which item that happens? Also, please double-check the
namespace and content model of the respective entry in the dump.

-- daniel

On 21.10.2014 17:02, Lukas Benedix wrote:
> Different keys can still be found in the actual xml dump
> wikidatawiki-20141009-pages-articles.xml.bz2.
>
> This bug/feature is also present in the current dump with history.
>
> page_id  wd_id   keys
> 111      Q15     ['aliases', 'claims', 'descriptions', 'id', 'labels', 'sitelinks', 'type']
> 137      Q24     ['aliases', 'claims', 'description', 'entity', 'label', 'links']
> 31500    Q28119  ['aliases', 'description', 'entity', 'label', 'links']
> 225144   ?       ['entity', 'redirect']
> 3916689  P6      ['aliases', 'claims', 'datatype', 'descriptions', 'id', 'labels', 'type']
> 3916937  P10     ['aliases', 'claims', 'datatype', 'description', 'entity', 'label']
>
> Lukas
>
> On Thu 09.10.2014 19:32, Lydia Pintscher wrote:
>> On Thu, Oct 9, 2014 at 3:19 PM, Magnus Manske
>> <[email protected]> wrote:
>>> I managed to do the task at hand by switching to JSON dumps (because
>>> that's the new, officially supported, long-term-stable Wikidata dump
>>> format, right? Right???), so no hurry there.
>>>
>>> Maybe the XML dump process was run in the middle of the switch to the
>>> new format, or got a stale cache for some items?
>>
>> It looks like the switch happened in the middle of a dump creation so
>> this one is half old and half new format mixed. The ones after that
>> should be all new format. And yay for switching to JSON!
>>
>> Cheers
>> Lydia

--
Daniel Kinzler
Senior Software Developer

Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.

_______________________________________________
Wikidata-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
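
For anyone who wants to check their own copy of the dump, here is a minimal sketch that sorts a revision's JSON text into old or new serialization, going only by the key sets in the listing quoted above. Treating those keys as format markers is an assumption drawn from that listing, not a documented rule, and the function and constant names are made up for illustration.

```python
import json

# Key sets copied from the listing quoted in this thread. Using them as
# format markers is an assumption, not a documented Wikibase rule.
NEW_FORMAT_KEYS = {"id", "type", "labels", "descriptions", "sitelinks"}
OLD_FORMAT_KEYS = {"entity", "label", "description", "links"}


def classify_entity(text):
    """Return 'redirect', 'new', 'old', 'mixed' or 'unknown' for one revision text."""
    try:
        data = json.loads(text)
    except ValueError:
        # Not JSON at all (e.g. wikitext from another namespace).
        return "unknown"
    if not isinstance(data, dict):
        return "unknown"

    keys = set(data)
    if "redirect" in keys:
        return "redirect"

    has_new = bool(keys & NEW_FORMAT_KEYS)
    has_old = bool(keys & OLD_FORMAT_KEYS)
    if has_new and has_old:
        return "mixed"
    if has_new:
        return "new"
    if has_old:
        return "old"
    return "unknown"
```

Running this over the text of each page in the XML dump and counting the results per namespace should show quickly whether the old-format entries cluster in one place.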
