Re: [Wikitech-l] Non-linear search in an XML dump

2018-09-03 Thread Daniel Kinzler
If I read the code in WikiExporter.php correctly, dumps are currently ordered by page ID. However, I would not consider this a guarantee. I'd recommend to assume that the content of a dump are in no particular order, and that the order is subject to change without notice. -- daniel Am

Re: [Wikitech-l] Non-linear search in an XML dump

2018-09-03 Thread Jaime Crespo
Not that this is offtopic here, but you will find probably more knowledgeable people and probably a quicker response at the specialized list https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l On Mon, Sep 3, 2018 at 3:06 PM BinĂ¡ris wrote: > Hi, > > As far as I understand, pages in an

[Wikitech-l] Non-linear search in an XML dump

2018-09-03 Thread BinĂ¡ris
Hi, As far as I understand, pages in an XML dump are in the order of their original creation. This does not correspond to the page ID, because if a page gets a new id after deletion and restore or renaming to that title or anything, the order still remains the original. But this sortkey itself is