Hi Federico,

I don't have insights into how that was handled. I've cc'd Xabriel Collazo
Mojica from the Data Engineering team who can help answer this question.
You can also follow up at https://phabricator.wikimedia.org/T414389 with
your question.

Best,
Kinneret


On Thu, Feb 12, 2026 at 12:08 PM Federico Leva (Nemo) <[email protected]>
wrote:

> Thanks for the update.
>
> Il 09/02/26 13:56, Kinneret Gordon via Wiki-research-l ha scritto:
> > You can read the full announcement here:
> > https://lists.wikimedia.org/hyperkitty/list/xmldatadumps-
> > [email protected]/thread/E6D5EU4PMSTSOI2J7A46HJ3YW2W554CS/,
> > and view the full documentation at:
> > https://wikitech.wikimedia.org/wiki/MediaWiki_Content_File_Exports.
>
> How was the filename structure decided? Was there a reason to drop the
> previous conventions? For example, the two files
> itwiki-2026-02-01-p2p4267764.xml.bz2 and
> itwiki-2026-02-01-p100066p103578.xml.bz2 are named very similarly, but
> one contains full history and the other only current versions.
>
> Best,
>         Federico
>
_______________________________________________
Wiki-research-l mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to