Hi Federico, I don't have insights into how that was handled. I've cc'd Xabriel Collazo Mojica from the Data Engineering team who can help answer this question. You can also follow up at https://phabricator.wikimedia.org/T414389 with your question.
Best, Kinneret On Thu, Feb 12, 2026 at 12:08 PM Federico Leva (Nemo) <[email protected]> wrote: > Thanks for the update. > > Il 09/02/26 13:56, Kinneret Gordon via Wiki-research-l ha scritto: > > You can read the full announcement here: > > https://lists.wikimedia.org/hyperkitty/list/xmldatadumps- > > [email protected]/thread/E6D5EU4PMSTSOI2J7A46HJ3YW2W554CS/, > > and view the full documentation at: > > https://wikitech.wikimedia.org/wiki/MediaWiki_Content_File_Exports. > > How was the filename structure decided? Was there a reason to drop the > previous conventions? For example, the two files > itwiki-2026-02-01-p2p4267764.xml.bz2 and > itwiki-2026-02-01-p100066p103578.xml.bz2 are named very similarly, but > one contains full history and the other only current versions. > > Best, > Federico > _______________________________________________ Wiki-research-l mailing list -- [email protected] To unsubscribe send an email to [email protected]
