[Xmldatadumps-l] Change in multistream dump file production

2019-01-19 Thread Ariel Glenn WMF
TL;DR: Don't panic, the single articles multistream bz2 file for big wikis will be produced shortly after the new smaller fles. Long version: For big wikis which already have split up article files, we now produce one multistream file per article file. These are now recombined into a single file l

[Xmldatadumps-l] mwbzutils BREAKING CHANGE

2019-01-19 Thread Ariel Glenn WMF
If you use recompressxml in the mwbzutils package, as of version 0.0.9 (just deployed) it no longer writes bz2 compressed data by default to stdout; instead it relies on the extension of the output file and will write either gzipped, bz2 or plain text output, accordingly. This means that if it is d