https://bugzilla.wikimedia.org/show_bug.cgi?id=28956
Quim Gil <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |[email protected] --- Comment #7 from Quim Gil <[email protected]> --- Ariel has proposed this feature request as a Google Summer of Code Project at http://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#XML_dumps We have accepted it and a shorter version is now listed at https://www.mediawiki.org/wiki/Summer_of_Code_2013#Incremental_data_dumps Pasting here the recommendation for a implementation from Ariel, just in case: This could be achieved by designing the right output format for the XML files containing text for all revisions. It would need: a smart choice for compression of multiple items together, an index into the compressed blocks, a way to remove content quickly, possibly leaving zeroed blocks bhind, a way to re-use empty blocks. To use the new archive format, we would need tools to convert to bz2 or 7z (so users can keep all their existing scripts for the dumps), a format for storing isolated sets of changes (so dump users can download just these sets), a script to apply those changes to the above format (so users can run the script against the change set and their full dump to update their copy). It would likely need to take as input an XML file of new pages and new revisions for old pages, as well as a list of pages and/or revisions that have been deleted in the meantime; this would entail no changes to MediaWiki core, all of the work would be done by a separate set of tools. -- You are receiving this mail because: You are watching all bug changes. _______________________________________________ Wikibugs-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
