https://bugzilla.wikimedia.org/show_bug.cgi?id=28956

Quim Gil <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]

--- Comment #7 from Quim Gil <[email protected]> ---
Ariel has proposed this feature request as a Google Summer of Code Project at
http://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#XML_dumps

We have accepted it and a shorter version is now listed at
https://www.mediawiki.org/wiki/Summer_of_Code_2013#Incremental_data_dumps

Pasting here the recommendation for a implementation from Ariel, just in case:

This could be achieved by designing the right output format for the XML files
containing text for all revisions. It would need: a smart choice for
compression of multiple items together, an index into the compressed blocks, a
way to remove content quickly, possibly leaving zeroed blocks bhind, a way to
re-use empty blocks. To use the new archive format, we would need tools to
convert to bz2 or 7z (so users can keep all their existing scripts for the
dumps), a format for storing isolated sets of changes (so dump users can
download just these sets), a script to apply those changes to the above format
(so users can run the script against the change set and their full dump to
update their copy). It would likely need to take as input an XML file of new
pages and new revisions for old pages, as well as a list of pages and/or
revisions that have been deleted in the meantime; this would entail no changes
to MediaWiki core, all of the work would be done by a separate set of tools.

-- 
You are receiving this mail because:
You are watching all bug changes.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to