https://bugzilla.wikimedia.org/show_bug.cgi?id=27618

           Summary: Backup dumps could contain a title index
           Product: MediaWiki
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: Maintenance scripts
        AssignedTo: wikibugs-l@lists.wikimedia.org
        ReportedBy: s...@ludd.net
                CC: innocentkil...@gmail.com


There are several readers available for MediaWiki xml.bz2 dumps: some read the
native format directly, and others transform the data first.

All of them suffer from the lack of an index into this data, which is a major
barrier to development and to adoption by users.

The simplest remedy would be to register a dump filter which creates a text
file mapping article title -> byte offset.  If this is done during the backup
process, there is almost no resource overhead.
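
A rough sketch of what such an index could look like, written in Python
purely for illustration (an actual dump filter would hook into the backup
process itself). The function names and the tab-separated output format are
made up here, and the code assumes that <page> and <title> each start on
their own line and that offsets are measured in the uncompressed XML stream:

```python
from xml.sax.saxutils import unescape


def build_title_index(stream):
    """Yield (title, byte_offset) for each <page> in an XML dump stream.

    `stream` must be a binary file-like object. Offsets are relative to
    the start of the uncompressed XML (an assumption of this sketch).
    """
    offset = 0          # byte position of the current line's first byte
    page_offset = None  # offset of the most recent unmatched <page> tag
    for line in stream:
        stripped = line.strip()
        if stripped.startswith(b"<page>"):
            page_offset = offset + line.index(b"<page>")
        elif page_offset is not None and stripped.startswith(b"<title>"):
            end = stripped.find(b"</title>")
            if end != -1:
                raw = stripped[len(b"<title>"):end].decode("utf-8")
                # Titles are XML-escaped in the dump; undo that here.
                yield unescape(raw, {"&quot;": '"'}), page_offset
            page_offset = None
        offset += len(line)


def write_title_index(stream, out):
    """Write one 'title<TAB>offset' line per page to a text file object."""
    for title, page_offset in build_title_index(stream):
        out.write(f"{title}\t{page_offset}\n")
```

A reader could then seek to the recorded offset in the decompressed stream
(for example one opened with bz2.open(path, "rb")) and parse a single page.
Seeking within the compressed file itself would additionally need bz2 block
boundaries, which this sketch does not attempt to record.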

I can write a patch if other developers agree this would be a worthwhile
pursuit.
