Bug ID: 62109
           Summary: Add canonical namespaces and aliases to XML dumps
           Product: Datasets
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: General/Unknown
       Web browser: ---
   Mobile Platform: ---

The XML dump contains a siteinfo header with a <namespaces> tag that is very
useful for processing the text in the dumps.  It looks something like this:

<mediawiki ...snip... >
    <generator>MediaWiki 1.23wmf15</generator>
      <namespace key="-2" case="first-letter">Մեդիա</namespace>
      <namespace key="-1" case="first-letter">Սպասարկող</namespace>
      <namespace key="0" case="first-letter" />
      <namespace key="1" case="first-letter">Քննարկում</namespace>
      <namespace key="2" case="first-letter">Մասնակից</namespace>



Regretfully, this header does not include canonical namespace names or
namespace aliases.  However, an API request for "meta=siteinfo" does include
these bits.  For example, the call for|namespacealiases
returns the following XML:

      <ns id="-2" case="first-letter" canonical="Media"
      <ns id="-1" case="first-letter" canonical="Special"
      <ns id="0" case="first-letter" content="" xml:space="preserve" />
      <ns id="1" case="first-letter" subpages="" canonical="Talk"
      <ns id="2" case="first-letter" subpages="" canonical="User"


      <ns id="6" xml:space="preserve">Image</ns>
      <ns id="7" xml:space="preserve">Image talk</ns>

The XML dump should be updated to include this important metadata about

You are receiving this mail because:
You are on the CC list for the bug.
Wikibugs-l mailing list

Reply via email to