Do the Wikimedia xml dump scripts even use php / MediaWiki at all?  I am
aware of some python scripts.

Please check with Ariel.

Katie
On Apr 14, 2014 12:47 PM, "Daniel Kinzler" <[email protected]>
wrote:

> Hi all!
>
> Context: We plan to change the XML dumps (and Special:Export) to use the
> same
> JSON serialization that is used by the API, instead of the terse but
> brittle
> "internal" format. This is about the mechanism we plan to use for the
> conversion.
>
> SO, I just went and checked my assertion that WikiExporter will use the
> Content
> object's serialize method to generate output. I WAS WRONG. It doesn't.
> I'll use
> the text from the database, as-is (for reference, find the call to
> Revision::getRevisionText in Export.php).
>
> In order to force a conversion to the new format, we'll need to patch core
> to a)
> inject a hook here to override the default behavior or b) make it always
> use a
> Content object (unless, perhaps, told explicitly not to).
>
> This is not hard to code, but doing it Right (tm) may need some
> discussion, and
> getting it merged may need some time.
>
> Sorry for not checking this earlier.
> Daniel
>
> --
> Daniel Kinzler
> Senior Software Developer
>
> Wikimedia Deutschland
> Gesellschaft zur Förderung Freien Wissens e.V.
>
> _______________________________________________
> Wikidata-tech mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
>
_______________________________________________
Wikidata-tech mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech

Reply via email to