Hi,
I hate to be resurrecting an old thread, but I think for the purpose of
completion I would like to post my experience with the Import of XML
Dumps of Wikipedia into Mediawiki, so that it would help someone else
looking for this information. I started this thread after all.
Mohamed Magdy wrote:
I don't remember if I already mentioned this: you can split the xml
file * into smaller pieces then import it using importDump.php.
Use a loop to make a file like this and then run it:
#!/bin/bash
php maintenance/importDump.php /path/pagexml.1
wait
php
Thanks Joshua. I am intending to try two approaches. The first being to
use the xml2sql and then fill the rest of the tables with the individual
dumps of the Tables that are already provided in SQL. The second would
be using Mwdumper – and then import the rest of the Tables using the SQL
Dumps
Daniel Kinzler wrote:
That sounds very *very* odd. because page content is imported as-is in both
cases, it's not processed in any way. The only thing I can imagine is that
things don't look right if you don't have all the templates imported yet.
Thanks Daniel. Yes, I think that this may be
--- El dom, 8/3/09, O. O. olson...@yahoo.com escribió:
I thought that the
pages-articles.xml.bz2 (i.e. the XML Dump) contains
the templates – but I did not find a way to do install it
separately.
No, it only contains a dump of the current version of each article (involving
the
O. O. schrieb:
Daniel Kinzler wrote:
That sounds very *very* odd. because page content is imported as-is in both
cases, it's not processed in any way. The only thing I can imagine is that
things don't look right if you don't have all the templates imported yet.
Thanks Daniel. Yes, I think
Felipe Ortega wrote:
--- El dom, 8/3/09, O. O. olson...@yahoo.com escribió:
I thought that the
pages-articles.xml.bz2 (i.e. the XML Dump) contains
the templates – but I did not find a way to do install it
separately.
No, it only contains a dump of the current version of each
Daniel Kinzler wrote:
O. O. schrieb:
I thought that the pages-articles.xml.bz2 (i.e. the XML Dump) contains
the templates – but I did not find a way to do install it separately.
They should be contained. As it sais on the download page: Articles,
templates,
image descriptions, and
Thanks Joshua. I would prefer that you post to the Mailing List / Newsgroup –
so that all can benefit from your ideas.
--- El dom 8-mar-09, Joshua C. Lerner jler...@gmail.com escribió:
De: Joshua C. Lerner jler...@gmail.com
Asunto: Re: [Wikitech-l] Importing Wikipedia XML Dumps
Platonides schrieb:
O. Olson wrote:
Does anyone have experience importing the Wikipedia XML Dumps into
MediaWiki. I made an attempt with the English Wiki Dump as well as the
Portuguese Wiki Dump, giving php (cli) 1024 MB of Memory in the php.ini
file. Both of these attempts fail with out of
Platonides wrote:
Don't use importDump.php for a whole wiki dump, use MWDumper
http://www.mediawiki.org/wiki/MWDumper
Thanks Platonides. I am just curious why does
http://www.mediawiki.org/wiki/Manual:Importing_XML_dumps#Using_importDump.php
say that importDump.php is the recommended
Daniel Kinzler wrote:
Platonides schrieb:
MWDumper doesn't fill the secondary link tables. Please see
http://www.mediawiki.org/wiki/Manual:Importing_XML_dumps for detailed
instructions and considerations.
Also keep in mind that the english wikipedia is *huge*. You will need a decent
Is this on MW older than 1.14? You may want to disable profiling if it is
on.
-Aaron
--
From: O. O. olson...@yahoo.com
Sent: Saturday, March 07, 2009 10:28 PM
To: wikitech-l@lists.wikimedia.org
Subject: Re: [Wikitech-l] Importing Wikipedia XML
Jason Schulz wrote:
Is this on MW older than 1.14? You may want to disable profiling if it is
on.
-Aaron
Thanks Jason/Aaron. No, this is the recent MW 1.14 – downloaded in the
beginning of this week from http://www.mediawiki.org/wiki/Download.
Hi,
I am not sure if this is the correct place to ask this – if not then please
let me know which is the best place for such a question.
Does anyone have experience importing the Wikipedia XML Dumps into
MediaWiki. I made an attempt with the English Wiki Dump as well as the
15 matches
Mail list logo