On 03/12/2011 13:18, Tim Starling wrote: > On 03/12/11 08:58, Platonides wrote: >> On 02/12/11 22:33, Khalida BEN SIDI AHMED wrote: >>> Hello, >>> I need an html dump of Wikipedia but the link http://static.wikipedia.org/ >>> does >>> not work. >>> I'd appreciate any explanation or suggestion. >>> >>> Regards >>> Ben Sidi Ahmed >> >> Why do oyu need an html dump of Wikipedia? > > It's a huge task to set up MediaWiki in precisely the same way as it > is on Wikimedia, to import an XML dump and to generate HTML. It takes > a serious amount of hardware and software development resources. > That's why I spent so much time making HTML dump scripts. It's just a > pity that nobody cared enough about it to keep the project going.
The DumpHTML Mediawiki extension is an essential piece of software: https://www.mediawiki.org/wiki/Extension:DumpHTML This is IMO the good approach and the only way to do high-quality static dumps. I have been using it since many years and all ZIM files I made were done using Tim's Mediawiki DumpHTML extension. http://download.kiwix.org/zim/0.9/ At Kiwix we currently pretty much focus on the end-user software but we still want to do everything necessary for having an open/efficient/handful toolchain to create static dumps from Mediawiki instances (in particular in the ZIM format). That is the reason why we have an small action plan to improve DumpHTML http://www.kiwix.org/index.php/Mediawiki_DumpHTML_extension_improvement Any comment or critic is welcome. If hackers are interested in working on DumpHTML, please let me know ; we currently work to get a grant for that, and this is on the good way. Emmanuel
signature.asc
Description: OpenPGP digital signature
_______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
