On 03/12/2011 13:18, Tim Starling wrote:
> On 03/12/11 08:58, Platonides wrote:
>> On 02/12/11 22:33, Khalida BEN SIDI AHMED wrote:
>>> Hello,
>>> I need an html dump of Wikipedia but the link
>>> http://static.wikipedia.org/ does not work.
>>> I'd appreciate any explanation or suggestion.
>>>
>>> Regards
>>> Ben Sidi Ahmed
>>
>> Why do you need an html dump of Wikipedia?
> 
> It's a huge task to set up MediaWiki in precisely the same way as it
> is on Wikimedia, to import an XML dump and to generate HTML. It takes
> a serious amount of hardware and software development resources.
> That's why I spent so much time making HTML dump scripts. It's just a
> pity that nobody cared enough about it to keep the project going.

The DumpHTML MediaWiki extension is an essential piece of software:
https://www.mediawiki.org/wiki/Extension:DumpHTML

This is IMO the right approach and the only way to do high-quality
static dumps. I have been using it for many years, and all the ZIM files
I have made were produced with Tim's MediaWiki DumpHTML extension:
http://download.kiwix.org/zim/0.9/

At Kiwix we currently focus mostly on the end-user software, but we
still want to do everything necessary to have an open, efficient, and
handy toolchain for creating static dumps from MediaWiki instances
(in particular in the ZIM format).

That is why we have a small action plan to improve DumpHTML:
http://www.kiwix.org/index.php/Mediawiki_DumpHTML_extension_improvement
Any comments or criticism are welcome.

If hackers are interested in working on DumpHTML, please let me know;
we are currently working on getting a grant for that, and it is well on
its way.

Emmanuel

_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l