A few months ago I successfully downloaded the November 2006 HTML version of 
Wikipedia (about 6GB expanding to 90GB) and the October 2008 xml.bz2 file 
(4.1GB converted to 7.1GB Wikitaxi format). 
I have just downloaded the June 2008 HTML version in .tar.7z format and 
extracted into .tar format (14.3GB.to 230GB). I now have no idea what to do 
next. I ran WinRAR on it and it gave up after more than 6 million files. 
1. How do I actually access all this information? I use the Wikitaxi version, 
but only the HTML version allows access to, for instance, categories, so the 
latest version would be useful. 
2. Is there any way to recompress it to a reasonable size such that I can still 
access it without it occupying nearly all my disk?3. Or, failing that, is there 
any way to access the original .tar.7z file, as BzReader can access .xml.bz2 
files?


      
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to