I've copied the old cdb code into mwlib.cdb<https://github.com/doozan/mwlib.cdb>on github. I also resurrected the old xhtmlwriter as mwlib.xhtml <https://github.com/doozan/mwlib.xhtml>. I can submit both packages to pypi if all of the credits and attributions look okay (none of this is actually my code, it just the old code from mwlib)
-- Jeff On Thursday, July 26, 2012 1:34:17 PM UTC-4, UltraNurd wrote: > > I had been using mwlib to interface with Wikipedia dumps for a research > project. The first step I did was to run: > > mw-buildcdb --input enwiki-latest-pages-articles.2012.03.26.xml.bz2 --output > 2012.03.26.no-redirects --ignore-redirects > > My wikiconf.txt in that output directory sets the type to nucdb. In my > Python code that operates on the dump, I call mwlib.wiki.makewiki on that > conf file. > > I saw in the commit log on GitHub that CDB support was going away. What is > the current correct method for loading a Wikipedia dump file? > -- You received this message because you are subscribed to the Google Groups "mwlib" group. To view this discussion on the web visit https://groups.google.com/d/msg/mwlib/-/d6PAIDjl9dcJ. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/mwlib?hl=en.
