I had been using mwlib to interface with Wikipedia dumps for a research project. The first step I did was to run:
mw-buildcdb --input enwiki-latest-pages-articles.2012.03.26.xml.bz2 --output 2012.03.26.no-redirects --ignore-redirects My wikiconf.txt in that output directory sets the type to nucdb. In my Python code that operates on the dump, I call mwlib.wiki.makewiki on that conf file. I saw in the commit log on GitHub that CDB support was going away. What is the current correct method for loading a Wikipedia dump file? -- You received this message because you are subscribed to the Google Groups "mwlib" group. To view this discussion on the web visit https://groups.google.com/d/msg/mwlib/-/hFlLG_QmGJgJ. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/mwlib?hl=en.
