I have the same problem as Danny. My problem with AWB (which is itself free software) is that it depends on the non-free .NET library and runs only on Windows. A Python command-line tool would be perfect.
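
To make the request concrete, here is a minimal sketch of the kind of tool Danny describes, using only the Python standard library. It is an illustration under stated assumptions, not an existing tool: the names `find_pages`, `local` and `main` are hypothetical, the match is a plain case-sensitive substring test, and it assumes the usual export layout in which each <page> holds a <title>, then the page <id>, then the <revision> with its <text>. I have not run it against a full dump.

```python
"""Sketch: list titles (or page ids) of pages in a MediaWiki XML dump
whose text contains a given string. All names here are hypothetical."""
import argparse
import bz2
import xml.etree.ElementTree as ET


def local(tag):
    """Drop the '{namespace}' prefix so any export schema version matches."""
    return tag.rsplit("}", 1)[-1]


def find_pages(stream, needle, want_ids=False):
    """Yield the title (or page id) of each page whose text contains needle."""
    context = ET.iterparse(stream, events=("start", "end"))
    _, root = next(context)  # the first event is the start of the root element
    title = page_id = None
    matched = False
    for event, elem in context:
        if event != "end":
            continue
        name = local(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "id" and page_id is None:
            # Assumption: the first <id> inside a page is the page id;
            # later ones belong to the revision and the contributor.
            page_id = elem.text
        elif name == "text" and needle in (elem.text or ""):
            matched = True
        elif name == "page":
            if matched:
                yield page_id if want_ids else title
            title = page_id = None
            matched = False
            root.clear()  # discard finished pages so memory stays bounded


def main(argv=None):
    parser = argparse.ArgumentParser(description=__doc__)
    parser.add_argument("dump", help="e.g. pages-articles.xml.bz2")
    parser.add_argument("needle", help="string to search for")
    parser.add_argument("--ids", action="store_true",
                        help="print page ids instead of titles")
    args = parser.parse_args(argv)
    # bz2.open decompresses on the fly, so the dump is never unpacked to disk.
    with bz2.open(args.dump, "rb") as stream:
        for hit in find_pages(stream, args.needle, args.ids):
            print(hit)
```

Calling `main()` under the usual `if __name__ == "__main__":` guard would turn this into a command taking the dump path and the search string, with `--ids` switching the output to page ids. The parse is streamed, so a dump never needs to fit in memory, but a single pass over a large dump will still be slow.
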

2009/7/25 John Doe <[email protected]>:
> awb does but wont work on the ts
>
> On Sat, Jul 25, 2009 at 8:46 AM, Simon Walker <[email protected]> wrote:
>>
>> Does AWB not do something along those lines?
>>
>> 2009/7/25 Danny B. <[email protected]>
>>>
>>> Hello,
>>>
>>> I'm looking for any kind of tool which would take the XML dump (most
>>> probably the pages-meta-current.xml.bz2, at least the
>>> pages-articles.xml.bz2) and would return the list of page titles (or
>>> alternatively/configurably page ids) of pages containing given string.
>>>
>>> Does anybody have such (kind of) tool and is willing to share? Both
>>> command line or webpage interface are OK.
>>>
>>> Thank you.
>>>
>>>
>>> Danny B.
>>
>> --
>> Regards,
>>
>> Simon Walker
>> User:Stwalkerster on all public Wikimedia Foundation wikis
>> Administrator on the English Wikipedia
>> Developer of Helpmebot, the ACC tool, and Nubio 2 FAQ repository

--
OsamaKhalid
_______________________________________________
Toolserver-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
