Danny B. wrote:
> Hello,
> 
> I'm looking for any kind of tool which would take the XML dump (most probably 
> the pages-meta-current.xml.bz2, at least the pages-articles.xml.bz2) and 
> would return the list of page titles (or alternatively/configurably page ids) 
> of pages containing given string.
> 
> Does anybody have such (kind of) tool and is willing to share? Both command 
> line or webpage interface are OK.
> 
> Thank you.
> 
> 
> Danny B.

I have one program to do so, dated 3 years ago. Many of us have probably
rewritten that wheel. I'll send you privately.
There's also http://meta.wikimedia.org/wiki/User:Micke/WikiFind
Not the ultimate solution, but good enough. I have some changes to unify
both versions if you're interested.

_______________________________________________
Toolserver-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/toolserver-l

Reply via email to