2012/1/15 Eric K <[email protected]>

> [For others: trying to list all page titles that contain any characters,
> *other than*: alphanumeric, spaces, underscores, dashes]
>
> I'm seeing the same. I went to this regex chat and asked them for a regex,
> which worked on the regex tester http://regexpal.com/ but not on the bot
> :-(.
>
> python pagegenerators.py -titleregex:.*[^\w\s-].*            ...  - [1]
> #1 hits all pages, including ones that contain only that set of characters;
> for example, it's also showing up "Apple".
>

This works for me once the pattern is wrapped in quotation marks:
-titleregex:".*[^\w\s-].*"
However, it will also list titles that contain non-English letters, because
\w does not match them.
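
To see both effects outside the bot, here is a minimal sketch using Python's
re module directly. It assumes the bot applies the pattern roughly the way
re.match does, and that \w is ASCII-only there (re.ASCII reproduces that in
Python 3). The sample titles and the helper name matching_titles are made up
for illustration and are not part of pagegenerators.py.

```python
import re

# Pattern from this thread: a title containing at least one character that
# is not a word character, whitespace, or a dash.
PATTERN = r".*[^\w\s-].*"

# Hypothetical sample titles, not taken from any real wiki.
TITLES = ["Apple", "Apple pie", "Apple_pie-2", "Apple (fruit)", "Bináris"]


def matching_titles(titles, flags=0):
    """Return the titles the pattern matches (hypothetical helper)."""
    regex = re.compile(PATTERN, flags)
    return [title for title in titles if regex.match(title)]


# ASCII-only \w: accented letters count as "other" characters.
ascii_hits = matching_titles(TITLES, re.ASCII)

# Unicode-aware \w (the Python 3 default for str patterns).
unicode_hits = matching_titles(TITLES)

print(ascii_hits)    # ['Apple (fruit)', 'Bináris']
print(unicode_hits)  # ['Apple (fruit)']
```

"Apple" is not matched in either mode, which is the behaviour I get with the
quoted pattern. If the accented titles are unwanted, a Unicode-aware \w is
one possible way out, but I have not checked whether the bot offers that.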

-- 
Bináris
_______________________________________________
Pywikipedia-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
