2012/1/15 Eric K <[email protected]>

> [For others: trying to list all page titles that contain any characters,
> *other than*: alphanumeric, spaces, underscores, dashes]
>
> I'm seeing the same. I went to this regex chat and asked them for a regex,
> which worked on the regex tester http://regexpal.com/ but not on the bot
> :-(.
>
> python pagegenerators.py -titleregex:.*[^\w\s-].* ... - [1]
> #1 hits all pages including ones that only have that set of characters,
> for example its also showing up "Apple".
>
This works for me once the pattern is wrapped in quotation marks (without them the shell may alter the pattern before the bot sees it):

-titleregex:".*[^\w\s-].*"

But it will also list titles that contain non-English letters, because \w does not match them.

-- 
Bináris
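To see what the quoted pattern does and does not catch, here is a minimal sketch in plain Python, outside the bot. The helper name has_unusual_char and the sample titles are made up for illustration, and the use of re.match is an assumption about how the pattern is applied, not a statement about the bot's internals. The re.ASCII flag is used here to reproduce the behaviour described above, where \w covers only ASCII letters, digits and the underscore; with Unicode-aware matching, accented letters count as word characters and are no longer reported.

```python
import re

# Pattern from the thread: the title contains at least one character that is
# not a word character (\w), not whitespace (\s) and not a dash.
PATTERN = r".*[^\w\s-].*"


def has_unusual_char(title, ascii_only=True):
    """Hypothetical helper: True if the title matches PATTERN.

    ascii_only=True restricts \\w to [a-zA-Z0-9_] (assumed to mirror the
    behaviour described in the thread); ascii_only=False uses Python 3's
    default Unicode-aware \\w.
    """
    flags = re.ASCII if ascii_only else 0
    return re.match(PATTERN, title, flags) is not None


if __name__ == "__main__":
    # Sample titles are invented for this sketch.
    for title in ["Apple", "Foo_bar-baz 2", "C++", "Bináris"]:
        print(title, has_unusual_char(title), has_unusual_char(title, ascii_only=False))
```

If titles with non-English letters should be left out of the list, Unicode-aware matching (the ascii_only=False case in this sketch) is one way to get that; whether the bot can be told to match that way is not something this sketch answers.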
_______________________________________________
Pywikipedia-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
