Escaping the # makes no difference
pwb.py listpages -ns:0 -start -grep:"\#[Rr]" -format:"{page.title}"
only returns pages with # in the body of URLs
On Wed, 19 Aug 2020 at 10:48, Strainu <[email protected]> wrote:
>
> '#' is special in python, maybe the regex gets post-processed into becoming a
> comment? Try escaping it.
>
> Strainu
>
> Pe marți, 18 august 2020, John Bray <[email protected]> a scris:
>>
>> I'd like to get all the redirect pages out of a wiki, but
>>
>> pwb.py listpages -start:UK -grep:'#REDIRECT' -format:"{page.title}" -get
>>
>> produces nothing, but
>>
>> pwb.py listpages -start:UK -grep:'United Kingdom' -format:"{page.title}" -get
>>
>> produces, as expected
>>
>> UK
>> #REDIRECT [[United Kingdom]]
>>
>> pwb.py listpages -start:UK -grep:'#' -format:"{page.title}" -get
>>
>> picks up pages with a # in them, but not any of the #REDIRECTs
>>
>> why isn't -grep just parsing the page fully?
>>
>> John
>>
>> _______________________________________________
>> pywikibot mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/pywikibot
>
> _______________________________________________
> pywikibot mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/pywikibot
_______________________________________________
pywikibot mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/pywikibot