Anyone wanting a list of redirects to work on with pywikibot could do this kludge
curl 'http:/wiki/index.php?title=Special:ListRedirects&limit=5000&offset=0' | grep mw-redirect temp | sed -Ee s'`.*">([^<]*)</a>.*.*">([^<]*)</a>.*`\1`' On Sat, 29 Aug 2020 at 12:27, <[email protected]> wrote: > > Hi all, > > -start parameter does not include redirects. > Unfortunately -redirectonly option wasn’t ported from compat. > > Best > xqt > > > Am 29.08.2020 um 12:42 schrieb John Bray <[email protected]>: > > > > The problem gets worse > > > > pwb.py listpages -ns:0 -start -grep:"REDIRECT" -format:"{page.title}" > > > > won't spot files with #REDIRECT, and > > > > pwb.py listpages -ns:0 -start -format:"{page.title}" > > > > won't list pages with just a #DIRECT line, though it will if they have > > extra text > > > > pwb.py replace -page:"David" "#REDIRECT" "@redirect" works > > > > pwb.py replace -start "REDIRECT" "@redirect" will ignore the page > > David if it just contains the # line > > > > So whatever pywikibot does for page selection, a #REDIRECT line upsets it > > > >> On Sat, 29 Aug 2020 at 10:31, John Bray <[email protected]> wrote: > >> > >> Escaping the # makes no difference > >> > >> pwb.py listpages -ns:0 -start -grep:"\#[Rr]" -format:"{page.title}" > >> > >> only returns pages with # in the body of URLs > >> > >>> On Wed, 19 Aug 2020 at 10:48, Strainu <[email protected]> wrote: > >>> > >>> '#' is special in python, maybe the regex gets post-processed into > >>> becoming a comment? Try escaping it. > >>> > >>> Strainu > >>> > >>> Pe marți, 18 august 2020, John Bray <[email protected]> a scris: > >>>> > >>>> I'd like to get all the redirect pages out of a wiki, but > >>>> > >>>> pwb.py listpages -start:UK -grep:'#REDIRECT' -format:"{page.title}" -get > >>>> > >>>> produces nothing, but > >>>> > >>>> pwb.py listpages -start:UK -grep:'United Kingdom' -format:"{page.title}" > >>>> -get > >>>> > >>>> produces, as expected > >>>> > >>>> UK > >>>> #REDIRECT [[United Kingdom]] > >>>> > >>>> pwb.py listpages -start:UK -grep:'#' -format:"{page.title}" -get > >>>> > >>>> picks up pages with a # in them, but not any of the #REDIRECTs > >>>> > >>>> why isn't -grep just parsing the page fully? > >>>> > >>>> John > >>>> > >>>> _______________________________________________ > >>>> pywikibot mailing list > >>>> [email protected] > >>>> https://lists.wikimedia.org/mailman/listinfo/pywikibot > >>> > >>> _______________________________________________ > >>> pywikibot mailing list > >>> [email protected] > >>> https://lists.wikimedia.org/mailman/listinfo/pywikibot > > > > _______________________________________________ > > pywikibot mailing list > > [email protected] > > https://lists.wikimedia.org/mailman/listinfo/pywikibot > > > _______________________________________________ > pywikibot mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/pywikibot _______________________________________________ pywikibot mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/pywikibot
