jayvdb created this task.
jayvdb added a subscriber: jayvdb.
jayvdb added a project: Pywikibot-weblinkchecker.py.
Herald added subscribers: pywikibot-bugs-list, StudiesWorld, Aklapper.
TASK DESCRIPTION
weblinkchecker has an ignore list, which includes web archives and RFC 2606
domains, but it also includes several other websites for which the reasons are
unclear or no longer current. At least one entry is invalid.
```
re.compile(r'.*[\./@]berlinonline\.de(/.*)?'),
# above entry to be manually fixed per request at
# [[de:Benutzer:BLueFiSH.as/BZ]]
# bot somehow can't handle their redirects:
re.compile(r'.*[\./@]gso\.gbv\.de(/.*)?'),
# bot can't handle their redirects:
re.compile(r'.*[\./@]bodo\.kommune\.no(/.*)?'),
re.compile(r'.*[\./@]jpl\.nasa\.gov(/.*)?'), # bot rejected on the site
re.compile(r'.*[\./@]itis\.gov(/.*)?'), # bot rejected on the site
```
See {T57282}:
```
re.compile(r'.*[\./@]cev\.lu(/.*)?'), # bot rejected on the site
```
```
# very slow response resulting in bot error:
re.compile(r'.*[\./@]science\.ksc\.nasa\.gov(/.*)?'),
```
4db982d:
```
# HTTP redirect loop
re.compile(r'.*[\./@]britannica\.com(/.*)?'),
# bot rejected on the site:
re.compile(r'.*[\./@]quickfacts\.census\.gov(/.*)?'),
```
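For anyone re-checking these entries, here is a minimal sketch of how patterns of this shape behave when tested against a URL. The helper name `is_ignored` and the use of `re.match` are assumptions for illustration, not necessarily how weblinkchecker applies the list.
```python
import re

# A few entries copied from the excerpts above; the real list is longer.
ignorelist = [
    re.compile(r'.*[\./@]britannica\.com(/.*)?'),
    re.compile(r'.*[\./@]jpl\.nasa\.gov(/.*)?'),
    re.compile(r'.*[\./@]cev\.lu(/.*)?'),
]


def is_ignored(url, patterns=ignorelist):
    """Hypothetical helper: True if any pattern matches from the URL start."""
    return any(p.match(url) for p in patterns)


for url in ('https://www.britannica.com/topic/x',
            'http://britannica.com/',
            'https://example.org/'):
    print(url, is_ignored(url))
```
Under this assumption the match is anchored only at the start of the URL, so a pattern also matches any URL that continues after the domain. That may be worth keeping in mind when deciding which entries are still valid.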
TASK DETAIL
https://phabricator.wikimedia.org/T124015
To: jayvdb
Cc: Aklapper, StudiesWorld, jayvdb, pywikibot-bugs-list