jenkins-bot has submitted this change and it was merged.

Change subject: Remove old unneeded URL exclusion rules
......................................................................


Remove old unneeded URL exclusion rules

No problems are experienced now accessing these URLs
using requests.

Bug: T124015
Change-Id: I1f33969058fc67f8437f2f92325faba636a9c81c
---
M scripts/weblinkchecker.py
1 file changed, 1 insertion(+), 12 deletions(-)

Approvals:
  MtDu: Looks good to me, but someone else must approve
  Xqt: Looks good to me, approved
  jenkins-bot: Verified



diff --git a/scripts/weblinkchecker.py b/scripts/weblinkchecker.py
index 51bce05..ed22c3c 100755
--- a/scripts/weblinkchecker.py
+++ b/scripts/weblinkchecker.py
@@ -133,7 +133,6 @@
 
 import requests
 
-# TODO: Convert to httlib2
 if sys.version_info[0] > 2:
     import http.client as httplib
     import urllib.parse as urlparse
@@ -164,20 +163,10 @@
     re.compile(r'.*[\./@]example\.org(/.*)?'),
 
     # Other special cases
-    # bot somehow can't handle their redirects:
-    re.compile(r'.*[\./@]gso\.gbv\.de(/.*)?'),
     re.compile(r'.*[\./@]berlinonline\.de(/.*)?'),
     # above entry to be manually fixed per request at 
[[de:Benutzer:BLueFiSH.as/BZ]]
     # bot can't handle their redirects:
-    re.compile(r'.*[\./@]bodo\.kommune\.no(/.*)?'),
-    re.compile(r'.*[\./@]jpl\.nasa\.gov(/.*)?'),  # bot rejected on the site
-    re.compile(r'.*[\./@]itis\.gov(/.*)?'),  # bot rejected on the site
-    re.compile(r'.*[\./@]cev\.lu(/.*)?'),  # bot rejected on the site
-    # very slow response resulting in bot error:
-    re.compile(r'.*[\./@]science\.ksc\.nasa\.gov(/.*)?'),
-    re.compile(r'.*[\./@]britannica\.com(/.*)?'),  # HTTP redirect loop
-    # bot rejected on the site:
-    re.compile(r'.*[\./@]quickfacts\.census\.gov(/.*)?'),
+
     # bot rejected on the site, already archived
     re.compile(r'.*[\./@]web\.archive\.org(/.*)?'),
 ]

-- 
To view, visit https://gerrit.wikimedia.org/r/265442
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: I1f33969058fc67f8437f2f92325faba636a9c81c
Gerrit-PatchSet: 2
Gerrit-Project: pywikibot/core
Gerrit-Branch: master
Gerrit-Owner: John Vandenberg <[email protected]>
Gerrit-Reviewer: MtDu <[email protected]>
Gerrit-Reviewer: Xqt <[email protected]>
Gerrit-Reviewer: jenkins-bot <>

_______________________________________________
MediaWiki-commits mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits

Reply via email to