https://bugzilla.wikimedia.org/show_bug.cgi?id=18414
Summary: The all-titles-in-ns0 list for enwiki contains some
weird stuff
Product: Wikimedia
Version: unspecified
Platform: All
URL: http://download.wikimedia.org/enwiki/latest/enwiki-
latest-all-titles-in-ns0.gz
OS/Version: All
Status: NEW
Severity: enhancement
Priority: Normal
Component: Downloads
AssignedTo: [email protected]
ReportedBy: [email protected]
CC: [email protected]
The all-titles-in-ns0 list for enwiki:
http://download.wikimedia.org/enwiki/latest/enwiki-latest-all-titles-in-ns0.gz
contains some weird stuff:
AC\\DC_Lane,_Melbourne
A_ch\\'im_un_pinnara,_i_kangsan_ungum_e
Bill_Clinton\\
C:\\WINDOWS
E\\I
.
.
.
It's mainly an escaping issue, as it seems [e.g.: \' etc.].
Caused by maintenance script(s):
Broken//\\x2e
Broken/File\\x3a
Broken/S/\\x2e
Broken/\\xe2\\x80\\xad
Broken/\\xe2\\x80\\xae
Broken/Norsk_(bokmål)
Broken/Norsk_(nynorsk)
--
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l