https://bugzilla.wikimedia.org/show_bug.cgi?id=18414

           Summary: The all-titles-in-ns0 list for enwiki contains some
                    weird stuff
           Product: Wikimedia
           Version: unspecified
          Platform: All
               URL: http://download.wikimedia.org/enwiki/latest/enwiki-
                    latest-all-titles-in-ns0.gz
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: Downloads
        AssignedTo: [email protected]
        ReportedBy: [email protected]
                CC: [email protected]


The all-titles-in-ns0 list for enwiki:
http://download.wikimedia.org/enwiki/latest/enwiki-latest-all-titles-in-ns0.gz

contains some weird stuff:

AC\\DC_Lane,_Melbourne
A_ch\\'im_un_pinnara,_i_kangsan_ungum_e
Bill_Clinton\\
C:\\WINDOWS
E\\I
.
.
.

It's mainly an escaping issue, as it seems [e.g.: \' etc.].
Caused by maintenance script(s):

Broken//\\x2e
Broken/File\\x3a
Broken/S/\\x2e
Broken/\\xe2\\x80\\xad
Broken/\\xe2\\x80\\xae
Broken/Norsk_(bokmål)
Broken/Norsk_(nynorsk)


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to