https://bugzilla.wikimedia.org/show_bug.cgi?id=28460

             Bug #: 28460
           Summary: invisible character in url leads to "no article" page
           Product: MediaWiki
           Version: wikimedia-deployment
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: Normal
         Component: Redirects
        AssignedTo: [email protected]
        ReportedBy: [email protected]
    Classification: Unclassified


Created attachment 8387
  --> https://bugzilla.wikimedia.org/attachment.cgi?id=8387
example of invisible character in wikipedia url

An example:
http://en.wikipedia.org/wiki/Horsesho%E2%80%8Be_orbit

If this link is pasted into Wikipedia, it fails. The character which is
decoding here as "E2 80 8B" is normally invisible. It appears to be the UTF-8
character "zero width space". Since it has no syntactical value, shouldn't such
characters simply be removed from pasted urls? I don't know how the spurious
character got into the URL in the first place, but surely any invisible
characters ought to be removed by the parser, right?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to