https://bugzilla.wikimedia.org/show_bug.cgi?id=27987

           Summary: Function ccnorm shouldn't convert "I" and "L" to "1",
                    "O" to "0" and "S" to "5"
           Product: MediaWiki
           Version: 1.17
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: General/Unknown
        AssignedTo: [email protected]
        ReportedBy: [email protected]


Currently the result of 
------------------------------------------------
 ccnorm("ABCDEFGHIJKLMNOPQRSTUVWXYZ_abcdefghijklmnopqrstuwxyz")
------------------------------------------------

is
------------------------------------------------
 ABCDEFGH1JK1MN0PQR5TUVWXYZ_ABCDEFGH1JK1MN0PQR5TUWXYZ
------------------------------------------------

This makes the creation of filters on [[Special:AbuseFilter]] not intuitive,
since if we want to catch all variations of a word like "testing" and try to
use something like
------------------------------------------------
 words :="TESTING|VANDALIZING";
 ccnorm(added_lines) rlike words)
 & !(ccnorm(removed_lines) rlike words)
------------------------------------------------

it won't work. Instead of this natural approach, the text would need to be
changed to
------------------------------------------------
 words :="TE5T1NG|VANDA11Z1NG";
------------------------------------------------


You can confirm the problem on [[Special:AbuseFilter/tools]], by using the
following:
------------------------------------------------
 words :="TESTING|VANDALIZING";
 ccnorm("I'm testing here. I'm vandalizing the article!") rlike words
------------------------------------------------

The regex above will not match, but it will match in the following:
------------------------------------------------
 words := "TE5T1NG|VANDA11Z1NG";
 ccnorm("I'm testing here. I'm vandalizing the article!") rlike words
------------------------------------------------

Could this be fixed?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to