I'm not sure whether people are using "canonicalize" in the generic sense 
or whether they mean canonical mappings as defined by the Unicode standard. 
Just to be clear, the initial issue raised about U+00B5 MICRO SIGN versus 
U+03BC GREEK SMALL LETTER MU would not be fixed by a canonical 
decomposition. However, U+00B5 does have a compatibility decomposition to 
U+03BC, so only a compatibility normalization (NFKC or NFKD) would map the 
two to the same character.
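
To make the distinction concrete, here is a minimal sketch using Python's 
standard unicodedata module (Python is chosen only for illustration, and 
the helper name normalized_forms is my own; this says nothing about how 
Julia would or should implement it). It applies each of the four 
normalization forms to U+00B5 and reports the resulting code point.

```python
import unicodedata

MICRO = "\u00b5"  # U+00B5 MICRO SIGN
MU = "\u03bc"     # U+03BC GREEK SMALL LETTER MU


def normalized_forms(ch):
    """Return a dict mapping each normalization form name to its result for ch."""
    return {
        form: unicodedata.normalize(form, ch)
        for form in ("NFC", "NFD", "NFKC", "NFKD")
    }


forms = normalized_forms(MICRO)
for form, result in forms.items():
    # Show the code point(s) each form produces for the micro sign.
    print(form, " ".join("U+%04X" % ord(c) for c in result))

# The raw decomposition field from the Unicode Character Database;
# a leading <compat> tag marks a compatibility (not canonical) mapping.
print(repr(unicodedata.decomposition(MICRO)))
```

The canonical forms (NFC, NFD) leave U+00B5 unchanged, while the 
compatibility forms (NFKC, NFKD) map it to U+03BC.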

The official definitions are given 
in http://www.unicode.org/versions/Unicode6.2.0/ch03.pdf, and some relevant 
suggestions about handling identifiers in the context of Unicode are 
in http://www.unicode.org/versions/Unicode6.2.0/ch03.pdf.

On Friday, January 17, 2014 3:58:18 PM UTC-6, Ivar Nesje wrote:
>
> +10 for automatic canonicalization. If we could have an optional warning 
> when canonicalization is needed, for Travis to barf at, that would be 
> great too. 
>
> On Friday, January 17, 2014 at 22:22:02 UTC+1, Steven G. Johnson wrote:
>>
>> I opened an issue for this:
>>
>>    https://github.com/JuliaLang/julia/issues/5434
>>
>> My preference would be for Julia to silently canonicalize all homoglyphs 
>> in identifiers (rather than issuing a warning or whatever).
>>
>
