Make that last link to the FAQ http://www.unicode.org/faq/normalization.html
On Friday, January 17, 2014 11:34:44 PM UTC-6, Marcus Urban wrote: > > I'm not sure whether people are using "canonicalize" in the generic sense > or if they mean canonical mappings as defined by the Unicode standard. Just > to be clear, the initial issue raised about U+00B5 MICRO SIGN versus U+03BC > GREEK SMALL LETTER MU would not be fixed by a canonical decomposition. > However, U+00B5 does have a compatibility decomposition to U+03BC. > > The official definitions are given in > http://www.unicode.org/versions/Unicode6.2.0/ch03.pdf, and some relevant > suggestions about handling identifiers in the context of Unicode are in > http://www.unicode.org/versions/Unicode6.2.0/ch03.pdf. > > On Friday, January 17, 2014 3:58:18 PM UTC-6, Ivar Nesje wrote: >> >> +10 for automatic canoncialization. If we could have a optional warning >> if canonicalization is needed for travis to barf at, that would be great >> too. >> >> kl. 22:22:02 UTC+1 fredag 17. januar 2014 skrev Steven G. Johnson >> følgende: >>> >>> I opened an issue for this: >>> >>> https://github.com/JuliaLang/julia/issues/5434 >>> >>> My preference would be for Julia to silently canonicalize all homoglyphs >>> in identifiers (rather than issuing a warning or whatever). >>> >>
