@Dave-Allured I don't know enough about the consequences of requiring a 
specific normalization. There is some [interesting 
information](http://unicode.org/faq/normalization.html) on the unicode website 
about normalization. Which suggests that [over 99% of unicode text on the web 
is already in 
NFC](http://www.macchiato.com/unicode/nfc-faq#TOC-How-much-text-is-already-NFC-).
 Also interesting is that combining NFC normalized strings may not result in a 
new string that is normalized. It is also stated in the FAQ that "Programs 
should always compare canonical-equivalent Unicode strings as equal", so it's 
probably not an issue as long as the controlled vocabulary attributes have 
values with code points in the  U+0000 and U+007F range (control chars 
excluded).

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/cf-convention/cf-conventions/issues/141#issuecomment-407542279

Reply via email to