Philippe Verdy <verdy underscore p at wanadoo dot fr> wrote: >>> Unicode defines only 4 *standard* normalization forms (NFC, NFD, >>> NFKC, NFKD), but other *non-standard* normalization forms are >>> possible: >> >> But should not be used. It can be tricky enough getting the four >> standard ones right as it is. > > Wrong. Non-standard normalization forms are useful too, and can even > be safe if they preserve one of the two standard equivalences > (canonical or compatibility).
I agree that non-standard normalization forms may have benefits, as in your Korean example. I respectfully disagree that they should be used. IMHO, the potential for confusion and lack of interoperability is greater than the benefit. -Doug Ewell Fullerton, California http://users.adelphia.net/~dewell/

