On 6/8/07, Rauli Ruohonen <[EMAIL PROTECTED]> wrote: > The additional field is 8 bits, two bits for each normalization (a > Yes/Maybe/No value). In Unicode 4.1 only 5 different combinations are > used, but I don't know if that's true of later versions.
There are no "Maybe" values for the Decomposed forms. It is impossible to be Compatibility without also being Canonical. (The definition of Compatibility includes folding as much as possible under either form.) So there are really 3 possibilities (both, canonical only, neither) for the decomposed, and (at most) 6 for the composed forms. (I'm not sure all 6 of those can occur in practice.) But there are other normalization forms that may be added later. The ones I found reference to are basically orthogonal (an existing normalization may or may not meet them). See the proposed changes at http://www.unicode.org/reports/tr15/tr15-28.html -jJ _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com