"Martin v. Löwis" <[EMAIL PROTECTED]> writes: > It is somewhat tricky to implement a normalization algorithm in > UTF-16, since you must combine surrogate pairs first in order to > find out what the canonical decomposition of the code point is; > but it's just more code, and no problem in principle.
The same issue is with virtually any algorithm: more code, more complex code is needed with UTF-16 than with UTF-32. -- __("< Marcin Kowalczyk \__/ [EMAIL PROTECTED] ^^ http://qrnik.knm.org.pl/~qrczak/ _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com