>> Say you have a Latin-1-encoded source code. Is that in NFC or not? > > Doesn't that depend on whether they happened to ever write some of the > combined characters (such as ö) using a two-character form like o¨?
No. Latin-1 does not support that form; the concept does not exist in that encoding. When converting to an UCS representation, it's the codec's choice to either produce a pre-composed or decomposed form. Regards, Martin _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com