John S. Yates, Jr. schrieb: > It is a mistake on Microsoft's part to fail to strip the BOM > during conversion to UTF-8. There is no MEANINGFUL definition > of BOM in a UTF-8 string.
That's not true. See http://unicode.org/faq/utf_bom.html#23 http://unicode.org/faq/utf_bom.html#29 The BOM can also serve as an encoding marker. I refer to the BOM encoded in UTF-8 as "UTF-8 signature". As such, it is very meaningful. Usage of the BOM in UTF-8-encoded text is deliberate. Regards, Martin _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com