Georg Brandl schrieb: >>>>> b = (codecs.BOM_UTF8 + "hello").decode("utf-8") >>>>> len(a) >> 5 > > This behavior is questionable...
Indeed. Try py> b = (codecs.BOM_UTF8 + "hello").decode("utf-8-sig") py> len(b) 5 instead. Regards, Martin _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com