Am 24.07.2012 17:01, schrieb [email protected]:
reader = codecs.getreader(encoding) lines = [] with open(filename, 'rb') as f: lines = reader(f, 'strict').readlines(keepends=False)where encoding == 'utf-16-be' Everything works fine, except that lines[0] is equal to codecs.BOM_UTF16_BE Is this behaviour correct, that the BOM is still present?
Yes, assuming the first line only contains that BOM. Technically it's a space character, and why should those be removed?
Uli -- http://mail.python.org/mailman/listinfo/python-list
