Michael Urman wrote: > On 9/7/06, David Hopwood <[EMAIL PROTECTED]> wrote: > >>Yes. However, this is not a good idea for precisely the reason described >>on that page (false detection of Unicode), and so any Unicode detection >>algorithm in Python should only be based on detecting a BOM, IMHO. > > Right, except BOMs break tons of Unix applications (and even > occasional Windows ones) which do not expect them.
This problem is overstated. A BOM anywhere in a text causes no problem with display, and *should* be treated as an ignorable character for searching, etc. Note that there are plenty of other characters that should be treated as ignorable, so the applications that are broken for BOMs are broken more generally. -- David Hopwood <[EMAIL PROTECTED]> _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com