David Hopwood <[EMAIL PROTECTED]> writes:

>> Right, except BOMs break tons of Unix applications (and even
>> occasional Windows ones) which do not expect them.
>
> This problem is overstated. A BOM anywhere in a text causes no
> problem with display, and *should* be treated as an ignorable
> character for searching, etc.

It is not ignorable in most file formats, and it is not automatically
ignored by reading functions of most programming languages.

> Note that there are plenty of other characters that should be
> treated as ignorable, so the applications that are broken for BOMs
> are broken more generally.

I disagree. UTF-8 BOM should not be used on Unix. It's not a reliable
method of encoding detection in general (applies only to Unicode),
and it breaks the simplicity of text streams.

-- 
   __("<         Marcin Kowalczyk
   \__/       [EMAIL PROTECTED]
    ^^     http://qrnik.knm.org.pl/~qrczak/
_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to