David Hopwood <[EMAIL PROTECTED]> writes: >> Right, except BOMs break tons of Unix applications (and even >> occasional Windows ones) which do not expect them. > > This problem is overstated. A BOM anywhere in a text causes no > problem with display, and *should* be treated as an ignorable > character for searching, etc.
It is not ignorable in most file formats, and it is not automatically ignored by reading functions of most programming languages. > Note that there are plenty of other characters that should be > treated as ignorable, so the applications that are broken for BOMs > are broken more generally. I disagree. UTF-8 BOM should not be used on Unix. It's not a reliable method of encoding detection in general (applies only to Unicode), and it breaks the simplicity of text streams. -- __("< Marcin Kowalczyk \__/ [EMAIL PROTECTED] ^^ http://qrnik.knm.org.pl/~qrczak/ _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com