Tim Lesher sagte:

> Here's the first draft of the python-dev summary for the first half of April. 
>  Please send any corrections or suggestions to
> the summarizers.
> [...]
> ----------------------------------------
> Unicode byte order mark decoding
> ----------------------------------------
>
> Evan Jones saw that the UTF-16 decoder discards the byte-order mark (BOM) 
> from Unicode files, while the UTF-8 decoder
> doesn't. Although the BOM isn't really required in UTF-8 files, many 
> Unicode-generating applications, especially on Microsoft
> platforms, add it.
>
> Walter Dörwald created a patch_ to add a UTF-8-Sig codec that generates a BOM 
> on writing and skips it on reading, but after a
> long discussion on the history of the Unicode, Microsoft's influence over its
> evolution, the consensus was that BOM and signature handling belong at a 
> higher level (for example, a stream API) than the
> codec.

All codecs provide a stream API, so there is no higher level.

Bye,
   Walter Dörwald




_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

Reply via email to