On Sun, May 18, 2008 at 8:44 PM, Tony Mechelynck
<[EMAIL PROTECTED]> wrote:

>> Note, that you probably do not want to use BOM with UTF-8.
>> See http://unicode.org/faq/utf_bom.html#29 (Q: Can a UTF-8 data stream
>> contain the BOM character (in UTF-8 form)? If yes, then can I still
>> assume the remaining UTF-8 bytes are in big-endian order?)

> The BOM can also be used in UTF-8, not to determine endianness (which is
> not relevant for UTF-8 -- one could argue that UTF-8 is always
> big-endian) but to distinguish UTF-8 from other encodings including
> UTF-16 and UTF-32.

How can you argue that?  UTF-8 is neither big-endian nor
little-endian.  It's just a sequence of 8-bit bytes.

--~--~---------~--~----~------------~-------~--~----~
You received this message from the "vim_dev" maillist.
For more information, visit http://www.vim.org/maillist.php
-~----------~----~----~----~------~----~------~--~---

Raspunde prin e-mail lui