Re: Does "endian-ness" apply to UTF-8 characters that use multiple bytes?

2019-02-04 Thread Doug Ewell via Unicode
http://www.unicode.org/faq/utf_bom.html#utf8-2 -- Doug Ewell | Thornton, CO, US | ewellic.org

Re: Does "endian-ness" apply to UTF-8 characters that use multiple bytes?

2019-02-04 Thread James Tauber via Unicode
Endian-ness only affects ordering of bytes within a code unit. Because UTF-8 has single byte code units, the order is not affected by endian-ness, only the UTF-8 bit mapping itself. Note also that endian-ness only affects individual 16-bit code units in UTF-16. If you have a surrogate pair,

Re: Does "endian-ness" apply to UTF-8 characters that use multiple bytes?

2019-02-04 Thread Clive Hohberger via Unicode
Asmus, I believe it also applies to the bit order in the bytes I believe UTF-16 and UTF-32 are transmitted as single 16 or 32-bit numbers. UTF-8 is a stream of 8-bit numbers Clive *Clive P. Hohberger, PhD MBA* Managing Director Clive Hohberger, LLC +1 847 910 8794 cp...@case.edu *Inventor of

Re: Does "endian-ness" apply to UTF-8 characters that use multiple bytes?

2019-02-04 Thread Asmus Freytag via Unicode
On 2/4/2019 11:21 AM, Costello, Roger L. via Unicode wrote: Hello Unicode Experts! As I understand it, endian-ness applies to multi-byte words. Endian-ness does not apply to ASCII characters because each character is a single byte. Endian-ness does apply to

Does "endian-ness" apply to UTF-8 characters that use multiple bytes?

2019-02-04 Thread Costello, Roger L. via Unicode
Hello Unicode Experts! As I understand it, endian-ness applies to multi-byte words. Endian-ness does not apply to ASCII characters because each character is a single byte. Endian-ness does apply to UTF-16BE (Big-Endian), UTF-16LE (Little-Endian), UTF-32BE and UTF32-LE because each character