http://www.unicode.org/faq/utf_bom.html#utf8-2
--
Doug Ewell | Thornton, CO, US | ewellic.org
Endian-ness only affects ordering of bytes within a code unit.
Because UTF-8 has single byte code units, the order is not affected by
endian-ness, only the UTF-8 bit mapping itself.
Note also that endian-ness only affects individual 16-bit code units in
UTF-16. If you have a surrogate pair,
Asmus,
I believe it also applies to the bit order in the bytes
I believe UTF-16 and UTF-32 are transmitted as single 16 or 32-bit numbers.
UTF-8 is a stream of 8-bit numbers
Clive
*Clive P. Hohberger, PhD MBA*
Managing Director
Clive Hohberger, LLC
+1 847 910 8794
cp...@case.edu
*Inventor of
On 2/4/2019 11:21 AM, Costello, Roger
L. via Unicode wrote:
Hello Unicode Experts!
As I understand it, endian-ness applies to multi-byte words.
Endian-ness does not apply to ASCII characters because each character is a single byte.
Endian-ness does apply to
Hello Unicode Experts!
As I understand it, endian-ness applies to multi-byte words.
Endian-ness does not apply to ASCII characters because each character is a
single byte.
Endian-ness does apply to UTF-16BE (Big-Endian), UTF-16LE (Little-Endian),
UTF-32BE and UTF32-LE because each character
5 matches
Mail list logo