Just to clarify, Unicode defines characters with scalar values in the range 0x0000...0x10FFFF. How these values get encoded is a separate issue. There are many encodings, including:> A couple questions about multibyte strings...Do all the characters in the same encoding use the same number of bytespercharacter, or is it possible for an encoding to use 3 bytes for one character, 1 for another, etc?Unicode (not used on Palm) uses constant char length 2
UTF-8 One to four bytes per character.
UTF-16 One or two 16-bit values per character.
UTF-32 One 32-bit value per character.
Many people think of Unicode in terms of the older, deprecated UCS-2 encoding, where every character consists of a single 16-bit word. But since the range of Unicode scalar values now exceeds 64K, this is no longer sufficient, thus the creation of UTF-16.
-- Ken
--
Ken Krugler
TransPac Software, Inc.
<http://www.transpac.com>
+1 530-470-9200
--
For information on using the Palm Developer Forums, or to unsubscribe, please see http://www.palmos.com/dev/support/forums/
