In <[email protected]>, on 09/23/2013
at 04:56 PM, Charles Mills <[email protected]> said:
>"Unicode" is not a character set (or "format") -- it's a whole family
>of character sets.
Not really; it's a 32-bit character set with a 16-bit subset (UCS-2)
for the BMP. UTF-8 and UTF-16 are not character sets, but, rather,
transforms for representing Unicode characters in bytes smaller than
32 bits.
--
Shmuel (Seymour J.) Metz, SysProg and JOAT
Atid/2 <http://patriot.net/~shmuel>
We don't care. We don't have to care, we're Congress.
(S877: The Shut up and Eat Your spam act of 2003)
----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN