On 08-Feb-2004, max <[EMAIL PROTECTED]> wrote:
> Hi Gabor,
> I think you're confused. Characters in .NET are 16 bits BECAUSE they are 
> unicode. 16 bits = 2 bytes = 65536 values.

No, Gabor is not confused.  Unicode has grown.  It is now 20 bits, not 16.
See for example <http://www.terena.nl/library/multiling/unicode/utf16.html>
(which I just found by googling; it looks a bit out-of-date).

Unfortunately Windows, Java, and .NET all use 16-bit characters.
That means that they must either (a) use UCS-2 encoding, i.e.
don't support the new unicode characters such as "OLD ITALIC LETTER A";
or (b) use UTF-16 encoding, which means that these characters which
don't fit in 16 bits get represented as a pair of 16-bit codes.

-- 
Fergus Henderson <[EMAIL PROTECTED]>  |  "I have always known that the pursuit
The University of Melbourne         |  of excellence is a lethal habit"
WWW: <http://www.cs.mu.oz.au/~fjh>  |     -- the last words of T. S. Garp.
_______________________________________________
Mono-list maillist  -  [EMAIL PROTECTED]
http://lists.ximian.com/mailman/listinfo/mono-list

Reply via email to