On Tue, 28 Aug 2012 11:29:48 -0500, Kirk Wolf wrote:

>From:
>http://java.sun.com/javase/technologies/core/basic/intl/faq.jsp#core-textrep
>
> The primitive data type char in the Java programming language is an
>unsigned 16-bit integer that can represent a Unicode code point in the
>range U+0000 to U+FFFF, or the code units of
>UTF-16<http://java.sun.com/javase/technologies/core/basic/intl/faq.jsp#utf-16>
>.
Which does describe UTF-16 as a variable-length encoding of which
Java uses a subset.

>Confusing eh?
>I guess you would call what Java uses internally a UTF-16 subset.
>
>So, technically not UTF-16, but practically UTF-16  (a two-byte UTF-16
>subset)
> 
That's more like UCS-2.  Big-endian or little-endian?

-- gil

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN

Reply via email to