On Tue, 28 Aug 2012 11:29:48 -0500, Kirk Wolf wrote: >From: >http://java.sun.com/javase/technologies/core/basic/intl/faq.jsp#core-textrep > > The primitive data type char in the Java programming language is an >unsigned 16-bit integer that can represent a Unicode code point in the >range U+0000 to U+FFFF, or the code units of >UTF-16<http://java.sun.com/javase/technologies/core/basic/intl/faq.jsp#utf-16> >. Which does describe UTF-16 as a variable-length encoding of which Java uses a subset.
>Confusing eh? >I guess you would call what Java uses internally a UTF-16 subset. > >So, technically not UTF-16, but practically UTF-16 (a two-byte UTF-16 >subset) > That's more like UCS-2. Big-endian or little-endian? -- gil ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN