I think Java uses UTF-16.

UTF-32 is a subset of UCS-4, I believe. My impression is that UTF-8 and
UCS-4 have less to do with UNICODE than UTF-16 and UTF-32.

I was using:
http://www.cl.cam.ac.uk/~mgk25/ucs/ISO-10646-UTF-8.html

Some other links for the curious:
http://czyborra.com/utf/
http://www.tldp.org/HOWTO/Unicode-HOWTO-1.html
http://www.cl.cam.ac.uk/~mgk25/unicode.html

Andrew, thanks for the answer. Personally I see no problem using UTF-8
internally, but I don't do piecetable work so it's not really my call. I
wasn't trying to preempt the decision; the new class is just a utility.

Frank

Francis James Franklin
[EMAIL PROTECTED]

"No, she really likes me. She told me I look like Britney Spears, and why
would you say that to somebody you don't like?"
                                                           --- Elle Woods


Reply via email to