I think Java uses UTF-16. UTF-32 is a subset of UCS-4, I believe. My impression is that UTF-8 and UCS-4 have less to do with UNICODE than UTF-16 and UTF-32.
I was using: http://www.cl.cam.ac.uk/~mgk25/ucs/ISO-10646-UTF-8.html Some other links for the curious: http://czyborra.com/utf/ http://www.tldp.org/HOWTO/Unicode-HOWTO-1.html http://www.cl.cam.ac.uk/~mgk25/unicode.html Andrew, thanks for the answer. Personally I see no problem using UTF-8 internally, but I don't do piecetable work so it's not really my call. I wasn't trying to preempt the decision; the new class is just a utility. Frank Francis James Franklin [EMAIL PROTECTED] "No, she really likes me. She told me I look like Britney Spears, and why would you say that to somebody you don't like?" --- Elle Woods
