On Fri, Sep 08, 2006 at 12:19:19PM -0400, Tom Lane wrote: > Martijn van Oosterhout <firstname.lastname@example.org> writes: > > On Fri, Sep 08, 2006 at 10:35:58AM -0400, Tom Lane wrote: > >> what's more, the docs suggest that it doesn't support anything wider > >> than UTF16. > > > Well, that's not true, which part of the docs were you looking at? > > AFAICT, most of the useful operations work on UChar, which is uint16: > http://icu.sourceforge.net/apiref/icu4c/umachine_8h.html#6bb9fad572d65b305324ef288165e2ac
Oh, you're confusing UCS-2 with UTF-16, UCS-2 is a subset of UTF-16 that only handles the basic plane. Just like no-one is surprised that UTF-8 handles more than 256 characters, it shouldn't surprise you that UTF-16 handles more than 65536. ICU hasn't used UCS-2 since 1996. It's in the FAQ: http://icu.sourceforge.net/userguide/icufaq.html Have a nice day, -- Martijn van Oosterhout <email@example.com> http://svana.org/kleptog/ > From each according to his ability. To each according to his ability to > litigate.
Description: Digital signature