Adriano wrote:

>> Looking in the source of intl_builtin.cpp I noticed that there is 
>> support for UTF16, UTF32 and UNICODE_UCS2, for UNICODE_UCS2 there is 
>> also a constant (=8) defined in charsets.h
>>
>> These definitions are missing from RDB$CHARACTER_SETS. Can these be used 
>> as a connection or column character set? If not, what are they for?
>
>These are for internal usage only.
>
>I doubt someone can make UTF16/32 works as connection charset, it's too
>much work.
>
>For columns, with some work may be possible. But why? UTF-8 uses 1-4
>bytes per char, UTF-16 is also multibyte, using 2-4, and UTF-32 always 4
>bytes per char.
>
>I don't see how they might be preferred over UTF-8.

UTF-16 is much preferred for e.g. Far Eastern languages, because it
will use 2 bytes for every code point, whereas UTF-8 needs at least 3.

I can imagine that Japanese Firebird users would appreciate UTF-16
support.


Paul Vinkenoog

------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite!
It's a free troubleshooting tool designed for production.
Get down to code-level detail for bottlenecks, with <2% overhead. 
Download for free and get started troubleshooting in minutes. 
http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
Firebird-Devel mailing list, web interface at 
https://lists.sourceforge.net/lists/listinfo/firebird-devel

Reply via email to