On 20 July 2012 11:02, ольга крыжановская <olga.kryzhanov...@gmail.com> wrote:
> Can anyone say why the following iconv round trip through GB18030 fails
> and prints two '?' instead of the Unicode character U+1F000?
>
> printf '\xf0\x9f\x80\x80' | iconv -f 'UTF-8' -t GB18030 | iconv -f GB18030
> ??
>
> My understanding is that GB18030 supports all Unicode characters with
> a GBK-like encoding, right?

Right. My understanding is that GB18030 support is slightly broken (iconv
isn't the only affected part; all of the Tibetan glyphs come up as ? as
well) and Sun^WORACLE doesn't care. I think a well-tuned email to the PRC
Ministry of Commerce [english.mofcom.gov.cn] will be the only way to
get that fixed. All software sold in China must conform to GB18030;
software that does not will get banned from gov.cn sales or even banned
from China altogether. And the communists KNOW how to make ORACLE dance.
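
For comparison, here is a rough sketch of what a lossless round trip of
U+1F000 should look like. It assumes Python 3 and its built-in "gb18030"
codec rather than the Solaris iconv from the question, and the helper
gb18030_supplementary() is a made-up name for illustration. It relies on
the GB18030 rule that supplementary-plane code points map linearly onto
four-byte sequences starting at 0x90308130 for U+10000.

```python
# Sketch: compute the GB18030 four-byte form of a supplementary-plane
# code point by hand and compare it with Python's "gb18030" codec.
# gb18030_supplementary() is a hypothetical helper, not a library API.

def gb18030_supplementary(cp: int) -> bytes:
    """Encode a code point in U+10000..U+10FFFF as four GB18030 bytes."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only supplementary-plane code points are handled")
    # U+10000 maps to 0x90308130, which is linear index 189000.
    index = 189000 + (cp - 0x10000)
    index, b4 = divmod(index, 10)    # fourth byte, range 0x30..0x39
    index, b3 = divmod(index, 126)   # third byte, range 0x81..0xFE
    b1, b2 = divmod(index, 10)       # first byte offset, second byte offset
    return bytes([0x81 + b1, 0x30 + b2, 0x81 + b3, 0x30 + b4])

# Same character as the UTF-8 bytes f0 9f 80 80 in the printf command.
ch = b"\xf0\x9f\x80\x80".decode("utf-8")

by_hand = gb18030_supplementary(ord(ch))
by_codec = ch.encode("gb18030")

print("by hand :", by_hand.hex())
print("by codec:", by_codec.hex())
print("round trip ok:", by_codec.decode("gb18030") == ch)
```

If an iconv implementation emits anything other than one four-byte
sequence for this character, the first conversion (UTF-8 to GB18030) is
the likely culprit; piping its output through od -An -tx1 instead of the
second iconv would show which half of the pipeline loses the character.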

Ced
-- 
Cedric Blancher <cedric.blanc...@googlemail.com>
Institute Pasteur
_______________________________________________
opensolaris-discuss mailing list
opensolaris-discuss@opensolaris.org
