On 20 July 2012 11:02, ольга крыжановская <olga.kryzhanov...@gmail.com> wrote: > Can any one say why the following iconv fails in GB18030 and prints 2 > '?' of the unicode character U+1F000? > > printf '\xf0\x9f\x80\x80' | iconv -f 'UTF-8' -t GB18030 | iconv -f GB18030 > ?? > > My understanding is that GB18030 supports all Unicode characters with > a GBK-like encoding, right?
Right. My understanding is that GB18030 is slightly broken (iconv isn't the only part, the whole Tibetan glyphs come up as ? as well) and Sun^WORACLE doesn't care. I think a well-tuned email to the PRC ministry of commerce [english.mofcom.gov.cn] will be the only way to get that fixed (all software sold in China must conform to GB18030, and if the software does not it will get banned from gov.cn sales or even banned from China altogether. And the communists KNOW how to make ORACLE dance). Ced -- Cedric Blancher <cedric.blanc...@googlemail.com> Institute Pasteur _______________________________________________ opensolaris-discuss mailing list opensolaris-discuss@opensolaris.org