Hi Olga,

I forwarded your query and got the following response:
===
That is because the GB18030<->Unicode conversion table we're using is NOT the latest one. The character in your input belongs to CJK Unified Ideographs Extension B, which is defined only in the GB18030-2005 standard.
===
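
For reference, here is a rough sketch of what a converter with a current table would be expected to emit. U+1F000 lies outside the Basic Multilingual Plane, and GB18030 maps U+10000 through U+10FFFF linearly onto four-byte sequences starting at 90 30 81 30. The function name gb18030_supp is only an illustration, not part of iconv or any other tool, and it does no range checking:

```shell
# Hypothetical helper (not part of iconv): print the GB18030 four-byte
# sequence for a supplementary code point in the range U+10000..U+10FFFF.
gb18030_supp() {
    cp=$(( $1 ))                        # accepts hex input such as 0x1F000
    idx=$(( cp - 0x10000 ))             # linear offset from U+10000
    b1=$(( 0x90 + idx / 12600 ))        # 12600 = 10 * 126 * 10
    b2=$(( 0x30 + (idx / 1260) % 10 ))  # 1260 = 126 * 10
    b3=$(( 0x81 + (idx / 10) % 126 ))
    b4=$(( 0x30 + idx % 10 ))
    printf '%02X %02X %02X %02X\n' "$b1" "$b2" "$b3" "$b4"
}

gb18030_supp 0x1F000    # prints: 94 38 E1 30
```

So the first iconv in the pipeline should produce those four bytes rather than '?' substitutes, assuming its table covers the supplementary planes.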

Regards,
hnhn

On 07/20/12 11:02 AM, ольга крыжановская wrote:
Can anyone say why the following iconv round trip through GB18030 fails
and prints two '?' instead of the Unicode character U+1F000?

printf '\xf0\x9f\x80\x80' | iconv -f 'UTF-8' -t GB18030 | iconv -f GB18030
??

My understanding is that GB18030 supports all Unicode characters with
a GBK-like encoding, right?

Olga

--
Jan Hnatek
jan.hna...@oracle.com
_______________________________________________
opensolaris-discuss mailing list
opensolaris-discuss@opensolaris.org
