Is your argument that the Chinese, Japanese and Koreans won't be confused because they gave up on the idea that computers could compare symbols long ago? :-)
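What "comparing symbols" means here can be sketched outside APL and J. The following is a minimal Python illustration, not part of the original exchange: the sample string and the helper `unique` are made up for the purpose, and it only assumes that text can be viewed either as a sequence of characters or as its UTF-8 bytes.

```python
# Sample string (an assumption for illustration): three words, one repeated.
text = "身体 水果 蔬菜 身体"

def unique(seq):
    """Hypothetical helper: distinct items in order of first appearance."""
    seen = []
    for item in seq:
        if item not in seen:
            seen.append(item)
    return seen

# View 1: the text as characters (code points).
chars = unique(text)

# View 2: the same text as UTF-8 bytes; each of these CJK characters takes 3 bytes.
raw = text.encode("utf-8")
byte_values = unique(raw)  # distinct byte values, not distinct symbols

print(len(text), len(raw))   # 11 characters, 27 bytes
print("".join(chars))        # unique symbols
print(byte_values)           # integers that no longer correspond to symbols
```

In the character view, "unique" and "length" answer questions about symbols; in the byte view, the same operations answer questions about the encoding, which is where the confusion comes from.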
In Dyalog APL, I can go to http://www.languageguide.org/mandarin/ and copy the first line of text in the section labeled 2 into my session:

      text←'身体 shēn tǐ 身体 shēn tǐ 水果 shuǐ guǒ 蔬菜 shū cài '
      ∪text ⍝ Unique symbols
身体 shēntǐ水果ugǒ蔬菜ūcài
      ∪(255<⎕UCS text)/text ⍝ Unique symbols outside ANSI range
身体ēǐ水果ǒ蔬菜ū
      text[⍋text] ⍝ OK, pretty meaningless :-)
cghhhhinnssssttuuàēēūǐǐǐǒ体体果水菜蔬身身
      ⎕dr text ⍝ Data Type (160 = 16-bit Unicode chars)
160

-----Original Message-----
From: bill lam [mailto:[email protected]]
Sent: 26. maj 2009 17:55
To: [email protected]
Subject: Re: [Jchat] No More APL

I agree that the approach taken by Dyalog can be less surprising for Europeans. OTOH UTF-8 will not be that confusing in the Far East because CJK have used multibyte character sets for decades.

--
regards,
====================================================
GPG key 1024D/4434BAB3 2008-08-24
gpg --keyserver subkeys.pgp.net --recv-keys 4434BAB3
唐詩154 許渾 秋日赴闕題潼關驛樓
    紅葉晚蕭蕭 長亭酒一瓢 殘雲歸太華 疏雨過中條
    樹色隨山迥 河聲入海遙 帝鄉明日到 猶自夢漁樵
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
