On Sat, Oct 16, 2010 at 4:41 AM, Michal Suchanek <[email protected]>wrote:
> 2010/10/16 Jonathan S. Shapiro <[email protected]>: > > Ben: Do you have a sense of what the frequency and distribution is of > > extended code points in typical Chinese text? > > If you look at Chinese manual for your mainboard or hardrive you will > likely notice that it's mostly Chinese ideograms with occasional Latin > word or two for technical terms and trademarks and occasional strings > of "arabic" numerals to represent a number. > Yes. But you suggested that the most common 200,000 were within the UCS16 space, so it wasn't clear to me how many of those ideogram runs might be UCS16-encodable. Is there a syllabic script (something similar to a kana) in China? shap
_______________________________________________ bitc-dev mailing list [email protected] http://www.coyotos.org/mailman/listinfo/bitc-dev
