On Sat, Oct 16, 2010 at 4:41 AM, Michal Suchanek <[email protected]>wrote:

> 2010/10/16 Jonathan S. Shapiro <[email protected]>:
> > Ben: Do you have a sense of what the frequency and distribution is of
> > extended code points in typical Chinese text?
>
> If you look at Chinese manual for your mainboard or hardrive you will
> likely notice that it's mostly Chinese ideograms with occasional Latin
> word or two for technical terms and trademarks and occasional strings
> of "arabic" numerals to represent a number.
>

Yes. But you suggested that the most common 200,000 were within the UCS16
space, so it wasn't clear to me how many of those ideogram runs might be
UCS16-encodable.

Is there a syllabic script (something similar to a kana) in China?


shap
_______________________________________________
bitc-dev mailing list
[email protected]
http://www.coyotos.org/mailman/listinfo/bitc-dev

Reply via email to