Re: Micro Python -- a lean and efficient implementation of Python 3

Ian Kelly Tue, 03 Jun 2014 22:05:26 -0700

On Tue, Jun 3, 2014 at 10:40 PM, Rustom Mody <[email protected]> wrote:
>> 1) Most or all Chinese and Japanese characters
>
> Dont know how you count 'most'
>
> | One possible rationale is the desire to limit the size of the full
> | Unicode character set, where CJK characters as represented by discrete
> | ideograms may approach or exceed 100,000 (while those required for
> | ordinary literacy in any language are probably under 3,000). Version 1
> | of Unicode was designed to fit into 16 bits and only 20,940 characters
> | (32%) out of the possible 65,536 were reserved for these CJK Unified
> | Ideographs. Later Unicode has been extended to 21 bits allowing many
> | more CJK characters (75,960 are assigned, with room for more).
>
> | From http://en.wikipedia.org/wiki/Han_unification


So there are 20,940 CJK characters in the BMP, and approximately
55,000 more in the SIP.  I'd count 55,000 out of 75,960 as "most".
-- 
https://mail.python.org/mailman/listinfo/python-list

Re: Micro Python -- a lean and efficient implementation of Python 3

Reply via email to