Kenichi Handa <[EMAIL PROTECTED]> writes:

> Florian Weimer <[EMAIL PROTECTED]> writes:
>> What does 'via surrogate pair' mean?  I guess the second line should
>> read:
>
>>>    00 xxxx xxxxxxxx xxxxxxxx   Unicode 20bit (U+10000 - U+FFFFF)
>
> Yes.  That's correct, and the third line should read as below:
>
>    01 0000 xxxxxxxx xxxxxxxx   Unicode 20bit (U+100000 - U+10FFFF)

I'm still not convinced it's correct.  My current understanding is
that it should be:

  00 xxxx xxxxxxxx xxxxxxxx   Unicode 20 bit       (U+000000 - U+0FFFFF)
  01 0000 xxxxxxxx xxxxxxxx   Unicode 20.08... bit (U+100000 - U+10FFFF)
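
A minimal Python sketch to sanity-check the two rows above. It assumes the
22-bit field simply holds the code point value unchanged; `layout_row` is a
hypothetical helper name used only for illustration, not anything from the
Emacs sources.

```python
import math

MAX_CODE_POINT = 0x10FFFF  # highest Unicode code point


def layout_row(cp: int) -> str:
    """Format a code point as a 22-bit pattern grouped 2 + 4 + 8 + 8.

    Assumption: the field stores the code point value as-is, so the
    grouping matches the rows quoted in this message.
    """
    if not 0 <= cp <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {cp:#x}")
    bits = format(cp, "022b")
    return f"{bits[:2]} {bits[2:6]} {bits[6:14]} {bits[14:]}"


# Bits needed to distinguish all 0x110000 code points: slightly more than 20.
bits_needed = math.log2(MAX_CODE_POINT + 1)

if __name__ == "__main__":
    for cp in (0x000000, 0x0FFFFF, 0x100000, 0x10FFFF):
        print(f"U+{cp:06X}  {layout_row(cp)}")
    print(f"log2(0x110000) = {bits_needed:.4f}")
```

Code points up to U+0FFFFF land in the `00 xxxx` row, and the remaining
U+100000 - U+10FFFF range lands in the `01 0000` row, which is where the
"20.08... bit" figure comes from.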

I'm currently reading the emacs-unicode mailing list, and it seems a
few essential issues weren't on the horizon back then.  Shall I send a
comment to the emacs-unicode mailing list when I'm finished?
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
