Kenichi Handa <[EMAIL PROTECTED]> writes: > Florian Weimer <[EMAIL PROTECTED]> writes: >> What does 'via surrogate pair' mean? I guess the second line should >> read: > >>> 00 xxxx xxxxxxxx xxxxxxxx Unicode 20bit (U+10000 - U+FFFFF) > > Yes. That's correct, and the third line shoud read as below: > > 01 0000 xxxxxxxx xxxxxxxx Unicode 20bit (U+100000 - U+10FFFF)
I'm still not convinced it's correct. My current understanding is that it should be: 00 xxxx xxxxxxxx xxxxxxxx Unicode 20 bit (U+000000 - U+0FFFFF) 01 0000 xxxxxxxx xxxxxxxx Unicode 20.08... bit (U+100000 - U+10FFFF) I'm currently reading the emacs-unicode mailing list, and it seems a few essential issues weren't on the horizon back then. Shall I send a comment to the emacs-unicode mailing list if I'm finished? -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/