On Mon, 10 Sep 2001, Roozbeh Pournader wrote:
> > What will these restrictions be? Big changes?
>
> Well, UTF-8 will be made simpler. Currently, Unicode-conformant UTF-8
> decoders should accept 'irregular' UTF-8 (which is codepoint coded as
> UTF-16, and then reencoded as UTF-8). With the change, there will be no
> need for that anymore, and the decoder will be allowed to reject
> irregulars, or even forget about their existance.

The ISO 10646-1:2000 definition of UTF-8 had that already. Unicode is just
getting better aligned with the ISO standard here.

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to