I found the answers in http://www.unicode.org/unicode/reports/tr27/
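
For illustration, here is a minimal sketch of the shortest-form check that
the quoted message below is about. It is not code from this thread: the
names decode_strict and is_well_formed are made up for the example, and
rejecting U+FFFE/U+FFFF is a policy assumption taken from Edmund's
suggestion rather than something every decoder must do.

```python
# Hypothetical helper (not from the thread): decode ONE UTF-8 sequence by hand
# and reject non-shortest ("overlong") forms such as "\xe0\x84\x80".

# Per sequence length: (lead-byte mask, lead-byte pattern,
#                       payload mask, smallest code point allowed)
_FORMS = [
    (0x80, 0x00, 0x7F, 0x0000),    # 1 byte:  0xxxxxxx
    (0xE0, 0xC0, 0x1F, 0x0080),    # 2 bytes: 110xxxxx 10xxxxxx
    (0xF0, 0xE0, 0x0F, 0x0800),    # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
    (0xF8, 0xF0, 0x07, 0x10000),   # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
]


def decode_strict(seq: bytes) -> int:
    """Return the code point encoded by seq, or raise ValueError."""
    if not seq:
        raise ValueError("empty sequence")
    lead = seq[0]
    for length, (mask, pattern, payload, minimum) in enumerate(_FORMS, start=1):
        if (lead & mask) == pattern:
            break
    else:
        raise ValueError("invalid lead byte 0x%02X" % lead)
    if len(seq) != length:
        raise ValueError("expected %d bytes, got %d" % (length, len(seq)))

    cp = lead & payload
    for b in seq[1:]:
        if (b & 0xC0) != 0x80:
            raise ValueError("invalid continuation byte 0x%02X" % b)
        cp = (cp << 6) | (b & 0x3F)

    # Shortest-form rule: the value must actually need this many bytes.
    if cp < minimum:
        raise ValueError("overlong encoding of U+%04X" % cp)
    if cp > 0x10FFFF:
        raise ValueError("beyond U+10FFFF")
    # Assumption: follow the suggestion in the thread and exclude these too.
    if cp in (0xFFFE, 0xFFFF):
        raise ValueError("noncharacter U+%04X" % cp)
    return cp


def is_well_formed(seq: bytes) -> bool:
    """Convenience wrapper: True if decode_strict accepts seq."""
    try:
        decode_strict(seq)
        return True
    except ValueError:
        return False
```

With this sketch, b"\xc4\x80" decodes to U+0100, while b"\xe0\x84\x80",
b"\xf0\x80\x84\x80" and b"\xc0\xa0" are all rejected as overlong. A real
decoder would likely also want to reject the surrogate range; that is left
out here to keep the example short.
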
> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED]]On Behalf Of Carl W. Brown
> Sent: Monday, September 10, 2001 9:22 AM
> To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
> Subject: RE: Encoding conversions
>
>
> Edmund,
>
> >
> > Note also that "\xe0\x84\x80" is illegal, for example, as U+0100
> > should be represented only by "\xc4\x80".
>
> Likewise \xF0\x80\x84\x80 would be illegal as well. I had not
> considered it.
>
> I guess I should also stop encoding spaces as \xC0\xA0 ;-}
>
> >
> > Perhaps you want to exclude U+FFFF, too.
> >
>
> You are right.
>
> Carl
>
>
>
> -
> Linux-UTF8: i18n of Linux on all levels
> Archive: http://mail.nl.linux.org/linux-utf8/