---------- Forwarded message ----------
Date: Fri, 6 Feb 2004 12:02:14 -0500
From: Anton Okmianski <[EMAIL PROTECTED]>
To: 'Rainer Gerhards' <[EMAIL PROTECTED]>,
     "'Harrington, David'" <[EMAIL PROTECTED]>, [EMAIL PROTECTED]
Subject: RE: -international: trailer

Rainer:

> I am still tempted to allow only octets in the range of 1..255. ;)

I think at least technically this restriction is possible because 0x00
never appears as part of any characters encoded as multi-octet
characters in UTF-8.  See table here:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8

However, I don't know if restricting this bit sequence is necessary.
What do other UTF-8 based protocols do?

Anton.




Reply via email to