---------- Forwarded message ----------
Date: Fri, 6 Feb 2004 12:02:14 -0500
From: Anton Okmianski <[EMAIL PROTECTED]>
To: 'Rainer Gerhards' <[EMAIL PROTECTED]>,
"'Harrington, David'" <[EMAIL PROTECTED]>, [EMAIL PROTECTED]
Subject: RE: -international: trailer
Rainer:
> I am still tempted to allow only octets in the range of 1..255. ;)
I think at least technically this restriction is possible, because 0x00
never appears within the encoding of any multi-octet character in UTF-8;
it occurs only as the single-octet encoding of U+0000 (NUL). See table here:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
However, I don't know whether prohibiting the 0x00 octet is actually necessary.
What do other UTF-8 based protocols do?
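As a quick sanity check of the 0x00 point, here is a minimal Python sketch.
The function name is my own and purely illustrative. It encodes every Unicode
scalar value with Python's built-in UTF-8 encoder and collects the code points
whose encoded form contains a 0x00 octet, so it is an empirical check under
that assumption rather than a proof:

```python
def code_points_with_nul_octet():
    """Return every code point whose UTF-8 encoding contains a 0x00 octet.

    Hypothetical helper for illustration only; it is not part of any
    protocol draft or library.
    """
    offenders = []
    for cp in range(0x110000):  # all Unicode code points U+0000..U+10FFFF
        if 0xD800 <= cp <= 0xDFFF:
            continue  # surrogates are not valid scalar values in UTF-8
        encoded = chr(cp).encode("utf-8")
        if 0 in encoded:  # True if any octet of the encoding equals 0x00
            offenders.append(cp)
    return offenders


if __name__ == "__main__":
    # Only U+0000 itself should show up, as a single-octet encoding.
    print(code_points_with_nul_octet())
```

If only code point 0 is reported, then excluding octet 0x00 from the allowed
range costs exactly one character, NUL, and no multi-octet character is
affected.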
Anton.