On Tue Nov 6 14:46:32 2007, Tomasz Sterna wrote:
> On Tue, 06-11-2007 at 14:35 +0000, Dave Cridland wrote:
> > > Let's take first 256 allowable UTF-8 characters [...]
> > Can't do that, because many of those characters are going to be
> > illegal even in CDATA sections.
> First _allowable_ 256 UTF-8 characters are for sure legal in CDATA
> section.
I'm not following something. So encode the octets #x00 #x01 #x02 #x5D
#x5D #x3E, and tell me what you get.

I get three bytes that are not legal in a CDATA section, followed by
a sequence of bytes which decode (via UTF-8) to "]]>", which in turn
would end the CDATA section.

As far as I can tell, all those octet values would need to be further
escaped.
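
To make that concrete, here is a minimal Python sketch of the check.
The helper name is_xml_char is made up for illustration, and it simply
hand-codes the Char production from XML 1.0; it assumes the octets are
passed through unchanged as UTF-8, which is how I read the proposal.

```python
# The six octets from the example: three control bytes, then "]]>" in ASCII.
octets = bytes([0x00, 0x01, 0x02, 0x5D, 0x5D, 0x3E])
text = octets.decode("utf-8")


def is_xml_char(ch: str) -> bool:
    """Hypothetical helper: True if ch matches the XML 1.0 Char production.

    Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD]
             | [#x10000-#x10FFFF]
    """
    cp = ord(ch)
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


# Characters that may not appear anywhere in an XML 1.0 document,
# CDATA sections included.
illegal = [f"#x{ord(ch):02X}" for ch in text if not is_xml_char(ch)]

# The CDATA terminator appearing inside the payload would close the section early.
ends_cdata = "]]>" in text

print(illegal)     # ['#x00', '#x01', '#x02']
print(ends_cdata)  # True
```

So the first three octets fail the character check outright, and the
last three spell the CDATA end marker.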
Dave.
--
Dave Cridland - mailto:[EMAIL PROTECTED] - xmpp:[EMAIL PROTECTED]
- acap://acap.dave.cridland.net/byowner/user/dwd/bookmarks/
- http://dave.cridland.net/
Infotrope Polymer - ACAP, IMAP, ESMTP, and Lemonade