-----BEGIN PGP SIGNED MESSAGE----- Doug Ewell wrote: > Markus Scherer <[EMAIL PROTECTED]> wrote: > > In UTF-7, byte sequences overlap and many bytes in the encoding > > (2 out of 8 I think) contain pieces of two adjacent code units. > > This is more like in Huffman codes. > > This is one reason why I'm a little uncomfortable with the wording in > UTR #17, which specifically mentions SCSU as a Transfer Encoding Syntax > (in contrast to a Character Encoding Scheme) but does not mention UTF-7, > which to my mind fits the definition of a TES much better.
Neither SCSU nor UTF-7 are TES's; they are both CES's (and are even registered as IANA charsets, to prove the point). There is of course nothing in the definition of a CES/charset (in the IETF or W3C or UTR#17 models) that requires it to be stateless, or that requires encodings of code points not to "overlap". - -- David Hopwood <[EMAIL PROTECTED]> Home page & PGP public key: http://www.users.zetnet.co.uk/hopwood/ RSA 2048-bit; fingerprint 71 8E A6 23 0E D3 4C E5 0F 69 8C D4 FA 66 15 01 Nothing in this message is intended to be legally binding. If I revoke a public key but refuse to specify why, it is because the private key has been seized under the Regulation of Investigatory Powers Act; see www.fipr.org/rip -----BEGIN PGP SIGNATURE----- Version: 2.6.3i Charset: noconv iQEVAwUBPLaIajkCAxeYt5gVAQG8vwf/cQhmge87rpsRblGLuuWMmen2CAkzVl4Z 0us/SP5HieGI0yIaAl/Ugi0idXjBZg4PHVCmKWl/fFbPOIhJBTx5oaRDRU4p2/eV K1cklegHLpU5n13PMQx/bDtZFCA79ChmdaQTQdUXiHpvCK/BWpqcWrXkaTm6oxO1 tUPNWzWOYVd89uz4jmhvA651v8lkemT5Ir8ELqNjTBA8pe5xlEpHnPxfO1b4MYg8 FvKMWHUeAhavmwJZjzZG7ff4O+sZCKYbZlc7EXtKK2JBtTOwAskkAPzB49oBGUUC Nw7elg8LjxRDNOigu7IlDLnGXUbOsfJnaTSFq79ROmbn7UBj/srjVw== =bYBh -----END PGP SIGNATURE-----

