On 26/11/2003 07:05, D. Starner wrote:

...

The whole point of such a tool would be to send binary data on a transport that
only allowed Unicode text. In practice, you'd also have to remap C0 and C1
characters; but even then 0x00-0x1F -> U+0250-U+026F and 0x80-0x9F ->
U+0270-U+028F wouldn't be too complex. Unless you've added a Unicode library to
what could otherwise be coded in 4k, normalization would add a lot of complexity.
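
For illustration, the remapping described above might look roughly like this in Python. This is only a sketch: the function names are made up, and it assumes every byte outside the two control ranges is sent as the Latin-1 code point with the same value (so DEL, 0x7F, passes through unchanged). Note that bytes such as 0xE9 then become U+00E9, which NFD would decompose, and that is exactly where normalization starts to hurt.

```python
# Sketch only: function names are hypothetical; the offsets come from the
# ranges quoted above (0x00-0x1F -> U+0250-U+026F, 0x80-0x9F -> U+0270-U+028F).

def bytes_to_text(data: bytes) -> str:
    """Map each byte to one code point, moving C0/C1 controls out of the way."""
    out = []
    for b in data:
        if b <= 0x1F:                      # C0 controls
            out.append(chr(0x0250 + b))
        elif 0x80 <= b <= 0x9F:            # C1 controls
            out.append(chr(0x0270 + (b - 0x80)))
        else:                              # assumption: pass through as Latin-1
            out.append(chr(b))
    return "".join(out)


def text_to_bytes(text: str) -> bytes:
    """Inverse of bytes_to_text; rejects code points the encoder never emits."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if 0x0250 <= cp <= 0x026F:
            out.append(cp - 0x0250)
        elif 0x0270 <= cp <= 0x028F:
            out.append(cp - 0x0270 + 0x80)
        elif cp <= 0xFF:
            out.append(cp)
        else:
            raise ValueError(f"unexpected code point U+{cp:04X}")
    return bytes(out)
```

The round trip only holds if the transport leaves the text untouched; a transport that normalizes would alter some of the pass-through characters.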



You could encode your 256 byte values as code points in the Unicode Private Use Area (PUA), cf. how Microsoft encodes symbol fonts. You wouldn't have to worry about normalisation or canonical equivalence, as PUA characters have no canonical equivalents.
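
A minimal sketch of the PUA idea in Python, in case it helps. The base U+E000 is my own arbitrary choice (any run of 256 code points within U+E000-U+F8FF would do), and the function names are hypothetical. The last helper uses the standard unicodedata module to check that none of the four normalization forms changes the encoded text.

```python
import unicodedata

# Assumption: U+E000 is an arbitrary base inside the BMP Private Use Area
# (U+E000-U+F8FF); sender and receiver just have to agree on it.
PUA_BASE = 0xE000


def encode_pua(data: bytes) -> str:
    """Map each byte value 0x00-0xFF to PUA_BASE + value."""
    return "".join(chr(PUA_BASE + b) for b in data)


def decode_pua(text: str) -> bytes:
    """Inverse of encode_pua; rejects anything outside the 256-code-point run."""
    out = bytearray()
    for ch in text:
        offset = ord(ch) - PUA_BASE
        if not 0 <= offset <= 0xFF:
            raise ValueError(f"unexpected code point U+{ord(ch):04X}")
        out.append(offset)
    return bytes(out)


def is_normalization_stable(text: str) -> bool:
    """True if NFC, NFD, NFKC and NFKD all leave the text unchanged."""
    forms = ("NFC", "NFD", "NFKC", "NFKD")
    return all(unicodedata.normalize(form, text) == text for form in forms)
```

For example, is_normalization_stable(encode_pua(data)) should hold for any input, since PUA characters have no decomposition mappings. The cost is that every byte takes three bytes in UTF-8.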

--
Peter Kirk
[EMAIL PROTECTED] (personal)
[EMAIL PROTECTED] (work)
http://www.qaya.org/