Re: Encoding/Use of pontial unpaired UTF-16 surrogate pair specifiers

Doug Ewell Sat, 30 Jan 2016 13:50:26 -0800

Chris Jacobs wrote:

UTF16 has no way to define a code point that is D800-DFFF; this is
an issue if I want to apply some sort of encryption algorithm and
still have the result treated as text for transmission and encoding
to other string systems.


This is not an issue at all. You don't have to restrict the input to
text to be able to generate an output that can be treated as text.

I gathered that J wanted to generate arbitrary output that could beinterpreted as UTF-16 code units. I admit to being less than 100% sureof this.

Certainly there is no shortage of algorithms to map arbitrary byte inputto text output, usually limited to some subset of ASCII. One interestingapproach for the Unicode era was Markus Scherer's "Base16k" concept, athttps://sites.google.com/site/markusicu/unicode/base16k .

--

Doug Ewell | http://ewellic.org | Thornton, CO 🇺🇸

Re: Encoding/Use of pontial unpaired UTF-16 surrogate pair specifiers

Reply via email to