Re: Code pages and Unicode

Ken Whistler Mon, 22 Aug 2011 16:30:50 -0700

On 8/22/2011 3:15 PM, Richard Wordingham wrote:

On Monday 22 August 2011, Andrew West<[email protected]>  wrote:

>

>  >  Can anyone think of a way to extend UTF-16 without adding new
>  >  surrogates or inventing a new general category?

> >> > Andrew

>> How about a triple sequence of two high surrogates followed by one

>  low surrogate?


How about Clause 12.5 of ISO/IEC 10646:

<001B, 0025, 0040>

You "escape" out of UTF-16 to ISO 2022, and then you can do whatever the
heck you want, including exchange and processing of complete 4-byte forms,
with all the billions of characters folks seem to think they need.

Of course you would have to convince implementers to honor the ISO 2022

escape sequence and "liberate" themselves into a high-level world ofnosebleed

character numerosity. But then I guess by the time this is needed, folks are
counting on the need being self-evident. ;-)

--Ken

Re: Code pages and Unicode

Reply via email to