Re: validity of lone surrogates

Marcin 'Qrczak' Kowalczyk Tue, 03 Jul 2001 23:55:31 -0700

Tue, 3 Jul 2001 11:19:05 +0100, Michael Everson <[EMAIL PROTECTED]> pisze:

>>I would be glad if the resolution allowed UTF-8 and UTF-32 encoders and
>>decoders to not worry about surrogates at all. Please leave surrogate
>>issues to UTF-16.
> 
> But what if I want to put up a Web page in Etruscan?

UTF-8 and UTF-32 handle characters above U+FFFF with no problem.
I mean: forget about surrogates, i.e. about encoding those characters
as pairs of words in the range 0xD800..DFFF in encodings other than
UTF-16. For those encodings U+D800..DFFF are just code points like
others; they encode the whole contiguous range U+0000..10FFFF (maximum
would be U+7FFFFFFF if the idea of UTF-16 wasn't pushed so hard).

-- 
 __("<  Marcin Kowalczyk * [EMAIL PROTECTED] http://qrczak.ids.net.pl/
 \__/
  ^^                      SYGNATURA ZASTĘPCZA
QRCZAK

Re: validity of lone surrogates

Reply via email to