How about UTF-32 sequence which the 4 bytes represent value > U+10FFFF ? Are they considered ill-formed? Should they?
This discussion has been centered around UTF-8. But I hope the corresponding rules apply to UTF-16 and UTF-32 for Unicode 4.0:
. for UTF-32: occurrences of 'surrogates' are ill-formed.
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Kenneth Whistler
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Kenneth Whistler
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Markus Scherer
- Re: UTF-8 (was:Unicode 4.0 BETA availab... Asmus Freytag
- Re: Unicode 4.0 BETA available for review Kenneth Whistler
- Re: Unicode 4.0 BETA available for review Stefan Persson
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Kenneth Whistler
- Re: Unicode 4.0 BETA available for review Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Mark Davis
- Re: Unicode 4.0 BETA available for revi... Roozbeh Pournader
- Re: Unicode 4.0 BETA available for... Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Doug Ewell
- Re: Unicode 4.0 BETA available for revi... Yung-Fong Tang
- Re: Unicode 4.0 BETA available for review Kenneth Whistler

