Re: Utility to report and repair broken surrogate pairs in UTF-16 text

Markus Scherer Fri, 05 Nov 2010 14:52:38 -0700

On Fri, Nov 5, 2010 at 1:56 PM, Doug Ewell <d...@ewellic.org> wrote:

> Right, but as I said, those downstream tasks shouldn't be consumers of
> UTF-16 code units anyway.  They should be consumers of Unicode code
> points, which by definition excludes loose surrogates.
>


Code points include surrogates. Maybe you mean "UTF-32 code units" or
"Unicode scalar values".

markus

Re: Utility to report and repair broken surrogate pairs in UTF-16 text

Reply via email to