On Sun, Mar 18, 2018, 10:30 AM Anders Rundgren <
[email protected]> wrote:

> Violently agree but do not understand (I guess I'm just dumb...) why (for
> example) sorting on UCS2/UTF-16 Code Units would not achieve the same goal
> (although the result would differ).
>

Because there are JavaScript strings which are not valid UTF-16.  For
example, the one-character string '\uD800' is a lone surrogate: a
legal code unit on its own, but not a well-formed UTF-16 sequence. On
the input validation side, there are 8-bit strings which cannot be
decoded as UTF-8.  A complete sorting spec needs to describe how these
are to be handled. For example, something like WTF-8:
http://simonsapin.github.io/wtf-8/
  --scott
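
To make the point above concrete, here is a rough sketch (not from any
spec). The helper names isWellFormedUTF16 and compareCodeUnits are
made up for illustration. It shows that '\uD800' is a legal JavaScript
string that is not well-formed UTF-16, and that sorting by code units
gives a total order that differs from code point order.

```javascript
// A lone high surrogate: a legal JS string, but not well-formed UTF-16.
const lone = '\uD800';

// Hypothetical helper: true only if every surrogate in s is properly paired.
function isWellFormedUTF16(s) {
  for (let i = 0; i < s.length; i++) {
    const u = s.charCodeAt(i);
    if (u >= 0xD800 && u <= 0xDBFF) {
      // A high surrogate must be followed by a low surrogate.
      const next = i + 1 < s.length ? s.charCodeAt(i + 1) : 0;
      if (next < 0xDC00 || next > 0xDFFF) return false;
      i++; // skip the low surrogate we just checked
    } else if (u >= 0xDC00 && u <= 0xDFFF) {
      return false; // low surrogate with no preceding high surrogate
    }
  }
  return true;
}

// Hypothetical comparator: order strings by raw UTF-16 code units.
// It is defined for every JS string, including ill-formed ones.
function compareCodeUnits(a, b) {
  const n = Math.min(a.length, b.length);
  for (let i = 0; i < n; i++) {
    const d = a.charCodeAt(i) - b.charCodeAt(i);
    if (d !== 0) return d;
  }
  return a.length - b.length;
}

// U+FF5E is one code unit (0xFF5E); U+1F600 is the pair 0xD83D 0xDE00.
// By code units U+1F600 sorts first; by code points it would sort last.
const sorted = ['\uFF5E', '\uD83D\uDE00'].sort(compareCodeUnits);
```

A code unit comparator like this never has to reject input, which is
why it sidesteps the lone surrogate problem. Any scheme that goes
through UTF-8 or code points has to say what happens to '\uD800'.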


_______________________________________________
es-discuss mailing list
[email protected]
https://mail.mozilla.org/listinfo/es-discuss
