Comment #11 on issue 761 by [email protected]: Incorrect UTF-8 encoding/decoding for non-BMP characters in String related functions
http://code.google.com/p/v8/issues/detail?id=761
I agree with converting to UTF-16. That way the data won't be corrupted when it is displayed to the user. Java works the same way: its storage is UTF-16, and string length is counted in chars (16-bit code units), so a non-BMP character counts as two.
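
To make the 16-bit-unit point concrete, here is a rough sketch in Python (not V8 or Java code; the helper name utf16_code_units is made up for illustration). It shows that a single non-BMP code point becomes a surrogate pair, i.e. two 16-bit units, in UTF-16, while taking four bytes in UTF-8:

```python
def utf16_code_units(text):
    """Return the 16-bit code units of text's UTF-16 encoding.

    Hypothetical helper for illustration only.
    """
    data = text.encode("utf-16-be")  # big-endian, no byte order mark
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


clef = "\U0001D11E"  # MUSICAL SYMBOL G CLEF, a character outside the BMP
units = utf16_code_units(clef)

print(len(clef))                  # 1 code point
print([hex(u) for u in units])    # ['0xd834', '0xdd1e'], a surrogate pair
print(len(clef.encode("utf-8")))  # 4 bytes in UTF-8
```

A length measured the way Java measures it would report 2 for this string, which is the behaviour being proposed here.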
-- v8-dev mailing list [email protected] http://groups.google.com/group/v8-dev
