Comment #26 on issue 761 by [email protected]: Incorrect UTF-8 encoding/decoding for non-BMP characters in String related functions
http://code.google.com/p/v8/issues/detail?id=761

The string you receive from the websocket, what is its length?
If it's 12, you are receiving the string "\\ud834\\udf56" (i.e., where
the first character is a backslash), and not the string "\ud834\udf56"
(where the first character is the surrogate pair starter U+D834).

Also, do you know which *bytes* are sent by the websocket server?
It should be the four byte UTF-8 encoding: f0 9d 8d 96.

/L


--
v8-dev mailing list
[email protected]
http://groups.google.com/group/v8-dev

Reply via email to