Comment #26 on issue 761 by [email protected]: Incorrect UTF-8 encoding/decoding for non-BMP characters in String related functions
http://code.google.com/p/v8/issues/detail?id=761
The string you receive from the websocket, what is its length? If it's 12, you are receiving the string "\\ud834\\udf56" (i.e., where the first character is a backslash), and not the string "\ud834\udf56" (where the first character is the surrogate pair starter U+D834). Also, do you know which *bytes* are sent by the websocket server? It should be the four byte UTF-8 encoding: f0 9d 8d 96. /L -- v8-dev mailing list [email protected] http://groups.google.com/group/v8-dev
