[whatwg] 9.2.2: replacement characters. How many?

Elliotte Harold Fri, 03 Nov 2006 03:52:50 -0800

Section 9.2.2 of the current Web Apps 1.0 draft states:

Bytes or sequences of bytes in the original byte stream that could notbe converted to Unicode characters must be converted to U+FFFDREPLACEMENT CHARACTER code points.

I'm concerned about the "or". For example, suppose there are six upperhalves of a Unicode surrogate pair in a row and no lower halves. Doesthat turn into six replacement characters or one? Both interpretationsseem possible.

I suppose I prefer six rather than one, but I don't care a great deal aslong as this is locked down one way or the other.


--
Elliotte Rusty Harold  [EMAIL PROTECTED]
Java I/O 2nd Edition Just Published!
http://www.cafeaulait.org/books/javaio2/
http://www.amazon.com/exec/obidos/ISBN=0596527500/ref=nosim/cafeaulaitA/

[whatwg] 9.2.2: replacement characters. How many?

Reply via email to