Re: [whatwg] Handling of illegal byte-sequences (typically in UTF-8)

2007-06-14 Thread Ian Hickson
On Fri, 24 Nov 2006, �istein E. Andersen wrote: Section 8.1.4: Bytes that are not valid UTF-8 sequences must be interpreted as [...] U+FFFD Section 9.2.2: Bytes or sequences of bytes [...] that could not be converted to Unicode characters must be converted to U+FFFD If I read this

Re: [whatwg] Handling of illegal byte-sequences (typically in UTF-8)

2006-11-24 Thread Henri Sivonen
On Nov 24, 2006, at 04:11, Øistein E. Andersen wrote: Section 8.1.4: Bytes that are not valid UTF-8 sequences must be interpreted as [...] U+FFFD Section 9.2.2: Bytes or sequences of bytes [...] that could not be converted to Unicode characters must be converted to U+FFFD If I read

Re: [whatwg] Handling of illegal byte-sequences (typically in UTF-8)

2006-11-24 Thread Øistein E . Andersen
On 24 Nov 2006, at 10:33AM, Henri Sivonen wrote: On Nov 24, 2006, at 04:11, Øistein E. Andersen wrote: Section 8.1.4: Bytes [-] U+FFFD Section 9.2.2: Bytes or sequences of bytes [-] U+FFFD I'm inclined to think that interop[erability] in error situations doesn't need to go as deep as

[whatwg] Handling of illegal byte-sequences (typically in UTF-8)

2006-11-23 Thread Øistein E . Andersen
Section 8.1.4: Bytes that are not valid UTF-8 sequences must be interpreted as [...] U+FFFD Section 9.2.2: Bytes or sequences of bytes [...] that could not be converted to Unicode characters must be converted to U+FFFD If I read this correctly, section 8.1.4 requires that an illegal UTF-8