On Sat, 07 Jan 2012 02:55:15 +0100, Jarred Nicholls <[email protected]>
wrote:
Not exact, but close. For discussion's sake and in this context, you
could call it the "Unicode" text decoder that does BOM detection and
switches Unicode codecs automatically. For enforced UTF-8 I'd (have to)
disable the BOM detection, but additionally could avoid decoding
altogether if the specified encoding is not explicitly UTF-8 (and that
was a part of the spec). We'll make it work either way :)
FYI, if WebKit cannot do pure UTF-8 decoding (i.e. ignoring everything
else), WebKit has bugs in its server-sent events (EventSource), Web
Workers, WebVTT, and Web Sockets implementation. Potentially more, I'm not
sure if this list is still complete.
--
Anne van Kesteren
http://annevankesteren.nl/