On Tue, 03 Apr 2012 13:59:25 +0200, Henri Sivonen <[email protected]> wrote:
On Wed, Jan 4, 2012 at 12:34 AM, Leif Halvard Silli
<[email protected]> wrote:
A solution that would border on reasonable would be decoding as
US-ASCII up to the first non-ASCII byte

Thus possibly prescan of more than 1024 bytes?

I didn't mean a prescan.  I meant proceeding with the real parse and
switching decoders in midstream. This would have the complication of
also having to change the encoding the document object reports to
JavaScript in some cases.

On IRC (#whatwg) zcorpan pointed out this would break URLs where entities are used to encode non-ASCII code points in the query component.


--
Anne van Kesteren
http://annevankesteren.nl/

Reply via email to