[whatwg] URL query component

Anne van Kesteren Fri, 20 Apr 2012 02:15:29 -0700

The URL query component for URLs found in HTML (exact set still be to bedefined I think) uses the page encoding when the page encoding is notutf-8/utf-16 (then it uses utf-8).


E.g. "?&euro;" maps to "?%80" in a windows-1252 encoded page.

Currently browsers differ for what happens when the code point cannot beencoded. E.g. "?€"

Opera uses "?". Internet Explorer uses "?" (but when the URL hits thenetwork layer, not when you inspect it via script). WebKit uses "&#...;".Gecko encodes it using utf-8.


What Gecko does makes the resulting data impossible to interpret.

What WebKit does is consistent with form submission. I like it.

Also, given that encoding behavior is not exposed besides form submissionand URLs, consistently using "&#...;" for code points not represented inlegacy encodings makes sense to me. Am I missing something?



--
Anne van Kesteren
http://annevankesteren.nl/

[whatwg] URL query component

Reply via email to