Re: [whatwg] Encoding Standard (mostly complete)

And Clover Thu, 19 Apr 2012 06:11:55 -0700

On 2012-04-18 22:34, Glenn Maynard wrote:

(It would be pretty neat if that could be changed to *always* using HTML
escapes for non-ASCII, except when encoding to UTF-8, since that's not
introducing anything new--you can already receive&x1234; escapes in POST
data--and it would alleviate the "form submit encoding depends on the
source page's encoding" problem.  I guess this must break pages somehow, or
vendors would have done this long ago.)

It naturally would break any page that's deliberately using a non-UTFencoding. Web applications do not - and should not be -HTML-character-reference-decoding their input because this would mangleliteral use of & characters (which are *not* escaped to &). There isno way to correctly recover a value that has been through this form oflossy encoding.

The charref-encoding-fallback is an ugly legacy hack that confuses webauthors and tempts them into using submitted strings directly withoutHTML-escaping, resulting in security holes. Its use should be minimisedwherever possible.


--
And Clover
mailto:[email protected]
http://www.doxdesk.com/
gtalk:[email protected]

Re: [whatwg] Encoding Standard (mostly complete)

Reply via email to