Re: [whatwg] Internal character encoding declaration

Henri Sivonen Mon, 13 Mar 2006 06:43:20 -0800

On Mar 13, 2006, at 16:12, Lachlan Hunt wrote:

Henri Sivonen wrote:
Authors are adviced not to use the UTF-32 encoding or legacyencodings. (Note: I think UTF-32 on the Web is harmful and utterlypointless,
I agree about it being pointless, but why is it considered harmful?

Opportunity cost: The time that is spent implementing somethingpointless could be better spend doing something else--likeimplementing something useful.

Backwards incompatibility: Using UTF-32 instead of UTF-8 makes pagesincompatible with older UAs for no good reason.

Size: UTF-32 takes more bytes to transfer than UTF-8--slow load, baduser experience.

 I'd like to have some text in the spec that justifies whining
about legacy encodings.
What are your reasons for whining about legacy encodings and whatwould you like the spec to say?

Using a legacy encoding that user agents are not guaranteed tosupport introduces incompatibility for no good reason. (I do notconsider laziness or unwillingness to use UTF-8 good reasons.)

Even with well-supported legacy encodings form submission is problem.The same as incoming policy combined with an encoding that cannotencode all of Unicode leads to data loss.

I would like the spec to say that if the page has forms, using anencoding other than UTF-8 is trouble. And even for pages that don'thave forms, using an encoding that is not known to be extremely wellsupported introduces incompatibility for no good reason.


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/

Re: [whatwg] Internal character encoding declaration

Reply via email to