On Mar 13, 2006, at 16:12, Lachlan Hunt wrote:

Henri Sivonen wrote:
Authors are adviced not to use the UTF-32 encoding or legacy encodings. (Note: I think UTF-32 on the Web is harmful and utterly pointless,

I agree about it being pointless, but why is it considered harmful?

Opportunity cost: The time that is spent implementing something pointless could be better spend doing something else--like implementing something useful.

Backwards incompatibility: Using UTF-32 instead of UTF-8 makes pages incompatible with older UAs for no good reason.

Size: UTF-32 takes more bytes to transfer than UTF-8--slow load, bad user experience.

 I'd like to have some text in the spec that justifies whining
about legacy encodings.

What are your reasons for whining about legacy encodings and what would you like the spec to say?

Using a legacy encoding that user agents are not guaranteed to support introduces incompatibility for no good reason. (I do not consider laziness or unwillingness to use UTF-8 good reasons.)

Even with well-supported legacy encodings form submission is problem. The same as incoming policy combined with an encoding that cannot encode all of Unicode leads to data loss.

I would like the spec to say that if the page has forms, using an encoding other than UTF-8 is trouble. And even for pages that don't have forms, using an encoding that is not known to be extremely well supported introduces incompatibility for no good reason.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


Reply via email to