On 19 Oct 2009, at 05:52, Ian Hickson wrote:

I've noted your e-mail here [...] and moved the whole thing out of the spec.

That does not seem to apply to the last part of the original e-mail, quoted below.

Øistein E. Andersen



Other character encoding issues:
--------------------------------

ASCII-compatibility:
The note in ‘2.1.5 Character encodings’ seems to say that ‘variants of ISO-2022’ (presumably including common ones like ISO-2022-CN, ISO-2022KR and ISO-2022-JP) are ASCII-compatible, whereas HZ-GB-2312 is not, and I cannot find anything in Section 2.1.5 that would explain this difference.


Discouraged encodings:
‘4.2.5.5 Specifying the document's character encoding’ advises against certain encodings. (Incidentally, this advice probably deserves not to be ‘hidden’ in a section nominally reserved for character encoding *declaration* issues.) In particular:

Authors should not use JIS-X-0208 (JIS_C6226-1983), JIS-X-0212 (JIS_X0212-1990), encodings based on ISO-2022, and encodings based on EBCDIC.

It is not clear what this means (e.g., the character set JIS_C6226-1983 in any encoding, or only when encoded alone according to RFC1345 as described above); the list of discouraged encodings seems conspicuously short if it is supposed to be complete; and the lack of rationale makes it difficult to understand why these encodings are considered particularly harmful (JIS_C6226-1983 v. JIS_C6226-1978 or ISO-2022 v. HZ, to mention but two at least initially puzzling cases). It might be better to say *why* particular encodings are better avoided, whether or not the list of discouraged encodings be presented as definitive.

Minor grammar detail in 4.2.5.5:
Conformance checkers may advise against authors using legacy encodings.

This is ambiguous. It should probably be ‘advise against authors’ using legacy encodings’ or better ‘advise authors against using legacy encodings’.

Reply via email to