Rune Saetre wrote:

Hi

I just want to remind you that if there is no charset specified in the
HTTP Content-Type header this means that the document is encoded using the
iso-8859-1 charset.

The RFC is explicitely states that if no charset is indicated in the
Content-Type header the recipient should not try to guess the charset, but
the charset is in fact iso-8859-1.

Of course, "web applications" might not know what charset the webserver
would announce in the Content-Type header, so it makes sense to let this
be overridden in the document preamble. (I guess this is not strictly
legal regarding to the HTTP RFC...)

But once again, if no charset is indicated whatsoever, then the document
should be interpreted according to the iso-8859-1 charset, and NOT utf-8
(or whatever).

Here in Norway we have a lot of servers not indicating any charset at all,
but they usually do send out content encoded with the iso-8859-1 charset,
and is thus RFC compliant.

These sites did not to work with kannel earlier, so I sent a patch some
time ago to fix this. It should now be in CVS.

It wouldn't do much good for speed, but if conversion fails using the
implied iso-8859-1 charset, one could perhaps try another conversion based
on a "qualified guess" at the charset? In this way Kannel would be RFC
compliant so far as the documents are, and then still be able to process
non-RFC compliant documents with meaningfull content.



Yes, you are right, but 90% popular wap resources is NON-RFC complaint :((

May be iso-8859-1 charset must be set in wml_compile (wml_compiler.c) ?
like:

if (charset && octstr_len(charset)) {
if(octstr_case_compare(charset, octstr_imm("UTF-8")) == -1) {
debug("wml_compile", 0, "WML compiler: Transcoding from <%s> to UTF-8",
octstr_get_cstr(charset));
set_charset(wml_text, charset);
}
} else {
/* If charset not detected assume ISO-8859-1 encoding */
debug("wml_compile", 0, "WML compiler: Unknown encoding, assuming ISO-8859-1");
debug("wml_compile", 0, "WML compiler: Transcoding from ISO-8859-1 to UTF-8");
set_charset(wml_text, octstr_create("ISO-8859-1"));
}



--

Best regards,
Yury Mikhienko.
IT ERP group head, ZAO "Mobikom-Kavkaz"
mob: +7 (928) 2010199
tel: +7 (863) 2704188




Reply via email to