On Sun, 16 Feb 2003, Doug Ewell wrote:

> The Unicode home page includes the following line, right where it should
> be, in the <head> section:
> 
> <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
> 
> Any User Agent that takes a page properly marked as UTF-8, as above, and
> still tries to autodetect a local code page, is badly misguided.  How
> would it handle a real UTF-8-encoded euro sign (0xE2 0x82 0xAC)?

AFAICR, there is not supposed to be a single non-ASCII character before that
<meta> tag. I really don't want to search the specs again, but I'm sure I
saw it somewhere. The HTML renderer sees those characters and thinks the
document has already started (since the <html>, <head> and <body> tags are
not mandatory in HTML 4 Transitional), so it goes into autodetection mode
before it ever reaches the charset declaration. The same thing happens with
MS FrontPage 2000 (but I've already explained that).
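
To check a page for this condition, something like the following Python
sketch might do. The function name and the regular expression are my own
illustrative assumptions, not taken from any spec or browser; it only looks
for a byte of 0x80 or above ahead of the first <meta> charset declaration.

```python
import re
from typing import Optional

# Hypothetical pattern: a <meta ...> tag carrying a charset declaration,
# matched on raw bytes because the encoding is not yet known at this point.
META_CHARSET = re.compile(rb"<meta[^>]*charset\s*=", re.IGNORECASE)


def non_ascii_before_meta(data: bytes) -> Optional[int]:
    """Return the offset of the first non-ASCII byte that precedes the
    <meta> charset declaration, or None if there is no such byte.

    If no declaration is found, the whole document is scanned.
    """
    match = META_CHARSET.search(data)
    end = match.start() if match else len(data)
    for offset, byte in enumerate(data[:end]):
        if byte >= 0x80:  # anything outside 7-bit ASCII
            return offset
    return None
```

A page with a UTF-8 euro sign (0xE2 0x82 0xAC) in its <title> ahead of the
<meta> tag would be flagged; the same page with the <meta> tag first would
not.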

roozbeh

