Re: [whatwg] UTF-16 encoding default

Oliver Hunt Tue, 23 Jun 2009 19:04:01 -0700

Have you checked for a byte order marker in the source document? (see http://unicode.org/faq/utf_bom.html#BOM)


--Oliver


On Jun 23, 2009, at 6:42 PM, Kartikaya Gupta wrote:

There's a page (http://www.microsoft.com/windowsmobile/mobile/en-us/totalaccess/software/software/eula-sw-netflix.mspxspecifically) that has a Content-Type header of "text/html;charset=utf-16" and has no BOM. The references I've seen (RFC2781,as well as http://unicode.org/faq/utf_bom.html#gen7) say that thismeans the content should be assumed to be UTF-16BE. The page,however, is actually in UTF-16LE.
All browsers seem to do some sort of unspecified magic and figureout that the page is in LE. I was wondering if that magic could bedescribed and added to the HTML5 spec so that it covers renderingthe above page as expected. According to the draft spec as itstands, I believe that page should be rendered as garbage.
Cheers,
kats
PS - the page also has a meta tag that says the charset isiso-8859-1. *sigh*

Re: [whatwg] UTF-16 encoding default

Reply via email to