* David Hopwood | | Presumably the "NT 4.0" mapping at <http://www.autumn.org/etc/unidif.html> | (in Japanese, but the table is readable by non-Japanese-speakers). | | That mapping is a superset of CP932 | (<ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP932.TXT>), | with additional mappings from 0xF040..0xF9FC to U+E000..E757, from | 0x80 to U+0080 (why?), and from the other 4 reserved single-byte codes | to U+F8F0..F8F3.
Hmmmm. It seems like this table has a little more information (provided I interpret it correctly). | I wouldn't know, but the private use codes can't be assumed to mean | anything in particular, regardless of what charset they start out as. | Such pages are broken, and should be using NCRs or images instead. OK, so it seems like this characters have no fixed interpretation. I did a little test, by generating a Shift-JIS test page and viewing it in MSIE. It turns out that MSIE supports only the 0xFA40 - 0xFC4B range, and not the rest of the 0xF0F0 - 0xFCFC range, which means that the tables I've already been referred to contain all the information I need. An interesting question is whether this range is particular to Shift-JIS, or whether these characters should be considered to have been added to JIS 0208, so that EUC-JP and ISO 2022-JP also can use them. Does anyone know? --Lars M.