Thomas Chan wrote: > - NTT-DoCoMo pictographs[1] in webpages for cell phones > http://www.nttdocomo.co.jp/i/tag/emoji/ .
Something is definitely weird in this page. They suggests to use shift-JIS codes for numerical character references, but such a thing is not allowed by the current definition of HTML: NCR's unambiguously and exclusively represent Unicode/ISO-10646 code points, regardless of the character set used in the page. How can such a thing work? For instance, inserting the sequence "說" in an HTML file does *not* insert the JIS character 0xF9A1 (a user code assigned to icon meaning "angry"), but rather the Unicode character U+F9A1 (CJK COMPATIBILITY IDEOGRAPH-F9A1, canonically equivalent to U+8AAA). BTW, Ironically, U+F9A1 (as well as U+8AAA) means "to SCOLD, to UPBRAID", which is what ANGRY people do all the time... :-) _ Marco