On 5 Oct 2001, Lars Marius Garshol wrote:

> I've received data encoded in ISO 2022-JP that I am unable to figure
> out how to map to Unicode. The characters in question do not appear in
> the old JIS0208.TXT, and I can't find them in UniHan.txt either

> These characters are the problem:
>
> (7A22, 7C22, 7964 and 7B64)

  Assuming what I wrote below is correct (and I didn't make
any mistake in 'math'), they are:

  u+605D, u+91DE, u+5953, u+FA21

> Does anyone know what characters these are, and how to map them to
> Unicode? Are they part of some vendor extension to JIS 0208? If so,

  I'm not sure, but it seems like they're a part of NEC Kanji
character set (ref. Ken Lunde, CJKV Information Processing, p. 592).
According to CJKV Information Processing NEC Kanji adds 360 Kanjis and
14 Non-Kanji from IBM Japanese character sets in rows 89-92.
IBM Japanese is listed as 'kIBMJapan' in UniHan.txt.

> does anyone know of a conversion table for that extension?

  Provided that what's above is the case, I guess you can rather easily
construct a conversion table from UniHan.txt by comparing the table in
p.585 of CJKV I.P. and the table in p. 594 of the same book.

  Jungshik Shin


Reply via email to