Hello folks,

This will be of interest to only a few people, but it will be good to have it in the archives for when we need it.

Here is a list of Korean character sets that can represent hangul (the Korean alphabet) and hanja (Chinese characters as used in Korean):

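- EUC-KR (an encoding of KS C 5601, since renamed KS X 1001) or Microsoft's superset UHC (Unified Hangul Code, code page 949)
- ISO-2022 comes in both -JP and -KR versions; ISO-2022-KR is the Korean one, a 7-bit encoding defined for email in RFC 1557.
- johab is a legacy 16-bit encoding for hangul syllables: the leading bit is 1, followed by three 5-bit fields for the leading consonant, the vowel, and the optional consonant(s) at the end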
http://trade.chonbuk.ac.kr/~leesl/code/johap.gif
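
To make the johab bit layout concrete, here is a minimal Python sketch that pulls the three 5-bit fields out of a two-byte code. The function name split_johab_syllable is made up for illustration, and the sketch assumes the input really is a hangul syllable; it does not try to recognise the hanja and symbol codes, which also have the high bit set.

```python
def split_johab_syllable(pair: bytes) -> tuple[int, int, int]:
    """Return the (initial, vowel, final) 5-bit fields of a 2-byte johab code.

    Hypothetical helper for illustration only. It assumes the two bytes
    encode a hangul syllable and does not detect hanja or symbol codes.
    """
    if len(pair) != 2:
        raise ValueError("expected exactly two bytes")
    code = int.from_bytes(pair, "big")
    if not code & 0x8000:
        raise ValueError("leading bit is not set, so this is not a 16-bit johab code")
    initial = (code >> 10) & 0x1F  # bits 14-10: leading consonant
    vowel = (code >> 5) & 0x1F     # bits 9-5: vowel
    final = code & 0x1F            # bits 4-0: trailing consonant(s), if any
    return initial, vowel, final


if __name__ == "__main__":
    # Python ships a "johab" codec, so a real syllable can be encoded directly.
    encoded = "가".encode("johab")
    print(encoded.hex(), split_johab_syllable(encoded))
```

The three numbers are johab's own index codes, not Unicode jamo values; the table at the URL above is what maps them to actual consonants and vowels.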


The URL above points to a useful table for working with johab. I know it is a legacy charset, but I don't know how widely it is still used. Technically, ASCII is legacy, too. :)

Do we have any local experts on Japanese charsets? If not, I can do a little bit of research there, too.

Cheers,

~kj
