Re: supporting XIM

Jungshik Shin Sun, 30 Mar 2003 18:29:57 -0800

Edward Cherlin wrote:

On Sunday 30 March 2003 03:26 am, Jungshik Shin wrote:

The wish list for modern writing systems is mainly made up of systems with complex rendering.
Some of Indic (but some is already done)
Sinhalese
Burmese
Cambodian
Laotian
Tibetan
Mongolian
Thaana and Ethiopic are not difficult,

The way Ethiopic is encoded in Unicode (as 'syllabary' instead of as an 'alphabet' ), I don't think Ethiopic counts as a complex script. It could have been if the encoding model used for Ethiopic were like that of Indic scripts.

but need somebody who wants to work on them. Cherokee, CAS, and some others fall into the same category.

Mandrake Linux provides keyboard support for Cyrillic, Greek, Israeli Hebrew, Armenian, Georgian, Bengali, Devanagari, Gujarati, Gurmukhi, Tamil, Thai, Laotian, and Burmese, but not Arabic. There is a lack of rendering for Burmese, but I have not had problems typing Sanskrit. Not all of the conjuncts exist in the fonts available, but that is not the fault of the apps.

On the other hand, all these scripts are not supported in text-terminal based programs and it's not even clear what to do in that situation.

I can't test some of the others myself, and haven't heard any detailed information on them. I have not found any problems with diacritics in Latin and Cyrillic.

Well, you do have problems with characters with diacritics in Latin,Greek and Cyrillic for which Unicode does NOT have assigned and will NEVER assign separate codepoints. That's what I was talking about. There are tens , if not hundreds, of combinations (base character + one or more diacritic mark(s)) that can ONLY be represented by combining character sequences. They're necessary when you deal with Old and Middle English for instance. Pango does not yet have support for those cases (Latin,Greek and Cyrillic). However, Pango is not much behind because it's not much long ago that MS added support for Latin/Greek/Cyrillic combining character support to Uniscribe.

out a way to funnel IME input through the normal character input calls, we might well achieve CJK support in the majority of apps.

 Well right now, the majority of programs in modern Linux
distros DO  work well with CJK IMEs. In case of gtk2
applications, they also work well with any gtk2 input modules
including those for CJK.  Of course, this doesn't mean that
there's very little to  do when it comes to CJ(K) support, but
I don't share Kubota-san's concern.

I have a Chinese HOWTO, but I can't find a Japanese or Korean HOWTO. Any pointers? I can type Chinese with Cangjie, Korean Hangul, and Japanese with romaji conversion in software where I know how to activate them. I would be delighted if I could do it in e-mail.

You don't even need HOWTO documents these days because modern Linux distros come with virtually everything you need for CJK support. (I'm here assumming that you have a pretty good command of CJK languages, which must be the case judging from what you have been writing on this list) I thought you had been on this list for a while and heard of most of things you need for CJK. For Korean, you can either use 'Ami' (when you launch your program under ko_KR.EUC-KR locale or ko_KR.UTF-8 locale. http://kldp.net/projects/ami) or imhangul (http://kldp.net/projects/imhangul) for gtk2 applications. Hopefully, Pablo on this list picked up 'imhangul' I mentioned several times on the list and included in Mandrake 9.1. Even if not, you can just install the rpm available at the site above. In case of Ami, SuSe, RH, Mandrake and others have had it for a couple of years. The same is true of Japanese IMEs and back-end servers like Canna. For gtk2 applications, you may try im-ja (http://im-ja.sourceforge.net).

Jungshik

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Re: supporting XIM

Reply via email to