On Sunday 30 March 2003 03:26 am, Jungshik Shin wrote:
The wish list for modern writing systems is mainly made up of systems with complex rendering.
Some of Indic (but some is already done) Sinhalese Burmese Cambodian Laotian Tibetan Mongolian
Thaana and Ethiopic are not difficult,
The way Ethiopic is encoded in Unicode (as 'syllabary' instead of as an
'alphabet' ), I don't think Ethiopic counts as a complex script. It could have
been if the encoding model used for Ethiopic were like that of Indic scripts.
but need somebody who wants to work on them. Cherokee, CAS, and some others fall into the same category.On the other hand, all these scripts are not supported in text-terminal based programs
Mandrake Linux provides keyboard support for Cyrillic, Greek, Israeli Hebrew, Armenian, Georgian, Bengali, Devanagari, Gujarati, Gurmukhi, Tamil, Thai, Laotian, and Burmese, but not Arabic. There is a lack of rendering for Burmese, but I have not had problems typing Sanskrit. Not all of the conjuncts exist in the fonts available, but that is not the fault of the apps.
and it's not even clear what to do in that situation.
I can't test some of the others myself, and haven't heard any detailed information on them. I have not found any problems with diacritics in Latin and Cyrillic.
Well, you do have problems with characters with diacritics in Latin,Greek and Cyrillic for which
Unicode does NOT have assigned and will NEVER assign separate codepoints. That's
what I was talking about. There are tens , if not hundreds, of combinations
(base character + one or more diacritic mark(s)) that can ONLY be represented by
combining character sequences. They're necessary when you deal with
Old and Middle English for instance. Pango does not yet have support for those
cases (Latin,Greek and Cyrillic). However, Pango is not much behind because
it's not much long ago that MS added support for Latin/Greek/Cyrillic
combining character support to Uniscribe.
out a way to funnel IME input through the normal character
input calls, we might well achieve CJK support in the
majority of apps.
Well right now, the majority of programs in modern Linux distros DO work well with CJK IMEs. In case of gtk2 applications, they also work well with any gtk2 input modules including those for CJK. Of course, this doesn't mean that there's very little to do when it comes to CJ(K) support, but I don't share Kubota-san's concern.
I have a Chinese HOWTO, but I can't find a Japanese or Korean HOWTO. Any pointers? I can type Chinese with Cangjie, Korean Hangul, and Japanese with romaji conversion in software where I know how to activate them. I would be delighted if I could do it in e-mail.
You don't even need HOWTO documents these days because modern Linux
distros come with virtually everything you need for CJK support. (I'm here
assumming that you have a pretty good command of CJK languages, which
must be the case judging from what you have been writing on this list)
I thought you had been on this list for a while and heard of most of things you need
for CJK. For Korean, you can either use 'Ami' (when you launch your
program under ko_KR.EUC-KR locale or ko_KR.UTF-8 locale.
http://kldp.net/projects/ami) or imhangul (http://kldp.net/projects/imhangul)
for gtk2 applications. Hopefully, Pablo on this list picked up 'imhangul' I mentioned several
times on the list and included in Mandrake 9.1. Even if not, you
can just install the rpm available at the site above. In case of Ami,
SuSe, RH, Mandrake and others have had it for a couple of years.
The same is true of Japanese IMEs and back-end servers like
Canna. For gtk2 applications, you may try im-ja (http://im-ja.sourceforge.net).
Jungshik
-- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
