On Sunday 30 March 2003 06:29 pm, Jungshik Shin wrote: > Edward Cherlin wrote: > >On Sunday 30 March 2003 03:26 am, Jungshik Shin wrote: > > > > > >The wish list for modern writing systems is mainly made up of > >systems with complex rendering. > > > >Some of Indic (but some is already done) > >Sinhalese > >Burmese > >Cambodian > >Laotian > >Tibetan > >Mongolian > > > >Thaana and Ethiopic are not difficult, > > The way Ethiopic is encoded in Unicode (as 'syllabary' > instead of as an 'alphabet' ), I don't think Ethiopic counts > as a complex script. It could have > been if the encoding model used for Ethiopic were like that of > Indic scripts. > > >but need somebody who > >wants to work on them. Cherokee, CAS, and some others fall > > into the same category. > > > >Mandrake Linux provides keyboard support for Cyrillic, Greek, > >Israeli Hebrew, Armenian, Georgian, Bengali, Devanagari, > >Gujarati, Gurmukhi, Tamil, Thai, Laotian, and Burmese, but > > not Arabic. There is a lack of rendering for Burmese, but I > > have not had problems typing Sanskrit. Not all of the > > conjuncts exist in the fonts available, but that is not the > > fault of the apps. > > On the other hand, all these scripts are not supported in > text-terminal based programs > and it's not even clear what to do in that situation. > > > I > >can't test some of the others myself, and haven't heard any > >detailed information on them. I have not found any problems > > with diacritics in Latin and Cyrillic. > > Well, you do have problems with characters with diacritics > in Latin,Greek and Cyrillic for which > Unicode does NOT have assigned and will NEVER assign separate > codepoints. That's > what I was talking about. There are tens , if not hundreds,
thousands, if not tens of thousands. I'm a mathematician. > of combinations > (base character + one or more diacritic mark(s)) that can ONLY > be represented by combining character sequences. Like this? à̀ It's an a with two accents, and it composes and displays correctly in kwrite and kmail, with one accent above the other. Let's try some more. á̀ế̀̀î́̀ổ́̀̀û̀̀n̂́̀x̂̉́̀̀ Not too bad, except that only the first three accents on each letter are actually displayed, and the dot on the i isn't removed. Curiously, Yudit doesn't handle multiple accents as well as these simple-minded apps do. What do you see in your mail? > They're necessary when you deal > with Old and Middle English for instance. Pango does not yet > have support for those > cases (Latin,Greek and Cyrillic). However, Pango is not much > behind because it's not much long ago that MS added support > for Latin/Greek/Cyrillic combining character support to > Uniscribe. > > >>>out a way to funnel IME input through the normal character > >>>input calls, we might well achieve CJK support in the > >>>majority of apps. > >> > >> Well right now, the majority of programs in modern Linux > >>distros DO work well with CJK IMEs. In case of gtk2 > >>applications, they also work well with any gtk2 input > >> modules including those for CJK. Of course, this doesn't > >> mean that there's very little to do when it comes to CJ(K) > >> support, but I don't share Kubota-san's concern. > >> > >> > >> > >> > >>I have a Chinese HOWTO, but I can't find a Japanese or > >> Korean HOWTO. Any pointers? I can type Chinese with > >> Cangjie, Korean Hangul, and Japanese with romaji conversion > >> in software where I know how to activate them. I would be > >> delighted if I could do it in e-mail. > > You don't even need HOWTO documents these days because > modern Linux distros come with virtually everything you need > for CJK support. (I'm here assumming that you have a pretty > good command of CJK languages, which must be the case judging > from what you have been writing on this list) I thought you > had been on this list for a while and heard of most of things > you need > for CJK. For Korean, you can either use 'Ami' (when you launch > your program under ko_KR.EUC-KR locale or ko_KR.UTF-8 locale. > http://kldp.net/projects/ami) or imhangul > (http://kldp.net/projects/imhangul) > for gtk2 applications. Hopefully, Pablo on this list picked up > 'imhangul' I mentioned several > times on the list and included in Mandrake 9.1. Even if not, > you can just install the rpm available at the site above. In > case of Ami, SuSe, RH, Mandrake and others have had it for a > couple of years. The same is true of Japanese IMEs and > back-end servers like Canna. For gtk2 applications, you may > try im-ja > (http://im-ja.sourceforge.net). The starting point of this discussion was the inability to use Chinese, Korean, and Japanese IMEs in the same locale. I write documents in all three languages, and I would do it more often if it were actually convenient. > Jungshik -- Edward Cherlin Generalist & activist--Linux, languages, literacy and more "A knot! Oh, do let me help to undo it!" --Alice in Wonderland -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
