Maybe it makes a sence to make something like this in Plan9 (an analog
kbmap) for typing complex symbols like an hieroglyph ?

Your method is in essence what Microsoft's IME on Windows and various IMEs on UNIX-likes (such as SCUM) use. However, an IME for inputting from a list of over twenty thousand characters takes quite an effort to devise before it can be practical and useful. Right now even display of CJK is not quite fully supported on any existing FOSS platform (Ruby character display was added to Firefox only somewhere after version 3). Non-integrated pieces of FOSS with great capabilities do exist.

In case of (Simplified and Traditional) Chinese there apparently exist only two successful IMEs out there: one is Microsoft's, the other belongs to a Chinese company that has put lots of money and effort into developing the software. I believe both support input by Pinyin romanization, although I may be wrong. There's also Google's Pinyin IME which was involved in a lawsuit with said Chinese company.

In case of Japanese an IME needs to support three writing systems at once, firstly the two kana, and then transforming from kana to kanji. Abundance of homonyms in Japanese as well as a certain writing strategy called ateji (using kanji for phonetic value rather than semantic value) makes embedding of a dictionary into the IME unavoidable. Good dictionaries for this purpose don't come free--they must either be bought from professional companies or compiled by people who intimately know the language, preferably native speakers. This latter, I believe, is how IMEs on UNIX-likes came to be. Anyhow, Japanese IMEs, too, rely on input based on a romanization of the language. The actual number of distinct kanji required for input of text at a high school literate level is around two thousand--JLPT Level One roughly corresponds to that--but people, of course, expect a much larger dictionary. Microsoft IME also provides semantic aid by offering short descriptions of kanji so that people can decide which corresponds to the meaning they want to convey. Although unnecessary, it is a most welcome addition.

I don't know anything about Korean writing system or IMEs but since CJK ideographs (most importantly Han characters) are involved similar statements may apply.

Overall, there's no easy way that is light on financial and/or human resources--the two types of resources are interchangeable, i.e. if you have an active user base you may be able to avoid expenditure--to put CJK input support into a UI, which is probably why Plan 9 doesn't have that at the moment. It isn't a computer thing--it's a human thing. I might add porting IMEs from some UNIX-like system is probably the best option (for those with the technical prowess).

**********

DISTRACTION

While googling around for the existence of IMEs on Plan 9 I came across this document from 1996 titled "Unicode: Writing in the Global Village:"

Despite these hurdles, Unicode may soon become the most common
multilingual character-coding system. Support for multiple-language use
is quickly growing. New operating systems—AT&T's Plan 9, Windows NT,
Novell's Netware 4.01 Directory Services, Sybase's Gain Momentum, and
Apple's Newton already support Unicode.

--
<http://www.nyu.edu/its/pubs/connect/archives/96fall/hargitaivillage.html>

It's funny how the author assumes display and input are the same thing while they so greatly differ, input being times harder to implement.



--On Friday, September 11, 2009 15:29 +0400 Alexander Sychev <[email protected]> wrote:

Hello!

Some time ago I wrote for inferno an analog of kbmap with an extention -
a  possibility to print complex symbols via sequences of more basic
symbols.
I use it for typing by the russian translit.
Here is a piece of file for my kbmap:
<------------cut --------------->
1       45      0
1       46      'Ц
1       47      'В
1       48      'Б
1       49      'Н
1       50      'М
C       цх      'ч
C       Цх      'Ч
C       сх      'ш
C       Сх      'Ш
C       сцх     'щ
C       Сцх     'Щ
<------------cut--------------->

The latin symbols are mapped to russian when it is possible. Other
russian symbols are presented via sequences of mapped symbols, e.g.
russian symbol  'Ч' [ch] is presented like an sequence of 'ц' [c] и
'х' [h].
A sequence can be broken by pressing any non-symbol key.
There is at least one big disadvantage of this method - the input focus
can be changed, e.g. by mouse. In inferno I didn't resolve this problem,
because /dev/pointer can be opened only once.

Maybe it makes a sence to make something like this in Plan9 (an analog
kbmap) for typing complex symbols like an hieroglyph ?

On Fri, 11 Sep 2009 14:23:02 +0400, erik quanstrom
<[email protected]> wrote:

HI..everyone:
       Is there some ways to input Simplified Chinese in plan 9 ? I
know plan 9 supports Unicode, so it is no questions for plan 9 to
display Simplified Chinese....... and i have seen some pictures on
Internet to prove it...so i have a question like that above...
   I'm looking forward for the answer........... Thanks first......!!!!

the only way to input simplified chinese currently
is to use the general codepoint input method.
<compose> 'x' + four hex digits.  on a pc compose =
<alt>.  that's probably not what you're looking for.
i am not aware that anyone has written an input
method specificly for simplified chinese.

- erik


--
Best regards,
   santucco


Reply via email to