[ibus-devel] Re: Issue 695 in ibus: Output traditional Chinese characters with simplified Chinese key combinations, vice versa.

ibus Wed, 30 Jul 2014 07:10:48 -0700

Comment #7 on issue 695 by [email protected]: Output traditionalChinese characters with simplified Chinese key combinations, vice versa.

http://code.google.com/p/ibus/issues/detail?id=695


Now what if not using pinyin but for example wubi-jidian86:

In wubi-jidian86.txt the characters 东 and 東 have the entries:

ai      东       111149194
gjii    東       137000000

I.e. one has to tuype different input sequences to get them.

Do I understand this issue correctly, that the reporter wants to type
the input sequence for 东, i.e. type “ai”, but get the traditional
Chinese result 東 even though it has a completely different input
sequence?

This problem is a bit more complicated because simplified
and traditional characters are not always just a pair like
with 东 and 東. For example, the simplified character 台 U+53F0 has
4 traditional variants, one of them is 台 itself (because 台
is both traditional and simplified). The Unihan_Variants.txt
(from http://www.unicode.org/Public/7.0.0/ucd/Unihan.zip) contains:

    U+53F0      kSimplifiedVariant      U+53F0
    U+53F0      kTraditionalVariant     U+53F0 U+6AAF U+81FA U+98B1
    U+6AAF      kSimplifiedVariant      U+53F0
    U+81FA      kSimplifiedVariant      U+53F0
    U+98B1      kSimplifiedVariant      U+53F0

I.e. these 4 characters are all used in traditional Chinese:

    台 U+53F0 檯 U+6AAF 臺 U+81FA 颱 U+98B1

but only 台 is used in simplified Chinese.

Whe wubi-jidian86 codes for these 4 characters are:

    176:ck      台       75648549
   1389:ckf     台       486000000
  96294:sfkf    檯       6490000
  32798:fkpf    臺       32600000
  68265:mtjk    颱       1110000

So when typing “ck” or “ckf” matching 台, it is not clear which
of the 4 traditional characters the user really means if he wants
to type traditional Chinese using the wubi codes for simplified Chinese.

I could possibly implement something as follows:

- add an option “Automatically add SC/TC variants:  [yes/no]”
  to the setup tool
- if that option is set and the user types for example
  “ckf” matching only 台, ibus-table looks up all SC/TC variants
  of this character and finds 檯, 臺, 颱.
  Or, if “ai” has been typed matching 东, ibus-table would find
  the variant 東.
- add all variants found to the candidate list.
- Finally the Chinese mode filter is applied, als always.

For example, if the Chinese mode were set to “traditional” only,
it would then be possible to type “ai” and see only 東
in the candidate list, not 东. After matching 东, 東 would be
added to the candidate list. And finally 东 would be thrown away
by the “only traditional” filter.

Or, the other way round, if the Chinese mode were set to “simplified only”,
one could type “sfkf” for 檯, then 台, 臺, and 颱 would be added to
the candidate list as the variants found. And finally the “simplified
only” filter would throw away all except 台. So one could type
any of the input sequences for one of the characters 台, 檯, 臺, or 颱
and get only 台 in the final result.

But would that really be a useful feature?

I think I can implement that, but it would slow down things a
little bit (not too bad I think, should still be fast enough).
And I guess such a feature should be optional and off by default.

Would such a feature really make sense? Is this worth
implementing?




--

You received this message because this project is configured to send allissue notifications to this address.

You may adjust your notification preferences at:
https://code.google.com/hosting/settings

--
--
You received this message because you are subscribed to the Google
Groups "ibus-devel" group.
iBus project web page: http://code.google.com/p/ibus/
iBus dev group: http://groups.google.com/group/ibus-devel?hl=en

---You received this message because you are subscribed to the Google Groups "ibus-devel" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

[ibus-devel] Re: Issue 695 in ibus: Output traditional Chinese characters with simplified Chinese key combinations, vice versa.

回复