Re: Word_Break for Hieroglyphs

Martin J. Dürst via Unicode Wed, 20 Dec 2017 01:15:17 -0800

On 2017/12/20 17:46, Richard Wordingham via Unicode wrote:

In an implementation that offered genuine whole word selection, and
thus tackled with the challenges of Chinese, Japanese, Korean and
Vietnamese (both scripts, not just CJKV) as well as Thai, I would
expect the selections to be bounded by word boundaries.  Thus, if the
cited line break (labelled by '6') were not in the text, I would expect
double-clicking on the quadrat G37:Aa13:Aa13 to select all three words.

This may be common knowledge to some, but I just had a Japanese documentopen in MS Word, and tried what happened on double-clicking. What itdoes is select same-script runs. This means that a run of kanji, a runof hiragana, or a run of katakana (interestingly, the (kata)kana lengthmark is treated as a forth script) is selected. This is of course notthe same as words, but it can match, and it comes close in terms ofoffering something for editorial convenience while being easy to implement.


Regards,   Martin.

Re: Word_Break for Hieroglyphs

Reply via email to