Tomohiro KUBOTA writes:
> Though I have not read any technical report or so on line-breaking,
> there are some Hiragana/Katakana letters which must not be put on the
> start of a line. This is just like a close parenthesis ")" cannot start
> a line. The followings are examples of such letters.
> - U+3041 HIRAGANA LETTER SMALL A
> - U+3043 HIRAGANA LETTER SMALL I
> - U+3045 HIRAGANA LETTER SMALL U
> - U+3047 HIRAGANA LETTER SMALL E
> - U+3049 HIRAGANA LETTER SMALL O
> - U+3063 HIRAGANA LETTER SMALL TSU
> - U+309B KATAKANA-HIRAGANA VOICED SOUND MARK
> - U+309C KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
> - U+309D HIRAGANA ITERATION MARK
> - U+309D HIRAGANA VOICED ITERATION MARK
> - U+30A1 KATAKANA LETTER SMALL A
> - U+30A3 KATAKANA LETTER SMALL I
> - U+30A5 KATAKANA LETTER SMALL U
> - U+30A7 KATAKANA LETTER SMALL E
> - U+30A9 KATAKANA LETTER SMALL O
> - U+30C3 KATAKANA LETTER SMALL TSU
> - U+30E3 KATAKANA LETTER SMALL YA
> - U+30E5 KATAKANA LETTER SMALL YU
> - U+30E7 KATAKANA LETTER SMALL YO
> - U+30F5 KATAKANA LETTER SMALL KA
> - U+30F6 KATAKANA LETTER SMALL KE
> - U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK
> - U+30FD KATAKANA ITERATION MARK
> - U+30FE KATAKANA VOICED ITERATION MARK
Thanks. Your list agrees with category "NS" (non-starters) in the
Unicode TR #14. linebreak-0.2 will support them. Due to a mistake,
linebreak-0.1 doesn't support them all.
Bruno
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/