Manuel Mall wrote:
fop uses a pair based line breaking algorithm based on Unicode Standard Annex #14 (http://www.unicode.org/reports/tr14/). This document actually makes quite a few comments on special cases for the Korean language. Most of that is straight above my head as I have no idea what for example a 'Hangul syllable and conjoining jamo' actually is. It would be really helpful if someone with real knowledge of the Korean language could read and interpret for us those parts of the standard and may be point out where exactly we may be doing something wrong. It sounds to me like Korean has two different ways of linebreaking and possibly our default UAX#14 implementation may only support one.
IIRC I've set the line breaking opportunity to "break" for all pairs of CJK code points, because the original Unicode tables didn't provide a useful value (that's the fall back recommended by the spec in case no better value is available). That's simplistic. Maybe looking at pango and perhaps a variety of other OSS text rendering projects can help. I didn't get enough spare time to do it myself yet. J.Pietschmann --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
