Julian Reschke wrote:
I think the non-letter handling in LineArea's hyphenation routine needs to
enhanced.
A lot of people think so. The bad news is that this is the wrong
approach.
There is a relevant standard which guides line breaking: Unicode
UAX#14. Implementing it would take care of your problem as well
as of "hyphenation" for far east languages and a few other things.
The obstacles are 1. the line breaking code in the maintenance
branch is ... messy, and it wont be updated anyway and 2. grokking
UAX#14 requires swapping everything out of your brain, including
the names of your spouse and your cat, and generally doesn't work
well in warm weather.
The next problem is the hyphenation algorithm itself. Problem
zones:
- Gathering characters into words for hyphenation. This should
be controlled by the hyphenator, but it isn't.
- Character normalization. Look into the relevant report.
- Incremental pattern matching for sort of parallel hyphenation,
resulting in proper handling of spelling changing exceptions
without much overhead.
Contributions are welcome.
J.Pietschmann
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]