a_l.delmelle wrote in a Bugzilla entry:
Hyphenation is, in fact, only applicable to pure alphabetical characters.
Well, no. The pattern-based hyphenator can deal with any Unicode character (apart from digits, whitespace and the dot, which have a special meaning in the pattern definitions). If the word parser used the character classes from the active pattern file to parse words, basically anything could be used. This would only need a proper interface for retrieving the character classes. The class canonicalization could even be folded into the parsing process for better performance.

J.Pietschmann
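
To make the suggestion concrete, here is a minimal Java sketch of a word parser driven by character classes, with canonicalization folded into the same pass. It is only an illustration under assumptions: the class name ClassBasedWordParser and its methods are hypothetical and not an existing API, and the classes are assumed to arrive as whitespace-separated groups in which the first character of each group is the canonical form (e.g. "aA bB").

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: splits text into words using only the character
// classes of a pattern file, canonicalizing each character on the way.
public class ClassBasedWordParser {
    // Maps every code point listed in a class to that class's first
    // (canonical) code point. Anything not in the map is a word boundary.
    private final Map<Integer, Integer> canonical = new HashMap<>();

    // Assumed input format: whitespace-separated groups such as "aA bB",
    // where the first code point of each group is the canonical one.
    public ClassBasedWordParser(String classes) {
        for (String group : classes.trim().split("\\s+")) {
            if (group.isEmpty()) {
                continue;
            }
            int first = group.codePointAt(0);
            group.codePoints().forEach(cp -> canonical.put(cp, first));
        }
    }

    // Returns the canonicalized words found in the text. A word is a
    // maximal run of code points that belong to some character class.
    public List<String> words(String text) {
        List<String> result = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            Integer mapped = canonical.get(cp);
            if (mapped != null) {
                // Known character: append its canonical form directly.
                current.appendCodePoint(mapped);
            } else if (current.length() > 0) {
                // Unknown character ends the current word.
                result.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            result.add(current.toString());
        }
        return result;
    }
}
```

With classes such as "aA bB cC '", the apostrophe counts as a word character simply because the pattern file lists it, while digits and spaces end a word because no class contains them. A real implementation would also have to keep the original characters and offsets so that hyphenation points can be mapped back to the source text; the sketch leaves that out.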