Luca Furini wrote:
note that a word with a soft hyphen in its middle would not be hyphenated, unless we ignore this character when collecting word fragments
Well, in order to prepare for hyphenation, other characters like joiners has to be removed too. We should probably also use Unicode normalization. J.Pietschmann
