> there are rule sets like this for many languages. the English set > consists of 4400, the Norwegian set is 1500. clearly not something to > put in the viewer...
So we could do something in the parser, perhaps. We'd take the rule set and select high-valued splits for long words, putting soft hyphens in those spots. Doesn't seem too tough. Not quite sure how to auto-detect the language, though. Might need a command-line option for that. Bill
