I know TeX has an algorithm for hyphenating words, and fop (Formatting Objects Processor, on the Apache site) uses the same algorithm. Since both are open source, might it be possible to port that algorithm to Python and use the same pattern files that TeX and fop use? If that turns out to be possible, then multiple languages are already taken care of, since I know fop, at least, has hyphenation files for many different languages.
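To make the idea concrete, here is a minimal sketch of how such a port might look. It assumes the pattern notation TeX uses, where digits between letters mark candidate break points, the highest digit at each position wins, and odd values allow a break while even values forbid one. The names `parse_pattern`, `hyphenate`, and `DEMO_PATTERNS` are made up for this example, and the pattern list is only a handful of entries, just enough for one demonstration word. A real port would load the full pattern files and would also need to handle exception lists.

```python
def parse_pattern(pattern):
    """Split a TeX-style pattern such as 'hy3ph' into letters and gap weights.

    weights[i] is the weight of the gap just before letters[i];
    the final entry is the gap after the last letter.
    """
    letters = []
    weights = [0]
    for ch in pattern:
        if ch.isdigit():
            weights[-1] = int(ch)
        else:
            letters.append(ch)
            weights.append(0)
    return "".join(letters), weights


def hyphenate(word, patterns, min_left=2, min_right=3):
    """Return the pieces of `word` between permitted hyphenation points.

    min_left and min_right are the minimum piece sizes at either end
    (hypothetical parameter names; the defaults are an assumption).
    """
    table = dict(parse_pattern(p) for p in patterns)
    padded = "." + word.lower() + "."      # dots mark the word boundaries
    points = [0] * (len(padded) + 1)       # one weight per gap in padded

    # Try every substring; where a pattern matches, keep the larger weight.
    for start in range(len(padded)):
        for end in range(start + 1, len(padded) + 1):
            weights = table.get(padded[start:end])
            if weights:
                for offset, w in enumerate(weights):
                    pos = start + offset
                    points[pos] = max(points[pos], w)

    # An odd weight permits a break. The gap before word[k] is points[k + 1].
    pieces = []
    last = 0
    for k in range(min_left, len(word) - min_right + 1):
        if points[k + 1] % 2 == 1:
            pieces.append(word[last:k])
            last = k
    pieces.append(word[last:])
    return pieces


# Illustrative subset only, not a complete pattern file.
DEMO_PATTERNS = ["hy3ph", "he2n", "hena4", "hen5at", "1na",
                 "n2at", "1tio", "2io", "o2n"]

print("-".join(hyphenate("hyphenation", DEMO_PATTERNS)))
```

The nested substring loop is the simplest thing that works; with a full pattern file a trie would probably be a better fit than a plain dict.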
I also like your idea of only hyphenating long words. Perhaps it could be a word-length limit that is specifiable from the command line?

- Jamis

On Tue, 2002-04-30 at 13:26, Bill Janssen wrote:
> Yes, I think this is the right way to do it. But we'd need an
> algorithm (dictionary-based?) to get the insertion of those soft
> hyphens right. Anyone know what the right one to use is? Let's
> just start with English...
>
> Bill
>

-- 
Jamis Buck
[EMAIL PROTECTED]
http://hippa-potta.jamisandtarasine.net
"I'd horsewhip you if I had a horse." -- Groucho Marx
