I know TeX has an algorithm for hyphenating words, and fop (Formatting
Objects to PDF, on the Apache site) uses the same algorithm.  Seeing how
it's all open-source, mightn't it be possible to port that algorithm to
Python and use the same pattern files that TeX and fop use?  If that
turns out to be possible, then you've already got multiple languages
taken care of, since I know fop, at least, has a many different language
hyphenation files.

I also like your idea of only hyphenating long words.  Perhaps it could
be a word-length limit that is specifiable from the command line?

- Jamis

On Tue, 2002-04-30 at 13:26, Bill Janssen wrote:
> Yes, I think this is the right way to do it.  But we'd need an
> algorithm (dictionary-based?) to get the insertion of those soft
> hyphens right.  Anyone know what the right one to use is?  Let's
> just start with English...
> 
> Bill
> 
-- 
Jamis Buck
[EMAIL PROTECTED]
http://hippa-potta.jamisandtarasine.net
.
"I'd horsewhip you if I had a horse." -- Groucho Marx

Reply via email to