Hi,

The hyphenation code in OOo is supposed to work like the one in TeX, but
it doesn't. I evaluated large word lists, and of about 122,000 English
words, 27,000 (roughly 22%) are hyphenated differently in OOo than in TeX.
Some examples:

OOo          TeX
abor-ted     aborted
guard-i-an   guardian
briskness    brisk-ness

The same problem occurs for German words. In most (all?) cases the TeX
hyphenation seems to be correct and the OOo one incorrect.

Luckily Walter Wojcik is working on a new version of the hyphenation code. 
For the record I'll now explain how this automatic evaluation works so it 
can be done again when the new code is ready:

First, build the word lists using hunspell's unmunch.

Then get the OOo hyphenator: download ALTLinux_hyph.zip from this page:
http://lingucomponent.openoffice.org/hyphenator.html
Compile it and call:
./example hyph_en_US.dic wordlist

Get the TeX hyphenation: Build a file "hyphen.tex" with contents like this,
listing all words:

\documentclass{minimal}
\usepackage[latin1]{inputenc}
\usepackage[T1]{fontenc}
\showhyphens{A}
\showhyphens{abalone}
\showhyphens{abalones}
(...all other words here...)
\makeatletter\@@end
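
Writing that file by hand is not practical for 122,000 words, so here is a
rough sketch of how it could be generated. The function names, the
one-word-per-line input format and the latin-1 encoding are my assumptions,
not part of any existing tool:

```python
from pathlib import Path

# Fixed parts of hyphen.tex, copied from the template above.
PREAMBLE = (
    "\\documentclass{minimal}\n"
    "\\usepackage[latin1]{inputenc}\n"
    "\\usepackage[T1]{fontenc}\n"
)
FOOTER = "\\makeatletter\\@@end\n"


def build_tex(words):
    """Return the contents of hyphen.tex for the given words."""
    body = "".join(f"\\showhyphens{{{w}}}\n" for w in words if w)
    return PREAMBLE + body + FOOTER


def write_tex(wordlist_path, tex_path="hyphen.tex", encoding="latin-1"):
    """Read one word per line (assumed format) and write the TeX file."""
    lines = Path(wordlist_path).read_text(encoding=encoding).splitlines()
    words = [line.strip() for line in lines]
    Path(tex_path).write_text(build_tex(words), encoding=encoding)
```

Calling write_tex("wordlist") would then produce hyphen.tex in the current
directory.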

Then call "latex hyphen.tex". The resulting log file (hyphen.log) contains
the hyphenated words plus some debugging output. Remove the debugging
output, and you can then compare both files with diff.
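
The cleanup and comparison could be scripted roughly as follows. This is
only a sketch under two assumptions: that the \showhyphens results appear in
the log on lines of the form "[] \T1/cmr/m/n/10 brisk-ness" (please check
your own log, the font prefix may differ and long lines may be wrapped),
and that the OOo output has already been reduced to one word per line with
"-" marking the break points. Both function names are made up for this
example:

```python
def extract_tex_hyphenations(log_text):
    """Pick the hyphenated words out of a LaTeX log (assumed line format)."""
    result = []
    for line in log_text.splitlines():
        # Assumption: result lines start with "[] \" followed by the font
        # name and then the word; everything else is debugging output.
        if line.startswith("[] \\"):
            parts = line.split()
            if len(parts) >= 3:
                result.append(parts[-1])
    return result


def differing(ooo_words, tex_words):
    """Return (word, ooo, tex) tuples where the two hyphenations differ."""
    # Index the TeX results by the word with all break points removed.
    # Words that contain a literal hyphen are not handled specially here.
    tex_by_word = {h.replace("-", ""): h for h in tex_words}
    diffs = []
    for h in ooo_words:
        plain = h.replace("-", "")
        tex = tex_by_word.get(plain)
        if tex is not None and tex != h:
            diffs.append((plain, h, tex))
    return diffs
```

Unlike a plain diff, this matches the entries by word, so it still works
when the two files are not in the same order.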

Regards
 Daniel

-- 
http://www.danielnaber.de
