ge wrote:
Javier Sola wrote:

We have a small application (also dictionary-based) that goes over an ODF or HTML file and inserts the ZWSPs into the text...

Well, it is not possible to write that application fully correctly.
You can never know where the word boundary is.

For example:
1. "abetter" means someone who abets another person in committing a crime.
2. "a better" means the two words "a" and "better".

If a text contains "abetter", which one is meant?
Even when you look at the surrounding words, the question
can still remain open.
It is not hard to construct lots of such examples...
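
A minimal sketch of the ambiguity described above, assuming a toy word list (the set WORDS and the function all_segmentations are hypothetical names for illustration, not part of any existing tool). It lists every way a string without spaces can be split into dictionary words:

```python
# Toy dictionary, purely illustrative (an assumption, not a real word list).
WORDS = {"a", "abetter", "better"}

def all_segmentations(text, words):
    """Return every way to split `text` into words from `words`."""
    if not text:
        return [[]]  # one way to split the empty string: no words at all
    results = []
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in words:
            # Combine this first word with every split of the remainder.
            for tail in all_segmentations(text[end:], words):
                results.append([head] + tail)
    return results

if __name__ == "__main__":
    for seg in all_segmentations("abetter", WORDS):
        print(" | ".join(seg))
```

With this word list, "abetter" has two valid readings ("a" + "better", and "abetter"), and the dictionary alone gives no way to choose between them.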
This is true. This has always been the criticism of the ICU breaker: it always chooses the longest match. In this case the two words would be kept together and spellchecked together. Going further would require statistical analysis of the context, and even then it would not always be perfect.
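
As a rough illustration of the longest-match behaviour, here is a sketch of a greedy dictionary segmenter that joins the pieces with ZWSP (U+200B). This is not the ICU implementation; the word list and the function names longest_match_segment and insert_zwsp are assumptions made up for this example.

```python
ZWSP = "\u200b"  # zero-width space

# Toy dictionary, purely illustrative (an assumption, not a real word list).
WORDS = {"a", "abetter", "better", "idea"}

def longest_match_segment(text, words=WORDS):
    """Greedy left-to-right split: at each position take the longest
    dictionary word; fall back to a single character if none matches."""
    max_len = max(map(len, words))
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in words:
                out.append(text[i:j])
                i = j
                break
        else:
            out.append(text[i])  # unknown character, emit it alone
            i += 1
    return out

def insert_zwsp(text, words=WORDS):
    """Join the greedy segments with zero-width spaces."""
    return ZWSP.join(longest_match_segment(text, words))

if __name__ == "__main__":
    print(longest_match_segment("abetteridea"))
```

Because the longest candidate always wins, "abetter" is never split into "a" + "better" here, even where that reading was intended. That is the case that would have to be corrected by hand.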

The system can be fixed by hand, by inserting a ZWSP in places where the breaker does not break, but it would be interesting to..



-eleonora


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org



