2013/4/22 Marcin Miłkowski <[email protected]>

> >
> >   The case-insensitive comparison seems reasonable to me. A case
> > conversion will be almost always a good suggesion and very probably the
> > best suggestion, I think.
>
> But it's still an edit distance. For some languages, there are essential
> differences between uppercase and lowercase words (think of German).
> This should be a parameter in the .info file.
>
>
Ok, This can be language dependent. But remember that the objective of all
this was to find more suggestions beyond distance=1 (which is the limit we
are using now).



> Do you have '_' as the separator in your dictionary? I can see that the
>  separator is defined as '+' for the spelling dictionary. Note that I did
> not test the speller on files with separators at all (just on pure word
> lists).
>

I'm using the tagger dictionary as a speller dictionary, because now it's
better than the hunspell generated word list and that way there is only one
dictionary to be mantained. The files in the hunspell directory were
pending removal. I realize now that that was probably not a good idea.

I have almost finished the multiple character substitution. I think it
works. I will explain it and send the code in another message.

Regards,
Jaume
------------------------------------------------------------------------------
Try New Relic Now & We'll Send You this Cool Shirt
New Relic is the only SaaS-based application performance monitoring service 
that delivers powerful full stack analytics. Optimize and monitor your
browser, app, & servers with just a few lines of code. Try New Relic
and get this awesome Nerd Life shirt! http://p.sf.net/sfu/newrelic_d2d_apr
_______________________________________________
Languagetool-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to