On 2013-04-23 09:34, Jaume Ortolà i Font wrote:
> 2013/4/22 Marcin Miłkowski <[email protected]>
>
> > > The case-insensitive comparison seems reasonable to me. A case
> > > conversion will almost always be a good suggestion, and very
> > > probably the best suggestion, I think.
> >
> > But it's still an edit distance. For some languages, there are essential
> > differences between uppercase and lowercase words (think of German).
> > This should be a parameter in the .info file.
>
> OK, this can be language dependent. But remember that the objective of
> all this was to find more suggestions beyond distance=1 (which is the
> limit we are using now).

I think it's 2 right now.

> > Do you have '_' as the separator in your dictionary? I can see that the
> > separator is defined as '+' for the spelling dictionary. Note that I did
> > not test the speller on files with separators at all (just on pure word
> > lists).
>
> I'm using the tagger dictionary as a speller dictionary, because now
> it's better than the hunspell-generated word list and that way there is
> only one dictionary to be maintained. The files in the hunspell directory
> were pending removal. I realize now that that was probably not a good idea.

It would be a good idea if you had defined the separator correctly :)

> I have almost finished the multiple character substitution. I think it
> works. I will explain it and send the code in another message.

Interesting!

Best,
Marcin

_______________________________________________
Languagetool-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
