Hi,

On 2013-12-31 13:20, Jaume Ortolà i Font wrote:
> Hi,
>
> In the current implementation the number of possible suggestions grows
> exponentially with the replacement pairs, which is not a good thing...
> For "Milkowski" you get 6144 possible suggestions in American English. I
> fixed a limit of 7 possible simultaneous replacements in a word, which
> (if the replacements are one to two) gives you 2^7=128 possible
> suggestions. But in the case of "Milkowski" in American English, almost
> all letters have 3 or 4 possible replacements (which I didn't foresee), so
> the limit is about 4^7=16384.
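(An illustration of the arithmetic quoted above, not code from LanguageTool or Morfologik. It is a minimal sketch that assumes single-character replacement keys and a simple cap on how many positions may vary. The class name CandidateCount, the method countCandidates and the placeholder alternatives "x", "y", "z" are all hypothetical. Each position that may vary contributes one choice for keeping the letter plus one per alternative, and the choices multiply.)

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class CandidateCount {
    // Upper bound on the candidate strings for a word: every position that has
    // alternatives multiplies the total by (1 + number of alternatives).
    // Only the first `limit` such positions are allowed to vary.
    static long countCandidates(String word, Map<Character, List<String>> pairs, int limit) {
        long total = 1;
        int used = 0;
        for (char c : word.toCharArray()) {
            int alternatives = pairs.getOrDefault(c, List.of()).size();
            if (alternatives == 0 || used == limit) {
                continue; // nothing to replace here, or the cap is reached
            }
            total *= 1 + alternatives;
            used++;
        }
        return total;
    }

    public static void main(String[] args) {
        // Hypothetical pairs, NOT the real en-US replacement list.
        Map<Character, List<String>> one = new HashMap<>();
        Map<Character, List<String>> three = new HashMap<>();
        for (char c : "milkowski".toCharArray()) {
            one.put(c, List.of("x"));
            three.put(c, List.of("x", "y", "z"));
        }
        System.out.println(countCandidates("milkowski", one, 7));   // 2^7 = 128
        System.out.println(countCandidates("milkowski", three, 7)); // 4^7 = 16384
    }
}
```

With one alternative per letter the cap of 7 gives 128 candidates. With three alternatives per letter the same cap already allows 16384, so the cap alone does not keep the search small once the pair list is large.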

I'd get rid of these multiple replacements. They are spurious anyway, I
guess. Some of them seem to be repeated (I just copied these from
hunspell). I will look at them again.

>
> What can we do?
> 1. Limit the number of replacement pairs. In particular, avoid
> one-letter replacements. Add more "context".
> 2. Limit the number of possible replacements in a single word. This will
> make the search less exhaustive.
> 3. Change the approach. We should try again to traverse the dictionary
> directly with the replacement pairs. But the implementation is
> difficult. I tried some time ago, and got some results. But some
> details were difficult to solve, and I gave up. Perhaps Marcin or
> Dawid Weiss can provide some insight...

This is indeed a bit tricky...

Best,
Marcin

>
> Regards,
> Jaume Ortolà
>
>
>
> 2013/12/31 Daniel Naber <daniel.na...@languagetool.org
> <mailto:daniel.na...@languagetool.org>>
>
>     Hi,
>
>     a large number of pairs for 'fsa.dict.speller.replacement-pairs' (in the
>     *.info file) makes creating suggestions very slow. You can reproduce
>     that if your text contains only the word "Milkowski" and you check it
>     with en-US (sorry your name is triggering a bug, Marcin :-). It takes
>     several seconds to finish, while it's fast with en-GB, which has fewer
>     replacement pairs.
>
>     I guess it's so slow that we're back to hunspell performance, which we
>     originally tried to avoid when switching to Morfologik. Is this a bug?
>     Any better idea than just trimming down the number of replacement pairs?
>
>     Regards
>      Daniel
>
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel