Hi,

On 2013-12-31 13:20, Jaume Ortolà i Font wrote:
> Hi,
>
> In the current implementation the number of possible suggestions grows
> exponentially with the number of replacement pairs, which is not a good
> thing... For "Milkowski" you get 6144 possible suggestions in American
> English. I set a limit of 7 simultaneous replacements per word, which
> (if each replacement site has two alternatives) gives you 2^7=128 possible
> suggestions. But in the case of "Milkowski" in American English, almost
> all letters have 3 or 4 possible replacements (which I didn't foresee), so
> the limit is about 4^7=16384.

I'd get rid of these multiple replacements. They are spurious anyway, I 
guess. Some of them seem to be repeated (I just copied these from 
hunspell). I will look at them again.
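
To make the arithmetic above concrete, here is a minimal Python sketch of
the brute-force expansion. It is not the Morfologik code: the function
names and the `pairs` mapping are hypothetical, and it assumes
single-letter replacement sources only, which is a simplification.

```python
from itertools import product
from math import prod


def candidate_options(word, pairs, max_sites=None):
    """Per-position choices: the original letter plus its replacements.

    `pairs` maps a single letter to a list of replacement strings
    (an assumption for this sketch; real pairs may span several letters).
    Only the first `max_sites` replaceable positions are expanded.
    """
    options, used = [], 0
    for ch in word:
        alts = pairs.get(ch, [])
        if alts and (max_sites is None or used < max_sites):
            options.append([ch, *alts])
            used += 1
        else:
            options.append([ch])
    return options


def count_candidates(word, pairs, max_sites=None):
    """Size of the search space: the product of the per-position choices."""
    return prod(len(o) for o in candidate_options(word, pairs, max_sites))


def generate_candidates(word, pairs, max_sites=None):
    """Brute-force enumeration; every candidate still needs a dictionary lookup."""
    options = candidate_options(word, pairs, max_sites)
    return ["".join(choice) for choice in product(*options)]
```

With one alternative per letter, seven sites give 2^7 = 128 candidates;
with three alternatives per letter the same seven sites give 4^7 = 16384,
so the cap on sites alone does not keep the search small.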

>
> What can we do?
> 1. Limit the number of replacement pairs. In particular, avoid one-letter
> replacements and add more "context" to the pairs.
> 2. Limit the number of possible replacements in a single word. This will
> make the search less exhaustive.
> 3. Change the approach. We should try again to traverse the dictionary
> directly with the replacement pairs. But the implementation is
> difficult. I tried some time ago and got some results, but some
> details were difficult to solve, and I gave up. Perhaps Marcin or Dawid Weiss
> can provide some insight...

This is indeed a bit tricky...
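
For reference, the idea in point 3 can be sketched roughly as follows.
This is only an illustration under stated assumptions: it uses a plain
nested-dict trie instead of the Morfologik FSA, the names `build_trie`,
`walk` and `suggest` are hypothetical, and it ignores the details that
make the real implementation hard. The point is that a branch is
abandoned as soon as the dictionary has no matching path, so candidates
that are not words are never generated.

```python
END = ""  # end-of-word marker; never collides with a one-character key


def build_trie(words):
    """Build a nested-dict trie; a stand-in for the real dictionary automaton."""
    root = {}
    for w in words:
        node = root
        for ch in w:
            node = node.setdefault(ch, {})
        node[END] = True
    return root


def walk(node, s):
    """Follow `s` from `node`; return the node reached, or None if no path."""
    for ch in s:
        node = node.get(ch)
        if node is None:
            return None
    return node


def suggest(word, trie, pairs, max_replacements=7):
    """Dictionary words reachable from `word` through replacement pairs.

    `pairs` is a list of (source, target) strings. At each position we
    either keep the literal letter or apply a pair whose source matches
    there, and we only continue while the trie still has a path.
    """
    results = set()

    def rec(i, node, prefix, used):
        if i == len(word):
            if END in node and prefix != word:
                results.add(prefix)
            return
        nxt = node.get(word[i])  # keep the letter as typed
        if nxt is not None:
            rec(i + 1, nxt, prefix + word[i], used)
        if used < max_replacements:
            for src, dst in pairs:
                # skip empty sources: they would not advance the position
                if src and word.startswith(src, i):
                    nxt = walk(node, dst)
                    if nxt is not None:
                        rec(i + len(src), nxt, prefix + dst, used + 1)

    rec(0, trie, "", 0)
    return sorted(results)
```

For example, with a dictionary of "receive", "relieve" and "believe" and
the pairs ("ie", "ei") and ("ei", "ie"), suggest("recieve", ...) only
follows the single path that leads to "receive".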

Best,
Marcin

>
> Regards,
> Jaume Ortolà
>
>
>
> 2013/12/31 Daniel Naber <daniel.na...@languagetool.org
> <mailto:daniel.na...@languagetool.org>>
>
>     Hi,
>
>     a large number of pairs for 'fsa.dict.speller.replacement-pairs' (in the
>     *.info file) makes creating suggestions very slow. You can reproduce
>     that if your text contains only the word "Milkowski" and you check it
>     with en-US (sorry your name is triggering a bug, Marcin :-). It takes
>     several seconds to finish, while it's fast with en-GB, which has fewer
>     replacement pairs.
>
>     I guess it's so slow that we're back to hunspell performance, which we
>     originally tried to avoid when switching to Morfologik. Is this a bug?
>     Any better idea than just trimming down the number of replacement pairs?
>
>     Regards
>        Daniel
>
>
>     
> ------------------------------------------------------------------------------
>     Rapidly troubleshoot problems before they affect your business. Most IT
>     organizations don't have a clear picture of how application performance
>     affects their revenue. With AppDynamics, you get 100% visibility
>     into your
>     Java,.NET, & PHP application. Start your 15-day FREE TRIAL of
>     AppDynamics Pro!
>     
> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
>     _______________________________________________
>     Languagetool-devel mailing list
>     Languagetool-devel@lists.sourceforge.net
>     <mailto:Languagetool-devel@lists.sourceforge.net>
>     https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>

