Hi,
I would like to pick this discussion up again. Spelling suggestions are one
of the things users notice most, and there is room for improvement. We left
it here:
2013/7/16 Marcin Miłkowski <list-addr...@wp.pl>
> > The next step for improving the suggestions would be to use a list of
> > frequent words. I'm thinking of just a list of manually selected words
> > or at most a few thousand words from a frequency dictionary.
>
> Yes. Frequency dictionaries would be very useful.
>
> I think we can represent frequency classes as ten ranges of percentages
> with 10 ASCII characters (A-K), as this would be in the tradition of the
> fsa encoding. So "A" would be the most common words (like 'the' and 'a'
> in English), etc. I think we don't need to have a better resolution here.
>
> Or we could simply use a numerical percentage in its decimal (rounded)
> representation from 000 to 100. This, however, would make the dictionary
> slightly bigger.
>
>
Using ranges named with characters seems fine to me.
I would like to add one thing: one of these characters (say Z) could mean
"never show this word in the suggestions". This would be useful in the same
way that some words are excluded from the synthesis dictionary and are never
used in synthesis.
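
To make the encoding concrete, here is a rough sketch of how a word could be assigned a class letter. It is only an illustration under my own assumptions: the frequency is given as a rounded percentage from 0 to 100 (100 = most common), the ten classes use the first ten letters starting at A, and Z is the reserved "never suggest" marker. The function name and parameters are made up for this example.

```python
import string

NUM_CLASSES = 10
# Assumption: ten classes use the first ten letters, "A" being the most common.
CLASS_LETTERS = string.ascii_uppercase[:NUM_CLASSES]
# Assumption: "Z" is the reserved marker for words that must never be suggested.
NEVER_SUGGEST = "Z"


def frequency_class(percent, suggestible=True):
    """Map a rounded frequency percentage (0-100) to a class letter."""
    if not suggestible:
        return NEVER_SUGGEST
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    # 100 maps to index 0 ("A"); 0 is clamped into the last class.
    index = min(int((100 - percent) * NUM_CLASSES / 100), NUM_CLASSES - 1)
    return CLASS_LETTERS[index]
```

With this mapping, the dictionary only needs one extra character per entry, and languages without frequency data can simply omit it.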
If we agree on the frequency classes and the exclusion marker, can we move on
to the implementation? I think we need:
- A method for building the dictionary, assuming that it will be used only
for some languages (so it must stay backward compatible).
- A way of using the frequency information when ordering suggestions.
For example:
new distance = current distance * 10 + a number between 0 and 9 (taken from the frequency class letter).
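
As a rough sketch of the ordering idea (not existing LanguageTool code), the formula above could look like this. I am assuming integer edit distances, class letters that map to 0-9 starting at A, and Z as the "never suggest" marker; the function names and the candidate tuples are invented for the example.

```python
# Assumption: "Z" marks words that must never appear in suggestions.
NEVER_SUGGEST = "Z"


def combined_distance(edit_distance, freq_class):
    """new distance = current distance * 10 + frequency rank (0-9)."""
    rank = ord(freq_class) - ord("A")  # "A" -> 0 (most common)
    if not 0 <= rank <= 9:
        raise ValueError("unexpected frequency class: " + freq_class)
    return edit_distance * 10 + rank


def rank_suggestions(candidates):
    """Order (word, edit_distance, freq_class) tuples, best first."""
    # Drop words flagged as never-suggest before scoring.
    visible = [c for c in candidates if c[2] != NEVER_SUGGEST]
    visible.sort(key=lambda c: combined_distance(c[1], c[2]))
    return [word for word, _, _ in visible]


candidates = [
    ("teh", 1, "J"),  # rare word, distance 1  -> 19
    ("the", 1, "A"),  # very common, distance 1 -> 10
    ("tea", 2, "A"),  # very common, distance 2 -> 20
    ("tee", 1, "Z"),  # excluded from suggestions
]
ranked = rank_suggestions(candidates)
```

Because the frequency term is always below 10, a whole-number edit distance still dominates and the frequency only decides between candidates at the same distance.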
Regards,
Jaume Ortolà