W dniu 2014-09-03 10:58, R.J. Baars pisze:
> To add the words frequencis, I am directed by the wiki to an address where
> there is a frequency list indeed. But only 187000 words; while I have 1.2
> million Dutch words and their frequency myself.

Probably the probabilities of their occurrence is quite low. I tried 
replacing that list with a bigger one for Polish and my results indeed 
made the dictionary file bigger but nothing else changed much.

>
> The frequency is just a number; what is expected there? I this number a
> plain ratio, a occurrence count, or something else, like logarithmic?
> Will I have to convert to that format, or is a plain word<tab>number an
> option too?

Log scale, I believe. You might want to filter out some of the lower 
results, as well, as they don't really help and only make files bigger.

Marcin

>
> Ruud
>
>
> ------------------------------------------------------------------------------
> Slashdot TV.
> Video for Nerds.  Stuff that matters.
> http://tv.slashdot.org/
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>
>


------------------------------------------------------------------------------
Slashdot TV.  
Video for Nerds.  Stuff that matters.
http://tv.slashdot.org/
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to