W dniu 2014-09-03 10:58, R.J. Baars pisze: > To add the words frequencis, I am directed by the wiki to an address where > there is a frequency list indeed. But only 187000 words; while I have 1.2 > million Dutch words and their frequency myself.
Probably the probabilities of their occurrence is quite low. I tried replacing that list with a bigger one for Polish and my results indeed made the dictionary file bigger but nothing else changed much. > > The frequency is just a number; what is expected there? I this number a > plain ratio, a occurrence count, or something else, like logarithmic? > Will I have to convert to that format, or is a plain word<tab>number an > option too? Log scale, I believe. You might want to filter out some of the lower results, as well, as they don't really help and only make files bigger. Marcin > > Ruud > > > ------------------------------------------------------------------------------ > Slashdot TV. > Video for Nerds. Stuff that matters. > http://tv.slashdot.org/ > _______________________________________________ > Languagetool-devel mailing list > Languagetool-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/languagetool-devel > > ------------------------------------------------------------------------------ Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel