I am myself also maintaining the Hunspell list, with all otheres of the OpenTaal club. But from the words reported as 'incorrect', I coclude that tokenizing is different than what the spellchecker expects. I get spellingerrors on words with: a / in it a ' at the start a . at the end
That I don't understand. Apart from this, I still would like to knwo how to build a Java dictionary ;-) Ruud > On 2014-09-01 07:49, R.J. Baars wrote: > >> Daniel, if I am correct, Hunspell is not used for the Dutch LT, but the >> java speller using the list I supplied a long time ago, right? > > No, it uses hunspell with these *.dic and *.aff files: > https://github.com/languagetool-org/languagetool/tree/master/languagetool-language-modules/nl/src/main/resources/org/languagetool/resource/nl/hunspell > >> It would be great If I could reproduce the steps for this spellchecker >> and >> enhance it a bit; the list that is. > > The "original" hunspell dictionary should be changed, i.e. we should > make sure all changes in LT find their way back to the hunspell > dictionary. > >> Second, there is this 'to be ignored list, auto generated. How come? >> These >> are actual non-Dutch words. How come they are auto-generated into an >> ignore list? > > These are extracted from suggestions in grammar.xml. > > Regards > Daniel > > > ------------------------------------------------------------------------------ > Slashdot TV. > Video for Nerds. Stuff that matters. > http://tv.slashdot.org/ > _______________________________________________ > Languagetool-devel mailing list > Languagetool-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/languagetool-devel > ------------------------------------------------------------------------------ Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel