Apart from the trick I am applying now, a good option for more valid output could be to use the words form Wikipedia and Tatoeba as an extra input. If the language is in those databases.
Galician grew to > 3 Mb fast enough when Spanish and Portuguese were used as input. These could also be found in an unmunch of those languages or the LT dictionaries. Whatever is acceptable for the list to create. Ruud ------------------------------------------------------------------------------ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel