Hi, I've updated our language detection from Tika to language-detector (https://github.com/optimaize/language-detector). Language detection should now be more reliable, and usually a short sentence (maybe 20 to 30 characters) should be enough to detect the language correctly.
Unfortunately I had to disable detection of Asturian and Galician, as they are too close to Spanish and having them activated leads to all three languages not being detected properly. In case you're wondering where this feature is used: in the stand-alone GUI it gets activated by checking the "Automatically detect language" check box, on the command line it gets activated with the -adl option. Regards Daniel ------------------------------------------------------------------------------ Dive into the World of Parallel Programming! The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel