Hi Suneel, On 9 October 2013 14:27, Suneel Marthi <[email protected]> wrote: > an example of a Naive-Bayes classifier trained on character n-grams is the > LangDetect library. > (see http://code.google.com/p/language-detection/) > > Agree with Ted that it should be relatively easy to build one. >
Thanks. Yes, I need to (re-)train a language detector. We have an existing system based on an earlier version of Mahout which I'm looking to switch to using character n-grams instead of word tokens. Dean.
