Hi Suneel,

On 9 October 2013 14:27, Suneel Marthi <[email protected]> wrote:
> an example of a Naive-Bayes classifier trained on character n-grams is the 
> LangDetect library.
> (see http://code.google.com/p/language-detection/)
>
> Agree with Ted that it should be relatively easy to build one.
>

Thanks. Yes, I need to (re-)train a language detector. We have an
existing system based on an earlier version of Mahout, and I'm
looking to switch it from word tokens to character n-grams.
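
To make the distinction concrete, here is a minimal sketch of the kind of
character n-gram features I have in mind. The function names, the
lowercasing, and the single-space padding at the edges are my own
illustrative assumptions; this is not the Mahout or LangDetect API.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of text.

    Assumption: the text is lowercased and padded with one space on each
    side so that word-initial and word-final characters get their own
    n-grams.
    """
    padded = " " + text.lower() + " "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def ngram_counts(text, n=3):
    """Count the character n-grams of text, for use as classifier features."""
    return Counter(char_ngrams(text, n))
```

The counts would then stand in for the word-token counts as input to the
Naive Bayes trainer.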

Dean.
