Quoting Daniel Naber <[EMAIL PROTECTED]>: > On Wednesday 10 August 2005 18:12, Thomas Lange wrote: > > > As for how to detect the language of word or sentence one thing one > > might do is to break down the text into single words and for all those > > build n-grams (for example tri-grams) count all of them and assign them > > probabilities for occurence in the text. > > Code for such implementations is already available, eg. here (although this > one is in Java): > http://issues.apache.org/bugzilla/show_bug.cgi?id=26763
Hi, I think, Libtextcat is ready for use: http://software.wise-guys.nl/libtextcat/ Regards Laci > > Regards > Daniel > > -- > http://www.danielnaber.de > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > ---------------------------------------------------------------- This message was sent using IMP, the Internet Messaging Program. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
