RE: [PHP-I18N] Language detection

Ed Batutis Wed, 13 Aug 2008 07:51:06 -0700

> Thanks Ed. Unless I've misunderstood, this is just doing charset
> detection, with language as a bonus when the charset implies it?


That wouldn't be very useful. No, it uses recognizers for charset/language
combinations.

> difference between say English, French and German, all in UTF-8
> encoding, please let me know.

It does not have data to do any utf-8 language detection, but the structure
is in place.

You might want to consider adding data to their framework for what you want
to do. It isn't complicated. The most important thing you need is good
sample text in quantity so you can generate the n-gram probability table.

I believe the code was taken from Mozilla, so you might look there. Maybe
they've already done what you are looking for.

=Ed



-- 
PHP Unicode & I18N Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

RE: [PHP-I18N] Language detection

Reply via email to