> Thanks Ed. Unless I've misunderstood, this is just doing charset > detection, with language as a bonus when the charset implies it?
That wouldn't be very useful. No, it uses recognizers for charset/language combinations. > difference between say English, French and German, all in UTF-8 > encoding, please let me know. It does not have data to do any utf-8 language detection, but the structure is in place. You might want to consider adding data to their framework for what you want to do. It isn't complicated. The most important thing you need is good sample text in quantity so you can generate the n-gram probability table. I believe the code was taken from Mozilla, so you might look there. Maybe they've already done what you are looking for. =Ed -- PHP Unicode & I18N Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php