[PHP] return language of a word

2008-09-29 Thread shahrzad khorrami
hi all,

is there any function to return us the lanuage of a word in the sentence?

for example : My name is شهرزاد .

when it sees شهرزاد notice that is a persian language.


Thanks


Re: [PHP] return language of a word

2008-09-29 Thread Robin Vickery
2008/9/29 shahrzad khorrami [EMAIL PROTECTED]:
 hi all,

 is there any function to return us the lanuage of a word in the sentence?

 for example : My name is شهرزاد .

 when it sees شهرزاد notice that is a persian language.

As others have said, you can check what unicode block the characters are from.

For segments of text that are a little longer (one or two sentences),
you can use n-gram based language identification, which can sometimes
be spookily accurate. If you want to give that a go, there's a Pear
package called Text::LanguageDetect which will do that
[http://pear.php.net/package/Text_LanguageDetect]. It doesn't have
trigrams for Persian in the lang.dat, but I don't imagine it would be
too hard to add them, if that's what you need.

-robin