> Is it reasonable to guess language info. from target servers geographical
> info.?

Yes, it could be another clue to guess language.
But the problem is then to find how to use all these indices.

For instance, the actual solution is the easiest one, but certainly not the
more efficient one:
For HTML documents, the HTMLLanguageParser scans HTML documents looking at
possible indications of content language:
1. html lang attribute
2. meta dc.language
3. meta http-equiv
The first one found is assumed to be the document's language.
Then if no language is found, the statistical language identifier is
used....

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/

Reply via email to