> Is it reasonable to guess language information from the target server's
> geographical information?
Yes, it could be another clue for guessing the language. The problem is then how to combine all of these clues.

For instance, the current solution is the simplest one, but certainly not the most efficient: for HTML documents, the HTMLLanguageParser scans the document for possible indications of the content language:

1. html lang attribute
2. meta dc.language
3. meta http-equiv

The first one found is assumed to be the document's language. If none is found, the statistical language identifier is used.

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/
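
For illustration, the lookup order described above (html lang attribute, then meta dc.language, then meta http-equiv, then the statistical identifier) could be sketched roughly like this. This is only a minimal Python sketch, not the actual HTMLLanguageParser code: the names LangHintParser, detect_language and statistical_fallback are hypothetical, and I am assuming that the http-equiv hint means a Content-Language meta tag.

```python
from html.parser import HTMLParser


class LangHintParser(HTMLParser):
    """Collects the three language hints (hypothetical helper, not Nutch code)."""

    def __init__(self):
        super().__init__()
        self.html_lang = None    # <html lang="...">
        self.dc_language = None  # <meta name="dc.language" content="...">
        self.http_equiv = None   # <meta http-equiv="Content-Language" ...> (assumed)

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names; drop empty values.
        a = {k: v for k, v in attrs if v}
        if tag == "html" and self.html_lang is None:
            self.html_lang = a.get("lang")
        elif tag == "meta":
            content = a.get("content")
            if a.get("name", "").lower() == "dc.language":
                if self.dc_language is None:
                    self.dc_language = content
            elif a.get("http-equiv", "").lower() == "content-language":
                if self.http_equiv is None:
                    self.http_equiv = content


def detect_language(html, statistical_fallback=lambda text: None):
    """Return the first hint found, in priority order, else the fallback's guess."""
    parser = LangHintParser()
    parser.feed(html)
    for hint in (parser.html_lang, parser.dc_language, parser.http_equiv):
        if hint:
            return hint.strip().lower()
    # No explicit hint: hand over to the statistical identifier (stubbed here).
    return statistical_fallback(html)
```

A geographical clue from the server could be passed in as one more candidate, but "first one found wins" gives no way to weigh it against the others, which is the limitation mentioned above.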
