There are ways to detect which lang is being used based on the encoding and 
word choice and some statistical measurements of the text so its possible to 
detect automaticallly. Its not 100% but is pretty accurate for mainstream 
proper langs. Its gets very complex when you deal with dialects. For example, 
in arabic there is the formal modern standard arabic (.msa) but its not the 
main lang of most arabic speakers - they speak dialects but some have 
significant differences from msa.. Detecting them automatically can be tricky. 
Similar problems exist in all langs.



-----Original Message-----
From: [EMAIL PROTECTED] <[EMAIL PROTECTED]>
To: XMPP Extension Discussion List <[email protected]>
Sent: Mon Feb 18 09:10:57 2008
Subject: Re: [Standards] UPDATED: XEP-0171 (Language Translation)

Well, if sending user does not want to set message language (which is 
understandable, as it is awkward solution), then maybe receiving user 
will do that?
You can provide a way for user to set up translation for any of his 
contacts. I.e. define one (or more) languages that could be used by this 
contact.
Then translator can translate messages from all these languages to 
user-defined language.
User can define one or more languages he understands (with priorities; 
translator would also have it's own priority table to try to translate 
between more similar languages if possible; i.e. if i understand English 
and Russian, then translator will translate from German to English, 
because it's easier).
As i said, translator performs translation for all source languages and 
sends all translated text to user (this is arguable, though, since CPU 
time consumption and bandwidth consumption will increase significantly 
if there is more than one source language). User then can switch 
translations just as he would switch character encoding tables in 
browser: if text is unreadable, then he should try another translation. 
User's client may also use elements of AI to guess which translation is 
the best.
Sender may also specify which language he uses in his messages, but this 
is only a hint for translator.

Reply via email to