José,

The Lingua::Translate module takes a text and translates it to another language (as in the verb Translate). I'd suspect this is somewhat different and considerably more difficult than guessing, by a statistical approach, whether two existing texts are translations of each other. The latter seems more similar to Text::Compare and Text::Similarity::Overlaps or the following modules that take an existing text and try to guess which language (or charset) it uses:

 Lingua::Identify
 Lingua::Ident
 Lingua::DetectCharset

Also, I feel it's somewhat odd to put two verbs in the module name as in

 Lingua::Translate::Identify
 Lingua::Translate::Compare

--davidm

David Golden wrote:

José Castro wrote:

 - you give it a set of texts (usually two, but can be more)
 - without knowing the language of each text, the module tells you how
   likely it is that those texts are translations of each other

It would probably be something in the "Text" of "Lingua" namespace, and it
would probably include something like "Translations" or "Parallel".


I tend to think of language things in Lingua and text processing in Text, so I'd lean towards Lingua. After that, my first thought was the unwieldy:

Lingua::Translation::Identify

or

Lingua::Translation::Compare

I'm not sure if it makes sense to shorten it to the existing Lingua::Translate namespace for consistency sake:

Lingua::Translate::Identify
Lingua::Translate::Compare

As I look at this, I lean slightly towards "compare" since it definitely compares, but doesn't necessarily "identify".

Best of luck,
David Golden



Reply via email to