José,
The Lingua::Translate module takes a text and translates it to another
language (as in the verb Translate). I'd suspect this is somewhat
different and considerably more difficult than guessing, by a
statistical approach, whether two existing texts are translations of
each other. The latter seems more similar to Text::Compare and
Text::Similarity::Overlaps or the following modules that take an
existing text and try to guess which language (or charset) it uses:
Lingua::Identify
Lingua::Ident
Lingua::DetectCharset
Also, I feel it's somewhat odd to put two verbs in the module name as in
Lingua::Translate::Identify
Lingua::Translate::Compare
--davidm
David Golden wrote:
José Castro wrote:
- you give it a set of texts (usually two, but can be more)
- without knowing the language of each text, the module tells you how
likely it is that those texts are translations of each other
It would probably be something in the "Text" of "Lingua" namespace,
and it
would probably include something like "Translations" or "Parallel".
I tend to think of language things in Lingua and text processing in
Text, so I'd lean towards Lingua. After that, my first thought was
the unwieldy:
Lingua::Translation::Identify
or
Lingua::Translation::Compare
I'm not sure if it makes sense to shorten it to the existing
Lingua::Translate namespace for consistency sake:
Lingua::Translate::Identify
Lingua::Translate::Compare
As I look at this, I lean slightly towards "compare" since it
definitely compares, but doesn't necessarily "identify".
Best of luck,
David Golden