Set of Java classes that allow the Lucene search engine to use morphological
information developed for the Apertium open-source machine translation platform
(http://www.apertium.org)
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Key: LUCENE-1284
URL: https://issues.apache.org/jira/browse/LUCENE-1284
Project: Lucene - Java
Issue Type: New Feature
Environment: New feature developed under GNU/Linux, but it should work
in any other Java-compliance platform
Reporter: Felipe Sánchez Martínez
Set of Java classes that allow the Lucene search engine to use morphological
information developed for the Apertium open-source machine translation platform
(http://www.apertium.org). Morphological information is used to index new
documents and to process smarter queries in which morphological attributes can
be used to specify query terms.
The tool makes use of morphological analyzers and dictionaries developed for
the open-source machine translation platform Apertium (http://apertium.org)
and, optionally, the part-of-speech taggers developed for it. Currently there
are morphological dictionaries available for Spanish, Catalan, Galician,
Portuguese,
Aranese, Romanian, French and English. In addition new dictionaries are being
developed for Esperanto, Occitan, Basque, Swedish, Danish,
Welsh, Polish and Italian, among others; we hope more language pairs to be
added to the Apertium machine translation platform in the near future.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]