Set of Java classes that allow the Lucene search engine to use morphological 
information developed for the Apertium open-source machine translation platform 
(http://www.apertium.org)
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

                 Key: LUCENE-1284
                 URL: https://issues.apache.org/jira/browse/LUCENE-1284
             Project: Lucene - Java
          Issue Type: New Feature
         Environment: Developed under GNU/Linux, but it should work 
on any other Java-compliant platform
            Reporter: Felipe Sánchez Martínez


A set of Java classes that allow the Lucene search engine to use morphological 
information developed for the Apertium open-source machine translation platform 
(http://www.apertium.org). Morphological information is used both to index new 
documents and to process richer queries, in which query terms can be 
constrained by morphological attributes.
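
To illustrate the idea only, here is a minimal, self-contained sketch in plain 
Java. It is not the code attached to this issue and it uses neither the Lucene 
nor the Apertium API: the class MorphIndexSketch, the toy dictionary DICT and 
the tag names ("n", "sg", "pl", "vblex", "past", "pres") are all hypothetical, 
chosen to resemble Apertium-style tags. Each token is indexed under its lemma 
together with its tags, so that a query can name a lemma and, optionally, the 
morphological attributes an occurrence must carry.

```java
import java.util.*;

class MorphIndexSketch {
    /** One analysis of a surface form: lemma plus tags (hypothetical toy data). */
    static final class Analysis {
        final String lemma;
        final Set<String> tags;
        Analysis(String lemma, String... tags) {
            this.lemma = lemma;
            this.tags = new HashSet<>(Arrays.asList(tags));
        }
    }

    // Toy stand-in for a morphological dictionary (assumption: one analysis per form).
    static final Map<String, Analysis> DICT = new HashMap<>();
    static {
        DICT.put("house", new Analysis("house", "n", "sg"));
        DICT.put("houses", new Analysis("house", "n", "pl"));
        DICT.put("sang", new Analysis("sing", "vblex", "past"));
        DICT.put("sings", new Analysis("sing", "vblex", "pres"));
    }

    // Inverted index: lemma -> (document id -> tag set of each occurrence).
    private final Map<String, Map<Integer, List<Set<String>>>> postings = new HashMap<>();

    /** Indexes every token under its lemma; unknown tokens are kept as-is with no tags. */
    void addDocument(int docId, String text) {
        for (String token : text.toLowerCase().split("\\W+")) {
            if (token.isEmpty()) continue;
            Analysis a = DICT.get(token);
            String lemma = (a != null) ? a.lemma : token;
            Set<String> tags = (a != null) ? a.tags : Collections.<String>emptySet();
            postings.computeIfAbsent(lemma, k -> new HashMap<>())
                    .computeIfAbsent(docId, k -> new ArrayList<>())
                    .add(tags);
        }
    }

    /** Returns ids of documents with an occurrence of the lemma carrying all required tags. */
    SortedSet<Integer> search(String lemma, String... requiredTags) {
        SortedSet<Integer> hits = new TreeSet<>();
        List<String> required = Arrays.asList(requiredTags);
        Map<Integer, List<Set<String>>> byDoc =
                postings.getOrDefault(lemma, Collections.emptyMap());
        for (Map.Entry<Integer, List<Set<String>>> e : byDoc.entrySet()) {
            for (Set<String> tags : e.getValue()) {
                if (tags.containsAll(required)) {
                    hits.add(e.getKey());
                    break;
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        MorphIndexSketch idx = new MorphIndexSketch();
        idx.addDocument(1, "She sang in two houses");
        idx.addDocument(2, "He sings in the house");
        System.out.println(idx.search("sing"));          // any form of "sing"
        System.out.println(idx.search("sing", "past"));  // past-tense forms only
    }
}
```

In this sketch a query for the lemma "sing" matches both documents, while 
adding the "past" attribute narrows the match to the document containing 
"sang". A real integration would presumably obtain the analyses from the 
Apertium dictionaries rather than from a hard-coded map.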

The tool makes use of the morphological analyzers and dictionaries developed for 
the open-source machine translation platform Apertium (http://apertium.org) 
and, optionally, the part-of-speech taggers developed for it. Morphological 
dictionaries are currently available for Spanish, Catalan, Galician, Portuguese, 
Aranese, Romanian, French and English. In addition, new dictionaries are being 
developed for Esperanto, Occitan, Basque, Swedish, Danish, Welsh, Polish and 
Italian, among others; we hope that more language pairs will be added to the 
Apertium machine translation platform in the near future.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

