[ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jörn Kottmann updated UIMA-1299:
--------------------------------
Component/s: Sandbox-Lucas (was: Sandbox)
> Contribution of Lucene CAS Indexer
> ----------------------------------
>
> Key: UIMA-1299
> URL: https://issues.apache.org/jira/browse/UIMA-1299
> Project: UIMA
> Issue Type: New Feature
> Components: Sandbox-Lucas
> Reporter: Rico Landefeld
> Assignee: Jörn Kottmann
> Attachments: lucene-indexer.tar.gz, pom.xml
>
>
> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene
> index. It is driven by an XML-based "mapping configuration file" in which the
> user specifies which UIMA annotations should be put into which Lucene
> field, and how this field is set up (e.g. indexed and/or stored). In
> addition, some basic functionality for (ontological) hypernym indexing is
> provided.
> Lucas can also perform offset-based token stream alignment and merging of
> UIMA annotations (via token position increment) in the same Lucene field
> (e.g. "documenttext" or "title").
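
To illustrate the offset-based alignment mentioned in the description, here is a minimal sketch in plain Java. It is not taken from the Lucas attachment and does not use the Lucene API; the class TokenStreamMergeSketch, the Token type and the merge method are hypothetical names chosen for this example. It assumes the rule is simply: sort all tokens by start offset, and give a token a position increment of 0 when it starts at the same offset as the token emitted just before it, so that both occupy the same position in the field.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

/** Illustrative only: hypothetical types, not Lucas code and not the Lucene API. */
final class TokenStreamMergeSketch {

    /** A token with character offsets and a position increment. */
    static final class Token {
        final String text;
        final int start;
        final int end;
        final int positionIncrement;

        Token(String text, int start, int end, int positionIncrement) {
            this.text = text;
            this.start = start;
            this.end = end;
            this.positionIncrement = positionIncrement;
        }

        @Override
        public String toString() {
            return text + "[" + start + "," + end + ")+" + positionIncrement;
        }
    }

    /**
     * Merges several token streams into one, ordered by start offset.
     * Assumed rule: a token starting at the same offset as the previously
     * emitted token gets increment 0 (same position); all others get 1.
     */
    static List<Token> merge(List<List<Token>> streams) {
        List<Token> all = new ArrayList<>();
        for (List<Token> stream : streams) {
            all.addAll(stream);
        }
        // List.sort is stable, so tokens from earlier streams win ties.
        all.sort(Comparator.comparingInt((Token t) -> t.start));

        List<Token> merged = new ArrayList<>();
        int previousStart = -1;
        for (Token t : all) {
            boolean sameOffset = !merged.isEmpty() && t.start == previousStart;
            merged.add(new Token(t.text, t.start, t.end, sameOffset ? 0 : 1));
            previousStart = t.start;
        }
        return merged;
    }

    public static void main(String[] args) {
        // Word tokens and an entity annotation covering the first word.
        List<Token> words = Arrays.asList(
                new Token("aspirin", 0, 7, 1),
                new Token("inhibits", 8, 16, 1));
        List<Token> entities = Arrays.asList(new Token("DRUG", 0, 7, 1));

        for (Token t : merge(Arrays.asList(words, entities))) {
            System.out.println(t);
        }
    }
}
```

In this sketch the "DRUG" annotation ends up with increment 0 directly after "aspirin", so a phrase or position-aware query on the field could match either token at that position. Real alignment may need to handle overlapping or nested annotations, which this example does not attempt.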
--
This message is automatically generated by JIRA.