In order to move this along, I'll call for a vote. No use prevaricating about the bush ;-)
--Thilo

Rico Landefeld (JIRA) wrote:
> [
> https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> ]
>
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
>
> Attachment: lucene-indexer.tar.gz
>
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>
>> Key: UIMA-1299
>> URL: https://issues.apache.org/jira/browse/UIMA-1299
>> Project: UIMA
>> Issue Type: New Feature
>> Components: Sandbox
>> Reporter: Rico Landefeld
>> Attachments: lucene-indexer.tar.gz
>>
>>
>> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene
>> index. It is based on an XML-based "mapping configuration file" in which the
>> user can determine which UIMA annotations should be put into which Lucene
>> field, and how this field is set up (e.g. indexed and/or stored). In
>> addition, some basic functionality for (ontological) hypernym indexing is
>> provided.
>> Additionally, Lucas is able to perform offset-based token stream alignment
>> and merging of UIMA annotations (via token position increment) in the same
>> Lucene field (e.g. "documenttext" or "title")
>
