In order to move this along, I'll call for a vote.
No use prevaricating about the bush ;-)

--Thilo

Rico Landefeld (JIRA) wrote:
>      [ 
> https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
>  ]
> 
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
> 
>     Attachment: lucene-indexer.tar.gz
> 
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>
>>                 Key: UIMA-1299
>>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>>             Project: UIMA
>>          Issue Type: New Feature
>>          Components: Sandbox
>>            Reporter: Rico Landefeld
>>         Attachments: lucene-indexer.tar.gz
>>
>>
>> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene 
>> index. It is based on a XML-based "mapping configuration  file" in which the 
>> user can determine which UIMA annotations should be put into which Lucene 
>> field, and how this field is set up (e.g. indexed and/or stored). In 
>> addition, some basic functionality for (ontolgical) hypernym indexing is 
>> provided.
>> Additionally, Lucas is able to perform offset-based token stream alignment 
>> and merging of UIMA annotations (via token position increment) in the same 
>> Lucene field (e.g. "documenttext" or "title")
> 

Reply via email to