[
https://issues.apache.org/jira/browse/UIMA-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14709011#comment-14709011
]
Peter Klügl commented on UIMA-4568:
-----------------------------------
For me, this was never a big problem before, because my Ruta-based applications
contained only one monolithic Ruta AE with all scripts. However, for UIMA-4484
this is essential for performance.
> Ruta: improve internal indexing for multiple ruta engine calls
> --------------------------------------------------------------
>
> Key: UIMA-4568
> URL: https://issues.apache.org/jira/browse/UIMA-4568
> Project: UIMA
> Issue Type: Improvement
> Components: ruta
> Affects Versions: 2.2.1ruta
> Reporter: Peter Klügl
>
> Right now, the RutaBasics are not cached in the CAS. Therefore, all the
> internal information in beginMap, endMap and partof needs to be recreated for
> each call of a Ruta analysis engine. Investigate how this problem can be
> solved or mitigated.
> Some ideas (brainstorming):
> - wait until annotations in UIMA are represented by normal Java objects
> - try to enforce caching the relevant FSs (RutaBasic)
> - store the information in features (the normal CAS way)
> - add an extra index (can it be injected in order to avoid the problems with
> type priorities?)
> - use an external resource (probably the most unnatural way)
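
To make the caching idea from the brainstorming list concrete, here is a minimal
sketch in plain Java. It does not use the UIMA or Ruta APIs. BasicIndexCache,
Span, Index and indexFor are hypothetical names invented for illustration, and
the document is an arbitrary object standing in for a CAS. The sketch only shows
that offset maps, loosely modelled on beginMap and endMap, can be built once per
document and reused by later engine calls.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.WeakHashMap;

/** Hypothetical per-document index cache; not the Ruta implementation. */
final class BasicIndexCache {

    /** Minimal stand-in for an annotation: just begin and end offsets. */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    /** Offset-keyed lookup maps, loosely modelled on beginMap and endMap. */
    static final class Index {
        final Map<Integer, List<Span>> beginMap = new HashMap<>();
        final Map<Integer, List<Span>> endMap = new HashMap<>();
    }

    // Weak keys: an entry can be collected once its document is unreachable.
    private final Map<Object, Index> cache = new WeakHashMap<>();
    private int builds = 0;

    /** Returns the cached index for the document, building it on first use. */
    synchronized Index indexFor(Object document, List<Span> spans) {
        Index index = cache.get(document);
        if (index == null) {
            index = build(spans);
            cache.put(document, index);
        }
        return index;
    }

    /** Number of times an index was actually built (for illustration). */
    synchronized int buildCount() {
        return builds;
    }

    // The expensive step that would otherwise run on every engine call.
    private Index build(List<Span> spans) {
        builds++;
        Index index = new Index();
        for (Span span : spans) {
            index.beginMap.computeIfAbsent(span.begin, k -> new ArrayList<>()).add(span);
            index.endMap.computeIfAbsent(span.end, k -> new ArrayList<>()).add(span);
        }
        return index;
    }
}
```

Two engines calling indexFor with the same document object share one Index. A
cache like this goes stale as soon as annotations are added or removed between
calls, so a real solution would also need some form of invalidation or
incremental update.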