UIMA annotators are too static

Jörn Kottmann Thu, 04 Oct 2007 06:03:11 -0700

Hello,

often annotators are more flexible and reusable than its assumed inUIMA.The configuration is static to the annotator because it is set viathe descriptor.

There are annotators which benefit from determining the types to useat runtimevia a configuration parameter. This is already possible (e.g theexample regex annotator),

but it is not possible to set the capabilities at runtime.

To improve this UIMA should ask the annotator for the capabilities.To make the configuration

easier it could be considered to add a "Type" range type for parameters.

This type system mapping makes annotators independent of one specifictype systemand allows the reuse with another type system, e.g. currently itshard to reuse a tokenizerfrom one group and combine it with a pos annotator from another groupsince

the token type would not match.

It is also not possible to set the language during runtime, e.g. anannotator

could have a model/rule file which also specifies the language.
The language setting is then redundant in the descriptor.

It should also be considered to reuse annotators with different settings

e.g. a few name finder instances but with different models anddifferent output types.

This is too already possible but some information in the descriptor must
be duplicated for every instance, e.g. version, implementing class, etc.

What do you think ?

Jörn

UIMA annotators are too static

Reply via email to