On Sep 12, 2008, at 6:13 PM, Julien Nioche wrote:
Dear UIMA devs,
We have recently developed an AnnotationReader for UIMA which uses Tika to
convert the markup into annotations. The resource consists of a
CollectionReader, a CasAnnotator and a utility class which can populate a
CAS with markup annotations. It is certainly not perfect, but it does a
decent job. The type system is inspired see
http://cwiki.apache.org/UIMA/uima-sandbox-components.html
There is already a JIRA issue to add a Tika Annotator to the sandbox:
https://issues.apache.org/jira/browse/UIMA-1095
Jörn