Re: getting original indexes from JCas Text (Stanford NER and UIMA)

Jörn Kottmann Wed, 14 Oct 2009 07:31:13 -0700

Julio C. wrote:

Hi everybody,


I'm working with the Stanford NER and UIMA, and I was wondering if there's an
easy and clean way to get the begin/end position of each word in the original
JCas document (from .getDocumentText()) after the text has been converted into
List<List<CoreLabel>> and processed by the NER.

Maybe you can keep an array of your word annotations and
then use the absolute index of a CoreLabel to map back to
the word annotation, which can then be used to retrieve its
offset and length.
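
A minimal sketch of the array idea, in plain Java so it runs without the
Stanford or UIMA jars. WordSpan, tokenize and absoluteIndex are hypothetical
names invented for this illustration; in a real annotator the spans would be
your word annotations and the sentence lengths would come from the sizes of
the inner lists of the List<List<CoreLabel>>. It assumes the tokens are kept
in document order and that the NER neither adds nor drops tokens.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class IndexMapping {
    /** Hypothetical stand-in for a word annotation: character offsets only. */
    static final class WordSpan {
        final int begin;
        final int end;
        WordSpan(int begin, int end) { this.begin = begin; this.end = end; }
    }

    /** Splits on whitespace and records each word's offsets in the original text. */
    static List<WordSpan> tokenize(String text) {
        List<WordSpan> spans = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            while (i < text.length() && Character.isWhitespace(text.charAt(i))) i++;
            int start = i;
            while (i < text.length() && !Character.isWhitespace(text.charAt(i))) i++;
            if (i > start) spans.add(new WordSpan(start, i));
        }
        return spans;
    }

    /** Converts (sentence index, token index within sentence) to an absolute token index. */
    static int absoluteIndex(List<Integer> sentenceLengths, int sentence, int token) {
        int offset = 0;
        for (int s = 0; s < sentence; s++) offset += sentenceLengths.get(s);
        return offset + token;
    }

    public static void main(String[] args) {
        String text = "Jim works here. He lives in Paris.";
        List<WordSpan> words = tokenize(text);

        // Assumed sentence split: 3 tokens in the first sentence, 4 in the second.
        List<Integer> sentenceLengths = Arrays.asList(3, 4);

        // Fourth token of the second sentence, as the NER output would address it.
        WordSpan w = words.get(absoluteIndex(sentenceLengths, 1, 3));
        System.out.println(text.substring(w.begin, w.end) + " [" + w.begin + ", " + w.end + ")");
    }
}
```

The lookup only works as long as both sides agree on the token count, so it
may be worth checking that the total number of labels equals the number of
word annotations before mapping back.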

Otherwise you could use a map from each CoreLabel
to its word annotation.
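
A rough sketch of the map variant, again in plain Java with stand-in types.
Token and WordSpan are hypothetical placeholders for CoreLabel and the word
annotation. The sketch keys the map by object identity (IdentityHashMap), on
the assumption that the very same label objects you created are the ones you
get back after classification; if the NER hands back copies, this lookup
would miss and the index-based approach is the safer choice.

```java
import java.util.ArrayList;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;

public class LabelToWordMap {
    /** Hypothetical stand-in for a token object such as a CoreLabel. */
    static final class Token {
        final String word;
        Token(String word) { this.word = word; }
    }

    /** Hypothetical stand-in for a word annotation: character offsets only. */
    static final class WordSpan {
        final int begin;
        final int end;
        WordSpan(int begin, int end) { this.begin = begin; this.end = end; }
    }

    /** Creates one Token per whitespace-separated word and records its span in the map. */
    static List<Token> tokenize(String text, Map<Token, WordSpan> spans) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            while (i < text.length() && Character.isWhitespace(text.charAt(i))) i++;
            int start = i;
            while (i < text.length() && !Character.isWhitespace(text.charAt(i))) i++;
            if (i > start) {
                Token t = new Token(text.substring(start, i));
                tokens.add(t);
                spans.put(t, new WordSpan(start, i));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "to be or not to be";

        // Identity-based keys keep the two "to" tokens apart even though their text is equal.
        Map<Token, WordSpan> spans = new IdentityHashMap<>();
        List<Token> tokens = tokenize(text, spans);

        Token secondTo = tokens.get(4);
        WordSpan s = spans.get(secondTo);
        System.out.println(secondTo.word + " [" + s.begin + ", " + s.end + ")");
    }
}
```

Keying by identity rather than by equals() matters when the same word occurs
more than once in the document, since a value-based key could map both
occurrences to a single span.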

Jörn
