Julio C. wrote:
Hi everybody,
I'm working with the Stanford NER and UIMA and I was wondering if there's an
easy and clean way to get the begin/end position of each word in the original
JCas document (from .getDocumentText()) after it has been converted into
List<List<CoreLabel>> and processed by the NER.
Maybe you can keep an array of your word annotations and
then use the absolute index of a CoreLabel (its position
counted across all sentences) to map back to the word
annotation, which can then be used to retrieve its offset
and length.
Otherwise you could use a map from CoreLabel to word
annotation.
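
Both ideas could look roughly like the sketch below. It does not use the
real Stanford or UIMA classes: Word and Label are hypothetical stand-ins
for a UIMA word annotation and a CoreLabel, and OffsetMapping, toLabels and
absoluteIndex are names made up for the illustration. The map variant
assumes the NER hands back the same label objects that were passed in,
which is why it is keyed by identity.

```java
import java.util.ArrayList;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;

class OffsetMapping {
    /** Hypothetical stand-in for a UIMA word annotation (offsets into the document text). */
    static final class Word {
        final int begin, end;
        Word(int begin, int end) { this.begin = begin; this.end = end; }
    }

    /** Hypothetical stand-in for a CoreLabel; it only carries the covered text. */
    static final class Label {
        final String text;
        Label(String text) { this.text = text; }
    }

    /**
     * Builds one Label per Word, sentence by sentence. Each Word is appended to
     * flatWords (idea 1) and registered in labelToWord (idea 2) along the way.
     */
    static List<List<Label>> toLabels(String docText, List<List<Word>> sentences,
                                      List<Word> flatWords, Map<Label, Word> labelToWord) {
        List<List<Label>> result = new ArrayList<>();
        for (List<Word> sentence : sentences) {
            List<Label> labels = new ArrayList<>();
            for (Word w : sentence) {
                Label l = new Label(docText.substring(w.begin, w.end));
                labels.add(l);
                flatWords.add(w);
                labelToWord.put(l, w);
            }
            result.add(labels);
        }
        return result;
    }

    /** Idea 1: position of a label counted across all preceding sentences. */
    static int absoluteIndex(List<List<Label>> labels, int sentenceIdx, int tokenIdx) {
        int index = tokenIdx;
        for (int s = 0; s < sentenceIdx; s++) {
            index += labels.get(s).size();
        }
        return index;
    }

    public static void main(String[] args) {
        String docText = "Julio works at Stanford";
        List<Word> sentence = new ArrayList<>();
        sentence.add(new Word(0, 5));
        sentence.add(new Word(6, 11));
        sentence.add(new Word(12, 14));
        sentence.add(new Word(15, 23));
        List<List<Word>> sentences = new ArrayList<>();
        sentences.add(sentence);

        List<Word> flatWords = new ArrayList<>();
        // Identity-based: two labels with the same text must not collide.
        Map<Label, Word> labelToWord = new IdentityHashMap<>();
        List<List<Label>> labels = toLabels(docText, sentences, flatWords, labelToWord);

        // ... the NER would process 'labels' here ...

        for (int t = 0; t < labels.get(0).size(); t++) {
            Label l = labels.get(0).get(t);
            Word viaIndex = flatWords.get(absoluteIndex(labels, 0, t));
            Word viaMap = labelToWord.get(l);
            System.out.println(l.text + " index:" + viaIndex.begin + "-" + viaIndex.end
                    + " map:" + viaMap.begin + "-" + viaMap.end);
        }
    }
}
```

The index variant only works if the NER neither drops nor reorders tokens;
the map variant only works if it returns the very same label objects. If
neither holds in your setup, you would need another way to carry the
offsets along.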
Jörn