Hi,

Portuguese has contractions: single words formed by combining, for
example, a preposition with an article, a pronoun, or an adverb.

Example:

Nós acreditávamos nele. (We believed him.)

Here "nele" can be split into "em" + "ele" (in + him).

To analyze this properly, I created two token annotations with the same
begin and end offsets. The first carries the POS tag for preposition, and
the second the POS tag for pronoun.

This is especially important for chunking, because the first token will be
part of a prepositional phrase, while the second will be part of a noun
phrase.

How can I guarantee that when I call uimaFIT's JCasUtil.select, the tokens
come back in order: first the preposition, then the pronoun?
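
To make the ordering I am after concrete, here is a minimal plain-Java
sketch. It does not use UIMA or uimaFIT at all: the Token class, its "part"
field, and the ordered() helper are hypothetical stand-ins I made up for
illustration. It only shows that, with identical offsets, begin and end
cannot decide the order, so some explicit tie-break is needed.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class ContractionOrder {

    // Hypothetical stand-in for a token annotation: offsets plus a POS tag.
    static final class Token {
        final int begin;
        final int end;
        final String pos;
        // Hypothetical field: position of this piece inside the contraction (0 = first).
        final int part;

        Token(int begin, int end, String pos, int part) {
            this.begin = begin;
            this.end = end;
            this.pos = pos;
            this.part = part;
        }
    }

    // Sort by begin, then end, then by the explicit part index to break ties.
    static List<Token> ordered(List<Token> tokens) {
        List<Token> copy = new ArrayList<>(tokens);
        copy.sort(Comparator.comparingInt((Token t) -> t.begin)
                .thenComparingInt(t -> t.end)
                .thenComparingInt(t -> t.part));
        return copy;
    }

    public static void main(String[] args) {
        // In "Nós acreditávamos nele." the word "nele" covers offsets 18..22.
        List<Token> tokens = new ArrayList<>();
        tokens.add(new Token(18, 22, "PRON", 1)); // "ele", added first on purpose
        tokens.add(new Token(18, 22, "PREP", 0)); // "em"

        for (Token t : ordered(tokens)) {
            System.out.println(t.pos + " " + t.begin + "-" + t.end);
        }
    }
}
```

What I would like is the equivalent guarantee from the index itself, without
having to re-sort the result of every select call.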

Thank you,
William
