Dear Jim,

I think I understand what you mean. Running external analyzers that use regular expressions to add tags to, or remove tags from, the text can lead to this situation. The problem does not arise with stand-off annotations.
Assuming that only the number of whitespace characters has changed, you can index each annotation on the **rank** of its non-whitespace characters instead of on raw character offsets. The rank of a non-whitespace character is its position when counting only non-whitespace characters, so it does not change as long as no non-whitespace character is added or removed. Using the ranks, you can map annotations between the old offsets and the new ones. This is the solution I came up with once.

Hope it helps.

On Thu, Sep 9, 2010 at 1:00 AM, Jim Hargrave wrote:
> I apologize if my terminology doesn't match with normal UIMA usage - but
> hopefully the general idea will be understandable.
>
> Is it always assumed that UIMA's document text is immutable? Let's say you
> have some text and with several position-based annotations. The text changes,
> now all of your annotation positions are incorrect. Are there API's that
> allow you to change your text, but still preserve the offsets in your
> annotations?
>
> Jim

--
http://www.univ-nantes.fr/hernandez-n
# Laboratoire LINA-TALN CNRS UMR 6241
tel. +33 (0)2 51 12 58 55
# Université de Nantes - Institut Universitaire de Technologie - Département Informatique
tel. +33 (0)2 40 30 60 67
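
For illustration, here is a minimal sketch of the rank idea described above, written in plain Python rather than against the UIMA API. The function names `nonspace_offsets` and `realign` are hypothetical, and the sketch assumes that the two versions of the text differ only in whitespace and that annotations are half-open `[begin, end)` character spans.

```python
from bisect import bisect_left


def nonspace_offsets(text):
    """Offsets of all non-whitespace characters; the list index is the rank."""
    return [i for i, ch in enumerate(text) if not ch.isspace()]


def realign(old_text, new_text, begin, end):
    """Map the span [begin, end) of old_text onto new_text.

    Assumption: old_text and new_text contain the same non-whitespace
    characters in the same order (only whitespace was added or removed).
    """
    old_pos = nonspace_offsets(old_text)
    new_pos = nonspace_offsets(new_text)
    if [old_text[i] for i in old_pos] != [new_text[i] for i in new_pos]:
        raise ValueError("texts differ in more than whitespace")

    # Ranks of the first and last non-whitespace characters inside the span.
    first_rank = bisect_left(old_pos, begin)
    last_rank = bisect_left(old_pos, end) - 1
    if first_rank > last_rank:
        raise ValueError("span covers no non-whitespace character")

    # The same ranks identify the same characters in the new text.
    return new_pos[first_rank], new_pos[last_rank] + 1


old = "The  quick brown\tfox"
new = "The quick\n\nbrown fox"
new_begin, new_end = realign(old, new, 5, 10)   # span of "quick" in old
print(new[new_begin:new_end])                   # quick
```

Note that leading or trailing whitespace inside a span is trimmed by this mapping, since whitespace has no rank. Whether that is acceptable depends on the annotations involved.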
