[ 
https://issues.apache.org/jira/browse/UIMA-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12600766#action_12600766
 ] 

Burn Lewis commented on UIMA-1041:
----------------------------------

FWIW - on Windows the offsets look correct, using a build from src.  Python & 
Perl & Tcl XCASes look OK in CVD.

But the Tcl example generates duplicate annotations and gets an overflow 
error. The problem seems to be that the Tcl regexp match sometimes returns 
index pairs such as:
65 69
65 69
-1 -1
I tried the ActiveTcl 8.4 build and also built 8.4 & 8.5.
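
For illustration only, here is a minimal Python sketch of one way a script 
could guard against pairs like these before creating annotations. It is not 
the Tcl sample's code. It assumes the repeated pair comes from a capture group 
that spans the whole match and the -1 -1 pair from a group that did not 
participate; the helper name annotation_spans and the sample text are 
hypothetical. Note that Python reports an exclusive end (65, 70) where the Tcl 
output above appears to be inclusive (65 69).

```python
import re

def annotation_spans(match):
    """Return the unique (begin, end) spans of a match.

    Groups that did not participate in the match report (-1, -1) and are
    skipped; spans already seen (for example a group equal to the whole
    match) are dropped so that no duplicate annotation is created.
    """
    seen = set()
    spans = []
    for group in range(match.re.groups + 1):
        span = match.span(group)
        if span == (-1, -1):
            continue  # unmatched alternative, nothing to annotate
        if span in seen:
            continue  # same offsets as an earlier group
        seen.add(span)
        spans.append(span)
    return spans

# Hypothetical document text: 65 filler characters, then the word to match.
text = "x" * 65 + "hello world"
m = re.search(r"(hello)|(\d+)", text)

# Unfiltered pairs: whole match, first group, unmatched second group.
raw = [m.span(g) for g in range(m.re.groups + 1)]
clean = annotation_spans(m)
```

With this input, raw holds the whole match, the same offsets again for the 
first group, and (-1, -1) for the second group, while clean holds a single 
span. Whether the Tcl annotator should filter this way or change its regexp 
call is a separate question.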

> UIMACPP Pythonator issues with annotation offsets and lengths - off by 1 
> errors
> -------------------------------------------------------------------------------
>
>                 Key: UIMA-1041
>                 URL: https://issues.apache.org/jira/browse/UIMA-1041
>             Project: UIMA
>          Issue Type: Bug
>          Components: C++ Framework
>         Environment: RedHat, UIMACPP 2.2.2 release candidate 01, uima base 
> 2.2.2
>            Reporter: Marshall Schor
>
> The sample python script when run in the document analyzer shows annotations 
> where the highlight is always missing the last character, and the details 
> show the offsets for the begin and end to be both one too low.
> To reproduce, run the sample script in the python directory of the 
> scriptators (after doing a build/install of the pythonator following the 
> directions in the python directory in python.html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
