[ 
https://issues.apache.org/jira/browse/UIMA-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12600771#action_12600771
 ] 

Eddie Epstein commented on UIMA-1041:
-------------------------------------

The problem Marshall reported happens only for some sample text files, and 
only on Linux; one such file is $UIMA_HOME/examples/data/UIMA_Seminars.txt. The 
problem is related to the 3-byte sequence at the head of the file.
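
To illustrate how three leading bytes could produce an off-by-one offset, here 
is a minimal sketch. It assumes the 3-byte sequence is a UTF-8 byte order mark 
(EF BB BF), which has not been confirmed for this file; the sample string and 
the find_span helper are made up for the illustration and are not Pythonator 
code.

```python
import codecs

# Assumption: the 3 leading bytes are the UTF-8 byte order mark (EF BB BF).
raw = codecs.BOM_UTF8 + "UIMA Seminars".encode("utf-8")

# A decoder that keeps the BOM turns those 3 bytes into one character, U+FEFF.
kept = raw.decode("utf-8")
# A decoder that strips the BOM yields text that is one character shorter.
stripped = raw.decode("utf-8-sig")


def find_span(text, word):
    """Hypothetical helper: return (begin, end) character offsets of word."""
    begin = text.index(word)
    return begin, begin + len(word)


span_kept = find_span(kept, "Seminars")
span_stripped = find_span(stripped, "Seminars")

print(span_kept, span_stripped)
```

If one component computes offsets on the stripped text while the viewer 
displays the text with U+FEFF still in place, every begin and end would be one 
too low and the highlight would lose its last character.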

Which files are causing problems with tclator? I am not seeing that behavior.

Thanks, Eddie

> UIMACPP Pythonator issues with annotation offsets and lengths - off by 1 
> errors
> -------------------------------------------------------------------------------
>
>                 Key: UIMA-1041
>                 URL: https://issues.apache.org/jira/browse/UIMA-1041
>             Project: UIMA
>          Issue Type: Bug
>          Components: C++ Framework
>         Environment: RedHat, UIMACPP 2.2.2 release candidate 01, uima base 
> 2.2.2
>            Reporter: Marshall Schor
>
> The sample Python script, when run in the document analyzer, shows annotations 
> where the highlight is always missing the last character, and the details 
> show the offsets for the begin and end to be both one too low.
> To reproduce, run the sample script in the python directory of the 
> scriptators (after doing a build/install of the pythonator following the 
> directions in the python directory in python.html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
