[
https://issues.apache.org/jira/browse/UIMA-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12600771#action_12600771
]
Eddie Epstein commented on UIMA-1041:
-------------------------------------
The problem Marshall reported happens only for some sample text files, and
only on Linux; one such file is $UIMA_HOME/examples/data/UIMA_Seminars.txt.
The problem is related to the 3-byte sequence at the head of the file.
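
A minimal sketch of how a leading byte sequence can shift offsets by one,
assuming the sequence is a UTF-8 byte order mark (EF BB BF); the comment above
does not confirm that. The sample text and the find_span helper are
hypothetical and are not taken from the Pythonator sample script.

```python
import codecs

# Hypothetical stand-in for a sample file: UTF-8 text preceded by the
# 3-byte BOM (EF BB BF). That the real file starts with a BOM is an assumption.
raw = codecs.BOM_UTF8 + "UIMA Seminars".encode("utf-8")

# "utf-8" keeps the BOM as the character U+FEFF at index 0;
# "utf-8-sig" strips it while decoding.
with_bom = raw.decode("utf-8")
without_bom = raw.decode("utf-8-sig")


def find_span(text, word):
    """Return (begin, end) character offsets of the first occurrence of word."""
    begin = text.find(word)
    return begin, begin + len(word)


span_with_bom = find_span(with_bom, "Seminars")
span_without_bom = find_span(without_bom, "Seminars")

# The two spans differ by exactly one character position.
print(span_with_bom, span_without_bom)
```

If one component computes offsets on text that still contains the leading
character and another displays text with it stripped, every begin and end
offset would be off by one, which would be consistent with the reported
symptom.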
Which files are causing problems with tclator? I am not seeing that behavior.
Thanks, Eddie
> UIMACPP Pythonator issues with annotation offsets and lengths - off by 1
> errors
> -------------------------------------------------------------------------------
>
> Key: UIMA-1041
> URL: https://issues.apache.org/jira/browse/UIMA-1041
> Project: UIMA
> Issue Type: Bug
> Components: C++ Framework
> Environment: RedHat, UIMACPP 2.2.2 release candidate 01, uima base
> 2.2.2
> Reporter: Marshall Schor
>
> The sample Python script, when run in the document analyzer, shows annotations
> where the highlight is always missing the last character, and the details
> show the begin and end offsets to both be one too low.
> To reproduce, run the sample script in the python directory of the
> scriptators (after doing a build/install of the pythonator following the
> directions in python.html in the python directory).