[
https://issues.apache.org/jira/browse/UIMA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Peter Klügl resolved UIMA-2359.
-------------------------------
Resolution: Fixed
I do not want to change the behavior of BREAK, because I think there could be
situations where someone want to distinguish between \r and \n. However, I
added in UIMA-2452 a PlainTextAnnotator, which creates platform-independent
Line annotations. Thus, I'd say this issue is resolved.
> Different results of Text Maker in windows and unix
> ---------------------------------------------------
>
> Key: UIMA-2359
> URL: https://issues.apache.org/jira/browse/UIMA-2359
> Project: UIMA
> Issue Type: Bug
> Components: Sandbox, TextMarker
> Affects Versions: build-resources-2
> Environment: Windows
> Reporter: Luca Dini (CELI)
> Assignee: Peter Klügl
> Priority: Minor
> Labels: patch
>
> The class AbstractApplyScriptHandlerJob when called from the workbenck calls,
> for reding text to be analyzed the method:
> org.apache.uima.pear.util.FileUtil.loadTextFile(new File(each), "UTF-8");
> Such a method return nelines in window as 2 new lines. Therefore basic
> TextMarker annotations appears like:
> line BREAK BREAK
> line BREAK BREAK
> Therefore grammars written on windows must take into account the double break
> which make them not applicable when running on unix or when using other read
> methods, such as:
> Scanner sc = new Scanner(inFile, "UTF-8");
> String out = "";
> while (sc.hasNextLine()) {
> out += sc.nextLine() + "\n";
> }
> Relates to:
> https://issues.apache.org/jira/browse/UIMA-2133t
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira