[ https://issues.apache.org/jira/browse/CTAKES-279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13884562#comment-13884562 ]
James Joseph Masanz commented on CTAKES-279: -------------------------------------------- this was caused by a \r (carriage return) character appearing within the text (not immediately prior to a \n) this caused part of assertion to think there were 7 tokens on a line and part to think there were fewer (stopping before the \r) > ArrayIndexOutOfBoundsException: 7 > generateFeatures(SingleDocumentProcessor.java:274) > ------------------------------------------------------------------------------------ > > Key: CTAKES-279 > URL: https://issues.apache.org/jira/browse/CTAKES-279 > Project: cTAKES > Issue Type: Bug > Components: ctakes-assertion > Affects Versions: 3.1.1 > Environment: using AggregatePlaintextUMLSProcessor within cTAKES 3.1.1 > Reporter: James Joseph Masanz > > I can't post the data that caused this, but I've hit this exception 3 times. > I will debug and post more. Here's part of the stack trace. I am using > AggregatePlaintextUMLSProcessor within cTAKES 3.1.1 > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7 > at > org.mitre.medfacts.i2b2.api.SingleDocumentProcessor.generateFeatures(SingleDocumentProcessor.java:274) > at > org.mitre.medfacts.i2b2.api.SingleDocumentProcessor.processSingleDocument(SingleDocumentProcessor.java:102) > at > org.apache.ctakes.assertion.medfacts.AssertionAnalysisEngine.process(AssertionAnalysisEngine.java:210) > at > org.apache.uima.analysis_component.JCasAnnotator_ImplBase.process(JCasAnnotator_ImplBase.java:48) > at > org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.callAnalysisComponentProcess(PrimitiveAnalysisEngine_impl.java:375) > ... 14 more -- This message was sent by Atlassian JIRA (v6.1.5#6160)