[
https://issues.apache.org/jira/browse/OPENNLP-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13882808#comment-13882808
]
Tim Miller commented on OPENNLP-602:
------------------------------------
Sorry to do this, but now I'm getting an error from the original fix. i think
the issue is with the sentProbs.remove(si) call on line 245 of
SentenceDetectorME. Not sure why it seemed to work yesterday. I think a simpler
solution might be to have the whitespace sentence happen earlier before the
classifier is even queried. I'll attach a patch.
> SentenceDetector should support new line as and end of sentence char
> --------------------------------------------------------------------
>
> Key: OPENNLP-602
> URL: https://issues.apache.org/jira/browse/OPENNLP-602
> Project: OpenNLP
> Issue Type: Improvement
> Components: Sentence Detector
> Affects Versions: tools-1.5.3
> Reporter: Joern Kottmann
> Assignee: Joern Kottmann
> Priority: Minor
> Fix For: 1.6.0
>
>
> The Sentence Detector should have support to consider new line chars as the
> end of a sentence. This will probably require special handling in the
> training code to assume that there is an new line char if any other eos is
> missing.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)