Full-Stop detection not working during full NLP parse
-----------------------------------------------------
Key: OPENNLP-240
URL: https://issues.apache.org/jira/browse/OPENNLP-240
Project: OpenNLP
Issue Type: Bug
Components: Parser
Affects Versions: tools-1.5.1-incubating
Environment: Win 7, JDK1.6.0_23
Example Instantiation:
public Parse parse(String line) {
if(parser == null)
parser = ParserFactory.create(new ParserModel(new
FileInputStream(_NLPModelPath+"/en-parser-chunking.bin")));
return ParserTool.parseLine(line, parser, 1)[0];
}// :end parse
Reporter: mark meiklejohn
Fix For: tools-1.5.2-incubating
There seems to be an issue with OpenNLP detecting the full stop at the end of
the sentence
(TOP (S (NP (PRP I)) (VP (VBP intend) (S (VP (TO to) (VP (VB quit) (NP (NP (NN
smoking)) (NP (DT this) (NN month.)))))))))
Although it does work fine with the tokenizer on its own
[I, intend, to, quit, smoking, this, month, .]
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira