disagreement between POS of parser and POStagger
------------------------------------------------
Key: OPENNLP-331
URL: https://issues.apache.org/jira/browse/OPENNLP-331
Project: OpenNLP
Issue Type: Bug
Components: Similarity
Reporter: Boris Galitsky
Assignee: Boris Galitsky
ParserTool.parseLine(sentence, parser, 1) gives:
How can I get short focus zoom lens for digital camera
type = S
type = WHADVP
type = WRB, word = How
type = SQ
type = MD, word = can
type = NP
type = PRP, word = I
type = VP
type = VB, word = get
type = NP
type = JJ, word = short
type = NN, word = focus
type = NN, word = zoom // ZOOM is NOUN: correct
type = NN, word = lens
type = PP
type = IN, word = for
type = NP
type = JJ, word = digital
type = NN, word = camera
But new POSTaggerME(model).tag(toks) gives:
[WRB, MD, PRP, VB, JJ, NN, VBN, NN, IN, JJ, NN]
The seventh tag, VBN (verb, past participle), is the problem: "zoom" is a noun in this sentence and should be tagged NN, as the parser does.
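To make the disagreement easy to spot, the two tag sequences can be compared position by position. The following is only a minimal sketch: TagDiff and disagreements are hypothetical names, not part of OpenNLP, and the tag arrays are hard-coded from the output quoted in this report. In real code they would presumably come from the parse returned by ParserTool.parseLine and from POSTaggerME.tag.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not an OpenNLP class.
class TagDiff {

    // Returns one entry per token position where the two tag sequences differ.
    static List<String> disagreements(String[] tokens, String[] parserTags, String[] taggerTags) {
        if (tokens.length != parserTags.length || tokens.length != taggerTags.length) {
            throw new IllegalArgumentException("tokens and tag arrays must have the same length");
        }
        List<String> out = new ArrayList<>();
        for (int i = 0; i < tokens.length; i++) {
            if (!parserTags[i].equals(taggerTags[i])) {
                // Zero-based token index, the token, and both tags.
                out.add(i + ":" + tokens[i] + " parser=" + parserTags[i] + " tagger=" + taggerTags[i]);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Sentence and tags copied from the report above.
        String[] tokens = "How can I get short focus zoom lens for digital camera".split(" ");
        String[] parserTags = {"WRB", "MD", "PRP", "VB", "JJ", "NN", "NN", "NN", "IN", "JJ", "NN"};
        String[] taggerTags = {"WRB", "MD", "PRP", "VB", "JJ", "NN", "VBN", "NN", "IN", "JJ", "NN"};
        // prints [6:zoom parser=NN tagger=VBN]
        System.out.println(disagreements(tokens, parserTags, taggerTags));
    }
}
```

A check like this only works when both components see the same tokenization; if the parser and the tagger split the sentence differently, the arrays differ in length and the helper rejects them instead of reporting misaligned tags.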