Alan Wang created OPENNLP-1316:
----------------------------------
Summary: Expand common contractions in the english language
Key: OPENNLP-1316
URL: https://issues.apache.org/jira/browse/OPENNLP-1316
Project: OpenNLP
Issue Type: Improvement
Reporter: Alan Wang
Hi, I want to know if OPENNLP needs to expand contractions.
i.g. +_*n't*_+ -> _*not*_, +_*'ve*_+ -> _*have*_, +_*'m*_+ -> _*am*_, but
+_*'s*_+ can be extended to _*is*_ or *_has_*, +_*'d*_+ can be extended to
*_had_* or _*would*_, depending on the context.
1、Use POSTag to mark contractions to determine which extension is to be used.
2、Like nltk, extend only some acronyms that are not ambiguous.
Thanks!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)