[ 
https://issues.apache.org/jira/browse/OPENNLP-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Wiesner updated OPENNLP-1316:
------------------------------------
    Issue Type: Question  (was: Improvement)

> Expand common contractions in the english language
> --------------------------------------------------
>
>                 Key: OPENNLP-1316
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1316
>             Project: OpenNLP
>          Issue Type: Question
>            Reporter: Alan Wang
>            Priority: Minor
>
> Hi, I want to know if OPENNLP needs to expand contractions.
> i.g. +_*n't*_+  -> _*not*_, +_*'ve*_+  -> _*have*_, +_*'m*_+  -> _*am*_, but 
> +_*'s*_+ can be extended to _*is*_ or *_has_*, +_*'d*_+ can be extended to 
> *_had_* or _*would*_, depending on the context.
> 1、Use POSTag to mark contractions to determine which extension is to be used.
> 2、Like nltk, extend only some contractions that are not ambiguous.
> Thanks!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to