[
https://issues.apache.org/jira/browse/OPENNLP-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925694#comment-16925694
]
Jiri Zamecnik commented on OPENNLP-1183:
----------------------------------------
I have some first code
[here|https://github.com/zameji/opennlp/tree/Language-model]. Still a lot to
do, though.
At this point, the main TODOs are:
1. Compression
2. Proper speed/memory benchmark (getting just rough estimates now with
nanoTime & classmexer)
3. Estimators
Conceptually, I was thinking about how to add new estimators in a way that
would permit easy extension. One option I though about was having a separate
estimator class where the smoothing method would be chosen at instance
initialization. Not sure whether that is the most elegant method, though.
> Better language model support
> -----------------------------
>
> Key: OPENNLP-1183
> URL: https://issues.apache.org/jira/browse/OPENNLP-1183
> Project: OpenNLP
> Issue Type: Improvement
> Components: language model
> Reporter: Tommaso Teofili
> Priority: Major
>
> As per
> [ONIP-1|https://cwiki.apache.org/confluence/display/OPENNLP/ONIP-1+Better+language+model+support]
> it would be nice to provide better language modelling support. This means
> more compact models, faster prediction, more accurate estimations.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)