[
https://issues.apache.org/jira/browse/OPENNLP-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15223837#comment-15223837
]
Rakesh commented on OPENNLP-831:
--------------------------------
I created a model using MAXNET with 100 iterations. Training data consists of
500K sentences. Each sentence consists of 5 to 30 words. The training data
contains only six entities. First, I used the command prompt to create a model.
It took nearly 20 hours to create a model. Then I used the Java method to
create a model. In this case, it took only 10 minutes.
> NER model creation is taking much time for large files
> ------------------------------------------------------
>
> Key: OPENNLP-831
> URL: https://issues.apache.org/jira/browse/OPENNLP-831
> Project: OpenNLP
> Issue Type: Improvement
> Components: Name Finder
> Affects Versions: 1.6.0
> Environment: linux
> Reporter: Rakesh
> Priority: Blocker
>
> opennlp 1.6 is taking huge amount of time (nearly 6 hours ) to train a set of
> sentences (sentences count = 20000) . Each sentence may vary in length .You
> must try to decrease the training time. Is there any way to decrease the
> training time.If yes please let me know.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)