[ https://issues.apache.org/jira/browse/OPENNLP-627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joern Kottmann closed OPENNLP-627.
----------------------------------
Resolution: Won't Fix
> Major performance degradation with large chunk of data
> ------------------------------------------------------
>
> Key: OPENNLP-627
> URL: https://issues.apache.org/jira/browse/OPENNLP-627
> Project: OpenNLP
> Issue Type: Bug
> Components: Name Finder
> Affects Versions: 1.6.0
> Environment: Mac OS X, Java 1.7, -Xmx1024m (in all cases)
> Reporter: Vihari Piratla
> Labels: performance
>
> I have a web page corpus from which I want to extract named entities.
> When I run NER on each line individually, it takes around 6 sec. When I
> load the whole corpus and run NER on it as a whole, the same task on the
> same data takes 600 sec. Is this a bug? This is the web page that I am
> trying to extract names from:
> www.sec.gov/Archives/edgar/data/1326801/000119312512034517/d287954ds1.htm
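
The comparison described above (one tagger call per line versus one call over the whole text) can be reproduced with a small timing harness. The following is only a minimal sketch, not the reporter's code: the class name NerTimingSketch, the whitespace tokenizer, and the countCapitalized tagger are hypothetical stand-ins so that the example runs without OpenNLP or a model file. To measure the real case, the stand-in would presumably be replaced by a function that calls NameFinderME.find(String[] tokens) on a loaded model.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.ToIntFunction;

public class NerTimingSketch {

    // Whitespace tokenizer; a stand-in for a real tokenizer.
    static String[] tokenize(String text) {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? new String[0] : trimmed.split("\\s+");
    }

    // Hypothetical stand-in tagger (NOT OpenNLP): counts tokens that
    // start with an uppercase letter.
    static int countCapitalized(String[] tokens) {
        int hits = 0;
        for (String token : tokens) {
            if (Character.isUpperCase(token.charAt(0))) {
                hits++;
            }
        }
        return hits;
    }

    // Variant 1: one tagger call per line.
    static int tagPerLine(List<String> lines, ToIntFunction<String[]> tagger) {
        int total = 0;
        for (String line : lines) {
            total += tagger.applyAsInt(tokenize(line));
        }
        return total;
    }

    // Variant 2: a single tagger call over the concatenated text.
    static int tagWhole(List<String> lines, ToIntFunction<String[]> tagger) {
        return tagger.applyAsInt(tokenize(String.join("\n", lines)));
    }

    public static void main(String[] args) {
        // Illustrative input only; the real corpus would be read from disk.
        List<String> lines = Arrays.asList(
                "Facebook Inc filed",
                "a registration statement",
                "with the SEC");

        long t0 = System.nanoTime();
        int perLine = tagPerLine(lines, NerTimingSketch::countCapitalized);
        long t1 = System.nanoTime();
        int whole = tagWhole(lines, NerTimingSketch::countCapitalized);
        long t2 = System.nanoTime();

        System.out.printf("per line: %d hits in %d us%n", perLine, (t1 - t0) / 1000);
        System.out.printf("whole:    %d hits in %d us%n", whole, (t2 - t1) / 1000);
    }
}
```

With the stand-in tagger both variants do the same amount of work, so the timings should be close. The harness only becomes informative once a real name finder is plugged in, where the length of the token array passed to a single call may matter.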