[ https://issues.apache.org/jira/browse/OPENNLP-627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann closed OPENNLP-627.
----------------------------------
    Resolution: Won't Fix

> Major performance degradation with large chunk of data
> ------------------------------------------------------
>
>                 Key: OPENNLP-627
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-627
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: 1.6.0
>         Environment: Mac OS X, Java 1.7, Xmx parameter 1024m (in all cases)
>            Reporter: Vihari Piratla
>              Labels: performance
>
> I have a web page corpus from which I want to extract named entities.
> When I run NER on each line individually, it takes around 6 sec. When I
> load the whole corpus and run NER on it in one pass, the same task on the
> same data takes 600 sec. Is this a bug? This is the web page that I am
> trying to extract names from:
> www.sec.gov/Archives/edgar/data/1326801/000119312512034517/d287954ds1.htm
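
The faster, line-by-line variant described in the report could be structured roughly as in the sketch below. It is only an illustration under stated assumptions: `LineWiseNer`, `findPerLine`, and the `finder` callback are hypothetical names, the whitespace tokenization is a placeholder, and the toy finder in `main` stands in for a real name finder model. The sketch does not explain why the single-pass run was slower; it only shows how to keep each call's input small.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

public class LineWiseNer {

    /**
     * Runs a name finder one line at a time instead of once over the whole corpus.
     * `finder` is a hypothetical stand-in for the real NER call: it receives the
     * tokens of a single line and returns the names it found in that line.
     */
    static List<String> findPerLine(String corpus, Function<String[], List<String>> finder) {
        List<String> names = new ArrayList<>();
        // \R matches any line break (\n, \r\n, ...), so each chunk is one line.
        for (String line : corpus.split("\\R")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // skip blank lines
            }
            // Placeholder tokenization on whitespace; a real tokenizer would go here.
            String[] tokens = trimmed.split("\\s+");
            names.addAll(finder.apply(tokens));
        }
        return names;
    }

    public static void main(String[] args) {
        // Toy finder (assumption, not a real model): capitalized tokens count as names.
        Function<String[], List<String>> toyFinder = tokens -> {
            List<String> found = new ArrayList<>();
            for (String token : tokens) {
                if (Character.isUpperCase(token.charAt(0))) {
                    found.add(token);
                }
            }
            return found;
        };
        System.out.println(findPerLine("Alice met Bob\n\nno names here", toyFinder));
    }
}
```

With a real model, the callback would wrap the library's own find call on the token array; the surrounding loop stays the same.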



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
