The runtime almost scales with the number of cores your CPU you have. If you have a 4 core CPU you might come down from 3 hours to 1 hour.
To enabled it you need to train with the -params argument and provide a config file for the learner. There are samples shipped with OpenNLP. Jörn On Wed, 2014-11-19 at 20:19 +0000, nikhil jain wrote: > Hi Rodrigo, > No, I am not using multi-threading, it's a simple Java program, took help > from openNLP documentation but it is worth mentioning over here is that as > the corpus is containing 4 million records so my Java program running in > eclipse was frequently giving me java heap space issue (out of memory issue) > so I investigate a bit and found that process was taking around 10GB memory > for building the model so i increased the memory to 10 GB using -Xmx > parameter. so it worked properly but took 3 hours. > Thanks-NIkhil > From: Rodrigo Agerri <rage...@apache.org> > To: "dev@opennlp.apache.org" <dev@opennlp.apache.org>; nikhil jain > <nikhil_jain1...@yahoo.com> > Cc: "us...@opennlp.apache.org" <us...@opennlp.apache.org> > Sent: Wednesday, November 19, 2014 2:17 AM > Subject: Re: Need to speed up the model creation process of OpenNLP > > Hi, > > Are you using multithreading, lots of threads, RAM memory? > > R > > > > > On Tue, Nov 18, 2014 at 5:46 PM, nikhil jain > <nikhil_jain1...@yahoo.com.invalid> wrote: > > Hi, > > I asked below question yesterday, did anyone get a chance to look at this. > > I am new in OpenNLP and really need some help. Please provide some clue or > > link or example. > > ThanksNIkhil > > From: nikhil jain <nikhil_jain1...@yahoo.com.INVALID> > > To: "us...@opennlp.apache.org" <us...@opennlp.apache.org>; Dev at Opennlp > > Apache <dev@opennlp.apache.org> > > Sent: Tuesday, November 18, 2014 12:02 AM > > Subject: Need to speed up the model creation process of OpenNLP > > > > Hi, > > I am using OpenNLP Token Name Finder for parsing the unstructured data. I > > have created a corpus of about 4 million records. When I am creating a > > model out of the training set using openNLP API's in Eclipse using default > > setting (cut-off 5 and iterations 100), process is taking a good amount of > > time, around 2-3 hours. > > Can someone suggest me how can I reduce the time as I want to experiment > > with different iterations but as the model creation process is taking so > > much time, I am not able to experiment with it. This is really a time > > consuming process. > > Please provide some feedback. > > Thanks in advance.Nikhil Jain > > > > > >