Hi Arkadi,

thanks for the great reference. I guess I can't fork child JVMs in my use
case though. Fortunately I don't have to crawl more than 100k sites and also
don't need a lot of multiple threads.
I'm going to take a look at your referenced class, thank you!

Regards,

Hannes

On Mon, May 24, 2010 at 1:07 AM, <[email protected]> wrote:

> Hi Hannes,
>
> This is done in Arch. See the au.csiro.cass.arch.utils.Starter class and
> its use from other classes. It was not straightforward because the used RAM
> tends to grow with iterations. To get around this, I had to fork child JVMs.
>
> You can find Arch here:
>
> http://www.atnf.csiro.au/computing/software/arch/
>
> Regards,
>
> Arkadi
>
> > -----Original Message-----
> > From: Hannes Carl Meyer [mailto:[email protected]]
> > Sent: Friday, May 21, 2010 7:33 PM
> > To: [email protected]
> > Subject: Running Nutch in a single VM
> >
> > Hi,
> >
> > is it possible to run nutch in a single virtual machine for intranet
> > crawling? Even inside a Java Application Server?
> >
> > Normally I'm using custom Nutch crawl scripts and start from the OS
> > command
> > line by cron. In a new project it is required to use a running Virtual
> > Machine for deloyment and invocation of crawler tasks.
> >
> > Does anybody has experiences in deploying Nutch in such a scenario?
> >
> > Kind Regards
> >
> > Hannes
> >
> > --
>



-- 

https://www.xing.com/profile/HannesCarl_Meyer
http://de.linkedin.com/in/hannescarlmeyer
http://twitter.com/hannescarlmeyer

Reply via email to