Goldschmidt, Dave wrote:
Hi Rafi,
Not sure if anyone answered this, but I think you're just after the
segslice command:
$ nutch segslice
If I understand the original request, that's only half of the answer,
but the right half.. ;-)
segslice doesn't slice the Lucene indexes, only the
, December 20, 2005 10:39 AM
To: nutch-dev@lucene.apache.org
Subject: Re: [Nutch-dev] distributed search
Goldschmidt, Dave wrote:
Hi Rafi,
Not sure if anyone answered this, but I think you're just after the
segslice command:
$ nutch segslice
If I understand the original request, that's only half
Thank you Dave, very helfull.
-Ledio
-Original Message-
From: Goldschmidt, Dave [mailto:[EMAIL PROTECTED]
Sent: Tuesday, December 20, 2005 7:24 AM
To: nutch-dev@lucene.apache.org
Subject: RE: [Nutch-dev] distributed search
Hi Rafi,
Not sure if anyone answered this, but I think you're
I tried separating the Tomcat into a different machine and bingo.
The performance went up by 30%%. Right now I only have two machines
with 900K URLs each that act as Nutch servers and one machine that hosts the
Tomcat.
At this time I don't suspect any more that Tomcat is synchronously
By the way, is there an easy way to split the index I have already
have.
I would hate to recrawl all of the 1.9MM URLs again and waste
bandwidth.
Well I do not know any tool that comes with nutch or a other tool
that does it, may there is one.
But to write a java class that creates two
across several machines.
e.g.
bin/nutch org.apache.nutch.tools.FetchListTool ..
Rafi
From: Stefan Groschupf [EMAIL PROTECTED]
Reply-To: nutch-dev@lucene.apache.org
To: nutch-dev@lucene.apache.org
Subject: Re: [Nutch-dev] distributed search
Date: Tue, 20 Dec 2005 00:38:22 +0100
By the way
I have the book so I'll check what I can do with the API.
Thanks Stefan,
Ledio
-Original Message-
From: Stefan Groschupf [mailto:[EMAIL PROTECTED]
Sent: Monday, December 19, 2005 3:38 PM
To: nutch-dev@lucene.apache.org
Subject: Re: [Nutch-dev] distributed search
By the way