Re: [Nutch-dev] distributed search

2005-12-20 Thread Andrzej Bialecki
Goldschmidt, Dave wrote: Hi Rafi, Not sure if anyone answered this, but I think you're just after the segslice command: $ nutch segslice If I understand the original request, that's only half of the answer, but the right half.. ;-) segslice doesn't slice the Lucene indexes, only the

RE: [Nutch-dev] distributed search

2005-12-20 Thread Goldschmidt, Dave
, December 20, 2005 10:39 AM To: nutch-dev@lucene.apache.org Subject: Re: [Nutch-dev] distributed search Goldschmidt, Dave wrote: Hi Rafi, Not sure if anyone answered this, but I think you're just after the segslice command: $ nutch segslice If I understand the original request, that's only half

RE: [Nutch-dev] distributed search

2005-12-20 Thread Ledio Ago
Thank you Dave, very helfull. -Ledio -Original Message- From: Goldschmidt, Dave [mailto:[EMAIL PROTECTED] Sent: Tuesday, December 20, 2005 7:24 AM To: nutch-dev@lucene.apache.org Subject: RE: [Nutch-dev] distributed search Hi Rafi, Not sure if anyone answered this, but I think you're

RE: [Nutch-dev] distributed search

2005-12-19 Thread Ledio Ago
I tried separating the Tomcat into a different machine and bingo. The performance went up by 30%%. Right now I only have two machines with 900K URLs each that act as Nutch servers and one machine that hosts the Tomcat. At this time I don't suspect any more that Tomcat is synchronously

Re: [Nutch-dev] distributed search

2005-12-19 Thread Stefan Groschupf
By the way, is there an easy way to split the index I have already have. I would hate to recrawl all of the 1.9MM URLs again and waste bandwidth. Well I do not know any tool that comes with nutch or a other tool that does it, may there is one. But to write a java class that creates two

Re: [Nutch-dev] distributed search

2005-12-19 Thread Rafi Iz
across several machines. e.g. bin/nutch org.apache.nutch.tools.FetchListTool .. Rafi From: Stefan Groschupf [EMAIL PROTECTED] Reply-To: nutch-dev@lucene.apache.org To: nutch-dev@lucene.apache.org Subject: Re: [Nutch-dev] distributed search Date: Tue, 20 Dec 2005 00:38:22 +0100 By the way

RE: [Nutch-dev] distributed search

2005-12-19 Thread Ledio Ago
I have the book so I'll check what I can do with the API. Thanks Stefan, Ledio -Original Message- From: Stefan Groschupf [mailto:[EMAIL PROTECTED] Sent: Monday, December 19, 2005 3:38 PM To: nutch-dev@lucene.apache.org Subject: Re: [Nutch-dev] distributed search By the way