Does /home/honda/nutch-0.7.2/crawl/segments/20060619230003/index exist at all?

Can you confirm that all segments contain index directory?

Rgrds,. Thomas


On 6/23/06, Honda-Search Administrator <[EMAIL PROTECTED]> wrote:
> To recrawl I use the command:
>
> /home/honda/nutch-0.7.2/recrawl.sh /home/honda/nutch-0.7.2/crawl 1 2
>
> "crawl" is the name of my database directory.
>
> The script "recrawl.sh" is the standard one that comes in the package.  I'm
> pretty sure it's the same for everyone, but I've included a link to the
> recrawl.sh script I'm using:
>
> http://www.honda-search.com/script.html
>
> As you can see I'm crawling with a depth of 1, which is intentional.  I only
> desire to recrawl the specific pages injected each night.  I'm wondering if
> the 'adddays' parameter is messing me up.
>
> Matt
>
> ----- Original Message -----
> From: "TDLN" <[EMAIL PROTECTED]>
> To: <[email protected]>; "Honda-Search Administrator"
> <[EMAIL PROTECTED]>
> Sent: Friday, June 23, 2006 10:46 AM
> Subject: Re: ERROR when recrawling... can ANYONE help?
>
>
> > Please specify what exact sequence of commands you are using.
> >
> > For incremental crawling best to follow the "whole web" style process
> > as outlined in the tutorial. The one stop crawl command cannot be used
> > effectively for that.
> >
> > HTH Thomas
> >
>
>

Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to