To recrawl I use the command:

/home/honda/nutch-0.7.2/recrawl.sh /home/honda/nutch-0.7.2/crawl 1 2

"crawl" is the name of my database directory.

The script "recrawl.sh" is the standard one that comes in the package.  I'm 
pretty sure it's the same for everyone, but I've included a link to the 
recrawl.sh script I'm using:

http://www.honda-search.com/script.html

As you can see, I'm crawling with a depth of 1, which is intentional: I only 
want to recrawl the specific pages injected each night.  I'm wondering 
whether the 'adddays' parameter is what's causing the error.
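
For reference, here is a rough dry-run sketch of what I understand one 
recrawl round to do with those arguments.  It only prints the commands and 
does not run them.  The argument order (crawl_dir, depth, adddays), the 
"db" and "segments" subdirectory names, and the plan_recrawl function name 
are my assumptions for illustration, not copied from the script itself:

```shell
#!/bin/sh
# Dry-run sketch: prints the commands for each recrawl round, runs nothing.
# ASSUMPTIONS: argument order is crawl_dir, depth, adddays; the crawl
# directory contains "db" and "segments" subdirectories.
plan_recrawl() {
  crawl_dir=$1
  depth=$2
  adddays=$3
  i=1
  while [ "$i" -le "$depth" ]; do
    # -adddays N makes generate act as if the clock were N days ahead,
    # so pages due for refetch within N days are selected now.
    echo "bin/nutch generate $crawl_dir/db $crawl_dir/segments -adddays $adddays"
    # "<newest segment>" is a placeholder for the segment generate just created.
    echo "bin/nutch fetch <newest segment>"
    echo "bin/nutch updatedb $crawl_dir/db <newest segment>"
    i=$((i + 1))
  done
}

plan_recrawl /home/honda/nutch-0.7.2/crawl 1 2
```

With a depth of 1 this should come out as a single generate/fetch/updatedb 
round, which is all I want for the nightly injected pages.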

Matt

----- Original Message ----- 
From: "TDLN" <[EMAIL PROTECTED]>
To: <[email protected]>; "Honda-Search Administrator" 
<[EMAIL PROTECTED]>
Sent: Friday, June 23, 2006 10:46 AM
Subject: Re: ERROR when recrawling... can ANYONE help?


> Please specify what exact sequence of commands you are using.
>
> For incremental crawling it is best to follow the "whole web" style process
> as outlined in the tutorial. The one-stop crawl command cannot be used
> effectively for that.
>
> HTH Thomas
>

