To recrawl I use the command:
/home/honda/nutch-0.7.2/recrawl.sh /home/honda/nutch-0.7.2/crawl 1 2
"crawl" is the name of my database directory.
The script "recrawl.sh" is the standard one that comes in the package. I'm
pretty sure it's the same for everyone, but I've included a link to the
recrawl.sh script I'm using:
http://www.honda-search.com/script.html
As you can see, I'm crawling with a depth of 1, which is intentional: I only
want to recrawl the specific pages injected each night. I'm wondering whether
the 'adddays' parameter is what's causing the problem.
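
For reference, here is a rough sketch of the generate/fetch/updatedb loop that
a recrawl script of this kind runs. It is not a copy of the linked script. It
assumes the crawl directory contains "db" and "segments" subdirectories and
that the two numeric arguments are depth and adddays, in that order. The NUTCH
variable and the recrawl_cycle function name are made up for illustration.

```sh
#!/bin/sh
# Sketch only: one recrawl pass in the "whole web" style.
# NUTCH is a hypothetical override; set NUTCH="echo bin/nutch" to print the
# commands instead of running them.
NUTCH=${NUTCH:-bin/nutch}

recrawl_cycle() {
  crawl_dir=$1
  depth=${2:-1}      # number of generate/fetch/updatedb rounds
  adddays=${3:-0}    # days added to "now" when choosing pages due for refetch
  webdb_dir=$crawl_dir/db              # assumed layout
  segments_dir=$crawl_dir/segments     # assumed layout

  i=1
  while [ "$i" -le "$depth" ]; do
    # Build a fetchlist of pages that are due, with the clock shifted by adddays.
    $NUTCH generate "$webdb_dir" "$segments_dir" -adddays "$adddays" || return 1
    # Segment names are timestamps, so the last one listed is the newest.
    segment=$(ls -d "$segments_dir"/* | tail -n 1)
    $NUTCH fetch "$segment" || return 1
    # Fold the fetch results back into the web database.
    $NUTCH updatedb "$webdb_dir" "$segment" || return 1
    i=$((i + 1))
  done
}
```

Under those assumptions, my command above corresponds to
recrawl_cycle /home/honda/nutch-0.7.2/crawl 1 2, i.e. a single round with
-adddays 2. Indexing and merging would still have to follow as separate steps.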
Matt
----- Original Message -----
From: "TDLN" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "Honda-Search Administrator"
<[EMAIL PROTECTED]>
Sent: Friday, June 23, 2006 10:46 AM
Subject: Re: ERROR when recrawling... can ANYONE help?
Please specify the exact sequence of commands you are using.
For incremental crawling it is best to follow the "whole web" style process
as outlined in the tutorial. The one-stop crawl command cannot be used
effectively for that.
HTH Thomas