To recrawl I use the command: /home/honda/nutch-0.7.2/recrawl.sh /home/honda/nutch-0.7.2/crawl 1 2
"crawl" is the name of my database directory. The script "recrawl.sh" is the standard one that comes in the package. I'm pretty sure it's the same for everyone, but I've included a link to the recrawl.sh script I'm using: http://www.honda-search.com/script.html As you can see I'm crawling with a depth of 1, which is intentional. I only desire to recrawl the specific pages injected each night. I'm wondering if the 'adddays' parameter is messing me up. Matt ----- Original Message ----- From: "TDLN" <[EMAIL PROTECTED]> To: <[email protected]>; "Honda-Search Administrator" <[EMAIL PROTECTED]> Sent: Friday, June 23, 2006 10:46 AM Subject: Re: ERROR when recrawling... can ANYONE help? > Please specify what exact sequence of commands you are using. > > For incremental crawling best to follow the "whole web" style process > as outlined in the tutorial. The one stop crawl command cannot be used > effectively for that. > > HTH Thomas > Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
