After doing an initial crawl, how do you keep that directory current? How often should an intranet crawl be run? Should this be a cron job, and do I have to restart Tomcat after each crawl?
Andy

-----Original Message-----
From: Tom White [mailto:[EMAIL PROTECTED]
Sent: Wednesday, January 11, 2006 4:21 AM
To: [email protected]
Subject: Introduction to Nutch, Part 1: Crawling

Hi,

I've written an article about using Nutch at the intranet scale, which you may find interesting:

http://today.java.net/pub/a/today/2006/01/10/introduction-to-nutch-1.html

Please post any comments on the article page itself. I've updated the wiki to link to it too.

Regards,
Tom
