After doing an initial crawl how do you keep that directory current. How often should a intranet crawl be run. Should this be a cron job and do I have to restart tomcat after each crawl?
Andy -----Original Message----- From: Tom White [mailto:[EMAIL PROTECTED] Sent: Wednesday, January 11, 2006 4:21 AM To: [email protected] Subject: Introduction to Nutch, Part 1: Crawling Hi, I've written an article about using Nutch at the intranet scale, which you may find interesting: http://today.java.net/pub/a/today/2006/01/10/introduction-to-nutch-1.htm l . Please post any comments on the article page itself. I've updated the wiki to link to it too. Regards, Tom
