After doing an initial crawl how do you keep that directory current.
How often should a intranet crawl be run.  Should this be a cron job and
do I have to restart tomcat after each crawl?

Andy
-----Original Message-----
From: Tom White [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, January 11, 2006 4:21 AM
To: [email protected]
Subject: Introduction to Nutch, Part 1: Crawling

Hi,

I've written an article about using Nutch at the intranet scale, which
you may find interesting:
http://today.java.net/pub/a/today/2006/01/10/introduction-to-nutch-1.htm
l .
Please post any comments on the article page itself.

I've updated the wiki to link to it too.

Regards,

Tom

Reply via email to