After doing an initial crawl how do you keep that directory current.
How often should a intranet crawl be run.  Should this be a cron job and
do I have to restart tomcat after each crawl?

Andy
-----Original Message-----
From: Tom White [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, January 11, 2006 4:21 AM
To: [email protected]
Subject: Introduction to Nutch, Part 1: Crawling

Hi,

I've written an article about using Nutch at the intranet scale, which
you may find interesting:
http://today.java.net/pub/a/today/2006/01/10/introduction-to-nutch-1.htm
l .
Please post any comments on the article page itself.

I've updated the wiki to link to it too.

Regards,

Tom


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to