Håvard W. Kongsgård wrote:
- I want to index about 50 – 100 sites with lots of documents, is it best use the Intranet Crawling or Whole-web Crawling method.
The "intranet" style is simpler and hence a good place to start. If it doesn't work well for you then you might try the "whole-web" style.
- Is the crawl auto updated in nutch, or must I run a cron task
It is not auto-updated. Doug ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
