Hello, I still have some questions about Nutch:
- I want to index about 50 to 100 sites with lots of documents. Is it
best to use the Intranet Crawling or the Whole-web Crawling method?
- Is the crawl updated automatically in Nutch, or must I run a cron task?
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general