The most basic approach is a shell script that runs the commands from the
tutorial in order, with a cron job executing that script at a fixed interval
(every two hours in your case). For large crawls you also need a locking
mechanism so that a new run cannot start while the previous one is still
running.

http://wiki.apache.org/nutch/NutchTutorial
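
As a rough sketch of what that could look like, the functions below run one
generate/fetch/parse/updatedb/invertlinks cycle in the order the Nutch 1.x
tutorial lists them, and wrap it in a simple lock. The paths (bin/nutch,
crawl/, /tmp/nutch-recrawl.lock) and the function names run_cycle and
with_lock are assumptions for illustration, not something Nutch ships. It
also assumes the crawldb was already seeded once with the tutorial's inject
step. Check the command arguments against the tutorial for your version.

```shell
#!/bin/sh
# Sketch only: all paths below are assumptions, adjust them to your install.
NUTCH="${NUTCH:-bin/nutch}"                      # assumed Nutch launcher
CRAWL_DIR="${CRAWL_DIR:-crawl}"                  # assumed crawl directory
LOCK_DIR="${LOCK_DIR:-/tmp/nutch-recrawl.lock}"  # assumed lock location

# One crawl cycle, in the order the tutorial gives the commands.
run_cycle() {
    "$NUTCH" generate "$CRAWL_DIR/crawldb" "$CRAWL_DIR/segments" || return 1
    # Segment names are timestamps, so the last one listed is the newest.
    segment=$(ls -d "$CRAWL_DIR"/segments/* | tail -n 1)
    "$NUTCH" fetch "$segment" || return 1
    "$NUTCH" parse "$segment" || return 1
    "$NUTCH" updatedb "$CRAWL_DIR/crawldb" "$segment" || return 1
    "$NUTCH" invertlinks "$CRAWL_DIR/linkdb" -dir "$CRAWL_DIR/segments"
}

# Run a command only if no other run holds the lock.
# mkdir either creates the directory or fails, so it works as an atomic test.
with_lock() {
    if ! mkdir "$LOCK_DIR" 2>/dev/null; then
        echo "previous crawl still running, skipping this run" >&2
        return 1
    fi
    "$@"
    status=$?
    rmdir "$LOCK_DIR"
    return $status
}
```

The script would end with the line `with_lock run_cycle`, and a crontab entry
such as `0 */2 * * * /path/to/recrawl.sh` (path hypothetical) would start it
every two hours. Note that if the script is killed mid-run the lock directory
stays behind and has to be removed by hand.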

On Monday 14 November 2011 01:55:15 xander wrote:
> Hi,
> I want to write a shell script which will crawl data and update the database
> for me every 2 hours. Can you help me write a shell script for it? I am
> new to this and would appreciate any sort of help. You can point me to a
> useful link too.
> 
> thanks
> 
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Continuous-crawling-tp3497615p3505600.html
> Sent from the Nutch - User mailing list archive at Nabble.com.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
