Welcome to the world of post 1.3 Nutch ;)

On Thursday, February 21, 2013, Amit Sela <[email protected]> wrote:
> I basically just built with ant and copied the contents of deploy (job file
> + nutch and crawl scripts) to "nutch" folder in my hadoop-user directory on
> the master.
>
> I changed the crawl script to work only in distributed mode and it seems to
> work... though I am getting a lot of Child Error exceptions in one of the
> nodes (not the master)
> while another node seems to work fine (total 1 master + 2 slaves).
>
> Could it be so simple ? am I missing something ?
>
> Thanks
>
> On Thu, Feb 21, 2013 at 6:21 PM, Julien Nioche <[email protected]> wrote:
>
>> https://wiki.apache.org/nutch/NutchHadoopTutorial
>>
>> basically follow the steps in
>> http://hadoop.apache.org/docs/stable/cluster_setup.html then install Nutch
>> on the master node of your cluster, 'cd runtime/deploy/bin' and use the
>> nutch scripts as usual. You can then use the standard Mapreduce webapp to
>> monitor the progress of your crawl
>>
>> Julien
>>
>> On 21 February 2013 10:00, Amit Sela <[email protected]> wrote:
>>
>> > Anyone have a good tutorial about deploying nutch (1.6) on a pre-existing
>> > Hadoop cluster ?
>> >
>> > Thanks.
>>
>> --
>> *
>> *Open Source Solutions for Text Engineering
>>
>> http://digitalpebble.blogspot.com/
>> http://www.digitalpebble.com
>> http://twitter.com/digitalpebble

-- *Lewis*

