Tejas , that [0] is for nutch 1.x which uses hdfs for its data storage. And as new nutch 2.x uses hbase (backend) which is already based on hadoop (hdfs). If I deploy my hbase in cluster mode on 3 different nodes then do I still need to deploy nutch 2.x on these 3 nodes as well ?
Could you please care to add some little more information for nutch2.x + hbase + hadoop ? Regards, Khan [0] : http://wiki.apache.org/nutch/NutchHadoopTutorial On Sat, Jun 29, 2013 at 10:50 PM, Tejas Patil <[email protected]>wrote: > On Sat, Jun 29, 2013 at 10:36 AM, imran khan <[email protected] > >wrote: > > > Greetings, > > > > Is there any guide for setting up nutch2.x in cluster mode ? > > > > [0] is a relevant wiki page .. which has not been updated since a long > time. > I am guessing that you have already tried running in local mode as given in > [1]. For cluster mode, have hadoop 1.2.0 setup and its variables exported, > set nutch configs as per your requirements, run 'ant' and then run nutch > commands from $NUTCH_HOME/runtime/deploy > > > > And which versions of hadopp nutch2.x/hbase works well in cluster mode ? > > > > Use Nutch 2.2 and HBase 0.90.x > > > > > Regards, > > Khan > > > > [0] : http://wiki.apache.org/nutch/NutchHadoopTutorial > [1] : http://wiki.apache.org/nutch/Nutch2Tutorial >

