Hi Ketan, <http://www.mail-archive.com/[email protected]&q=from:%22Ketan+Bhokray%22> <http://www.mail-archive.com/[email protected]&q=from:%22Ketan+Bhokray%22> On Wed, Nov 18, 2015 at 2:00 AM, <[email protected]> wrote:
> > Nutch+Hbase on EMR CLASSPATH issue > 31865 by: Ketan Bhokray > > > I'm a hadoop newbie and trying to run Nutch 2.3, with Hbase as backend, on > EMR. Since Nutch uses hadoop-1.2.0, we chose the AMI version:2.4.2 which > comes with Hadoop 1.0.3 and HBase 0.92.0. > > When I build Nutch, it is crawling without problem on local mode. But when > run in distributed mode, the job stops at injector step with the following > exception: Can you please try running 2.X-SNAPSHOT from source? http://svn.apache.org/repos/asf/nutch/branches/2.x/ This works with the following stack Apache Avro 1.7.6 Apache Hadoop 1.2.1 and 2.5.2 Apache HBase 0.98.8-hadoop2 (although also tested with 1.X) Apache Cassandra 2.0.2 Apache Solr 4.10.3 MongoDB 2.6.X Apache Accumlo 1.5.1 Apache Spark 1.4.1 Please let us know how you get on. If this does not work then Nutch trunk runs flawlesssly on EMR. Thanks

