I am trying Nutch for the first time. I created an automated docker setup
to load
Nutch 2 + Hbase (i had tried cassandra but could not get it to work so i
thought i start with Hbase to give it a try)

The project is available at https://github.com/bizmate/nutch
and with docker compose you can start the containers with a running
instance of Nutch  exposed on 8899 and Hbase.

in gora.properties i already enabled hbase

gora.datastore.default=org.apache.gora.hbase.store.HBaseStore


But i get Hbase class not found error when I run this command.

root@87b87f55835e:/opt/nutch# bin/nutch inject urls.txt

InjectorJob: starting at 2016-05-07 08:37:49

InjectorJob: Injecting urlDir: urls.txt

*InjectorJob: java.lang.ClassNotFoundException:
org.apache.gora.hbase.store.HBaseStore*

at java.net.URLClassLoader$1.run(URLClassLoader.java:366)

at java.net.URLClassLoader$1.run(URLClassLoader.java:355)

at java.security.AccessController.doPrivileged(Native Method)

at java.net.URLClassLoader.findClass(URLClassLoader.java:354)

at java.lang.ClassLoader.loadClass(ClassLoader.java:425)

at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)

at java.lang.ClassLoader.loadClass(ClassLoader.java:358)

at java.lang.Class.forName0(Native Method)

at java.lang.Class.forName(Class.java:190)

at
org.apache.nutch.storage.StorageUtils.getDataStoreClass(StorageUtils.java:89)

at
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:73)

at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)

at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)

at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)

at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)

at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

Suggestions?

Reply via email to