Hello :
I get a problem when I run the Crawl example in the eclipse follow the
RunNutchInEclipse guide
(http://wiki.apache.org/nutch/RunNutchInEclipse#RunNutchInEclipse). Errors will
show below.
My hadoop version is 0.20.2-cdh3u6 and the hbase is 0.90.6-chd3u6 , both are
setted to Pseudo-distributed mode. And they can work well. The nutch version is
2.x, from the SVN (http://svn.apache.org/repos/asf/nutch/branches/2.x).
Exception in thread "main" java.lang.ClassNotFoundException:
org.apache.gora.hbase.store.HBaseStore at
java.net.URLClassLoader$1.run(URLClassLoader.java:217) at
java.security.AccessController.doPrivileged(Native Method) at
java.net.URLClassLoader.findClass(URLClassLoader.java:205) at
java.lang.ClassLoader.loadClass(ClassLoader.java:321) at
sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:294) at
java.lang.ClassLoader.loadClass(ClassLoader.java:266) at
java.lang.Class.forName0(Native Method) at
java.lang.Class.forName(Class.java:188) at
org.apache.nutch.storage.StorageUtils.getDataStoreClass(StorageUtils.java:90)
at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:74)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221) at
org.apache.nutch.crawl.Crawler.runTool(Crawler.java:68) at
org.apache.nutch.crawl.Crawler.run(Crawler.java:136) at
org.apache.nutch.crawl.Crawler.run(Crawler.java:250) at
org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:
65) at org.apache.nutch.crawl.Crawler.main(Crawler.java:257)
In my ivy.xml :
<!-- Uncomment this to use HBase as Gora backend. -->
<dependency org="org.apache.gora" name="gora-hbase" rev="0.2"
conf="*->default" />
In my nutch-site.xml:
<property>
<name>storage.data.store.class</name>
<value>org.apache.gora.hbase.store.HBaseStore</value>
<description>Default class for storing data</description>
</property>
<property>
<name>http.agent.name</name>
<value>My Nutch Spider</value>
</property>
The nutch's hbase-site.xml file is copyed from the hbase.
XUPT.cn
Hechuan