I've begun trying to get Nutch 2.1 running in Hadoop. I have a single node
"cluster" running, and I have build Nutch 2.1 from source
(http://svn.apache.org/repos/asf/nutch/branches/2.1, r1388524). When I
issue the command to run Nutch thus:
ubuntu:~/apache-nutch-svn-2.1$ ~/hadoop-1.0.3/bin/hadoop jar
build/apache-nutch-2.1.job org.apache.nutch.crawl.Crawler urls -dir urls
-depth 3 -topN 5
then I get the following stack trace:
12/09/26 13:18:03 WARN store.DataStoreFactory: gora.properties not found,
properties will be empty.
12/09/26 13:18:03 WARN store.DataStoreFactory: gora.properties not found,
properties will be empty.
Exception in thread "main" org.apache.gora.util.GoraException:
java.io.IOException: java.lang.NullPointerException
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:167)
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:135)
at
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:214)
at org.apache.nutch.crawl.Crawler.runTool(Crawler.java:68)
at org.apache.nutch.crawl.Crawler.run(Crawler.java:136)
at org.apache.nutch.crawl.Crawler.run(Crawler.java:250)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.Crawler.main(Crawler.java:257)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57
)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl
.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: java.io.IOException: java.lang.NullPointerException
at org.apache.gora.sql.store.SqlStore.getConnection(SqlStore.java:747)
at org.apache.gora.sql.store.SqlStore.initialize(SqlStore.java:160)
at
org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.
java:102)
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java
:161)
... 13 more
Caused by: java.lang.NullPointerException
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:186)
at org.apache.gora.sql.store.SqlStore.getConnection(SqlStore.java:735)
... 16 more
Unzipping the .job file shows gora.properties exists in the root directory
of the archive.
I'm sure I'm just making a noob error can anyone point me in the right
direction?
Thanks,
-Ian.