I'm having trouble getting Nutch 1.12 to run on Hadoop 2.7.3.
I get a ClassNotFoundException for org.apache.nutch.crawl.Crawl, as in the
following attempt:
$HADOOP_HOME/bin/hadoop jar "/home/mjc/apache-nutch-1.12/runtime/deploy/apache-nutch-1.12.job" org.apache.nutch.crawl.Crawl seed -dir seed -depth 1 -topN 5
Exception in thread "main" java.lang.ClassNotFoundException: org.apache.nutch.crawl.Crawl
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)

Searching the web, I see that things seem to have changed in recent versions of 
Nutch. However, I have not been able to find a good tutorial or step-by-step 
guide for how to get this to work. I would appreciate any advice you could 
give. Is there documentation somewhere? Should I be using an older version?
