Hi All,
I am trying to get Nutch running with some custom plugins on top of
HDFS.
The runtime/deploy directory appears to contain only a single .job
file and bin/nutch. I renamed the job file to nutch-1.5.job, as
suggested in sidbatra's post on 6/18/12, but now I am getting:
Caused by: java.lang.RuntimeException: x point org.apache.nutch.net.URLNormalizer not found.
    at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:122)
    at org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java:70)
    ... 22 more
Shouldn't this class be on the job's classpath when I run
"runtime/deploy/nutch crawl"? It is in the job file under
org/apache/nutch/net. What am I doing wrong?