Hi All,

I am trying to get Nutch running with some custom plugins on top of
HDFS.

It seems like in the runtime/deploy directory there is only a single
.job file and a bin/nutch. I renamed the job to nutch-1.5.job as
suggested in sidbatra's post on 6/18/12, but now I am getting:

Caused by: java.lang.RuntimeException: x point
org.apache.nutch.net.URLNormalizer not found.
        at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:122)
        at
org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java:70)
        ... 22 more

Shouldn't this class be on the class path for the job when I run
"runtime/deploy/nutch crawl"? It's in the job file at
org/apache/nutch/net.  What am I doing wrong?


Reply via email to