I must have had something hosed in the configuration, because I started
over from scratch and it seems like everything works except for the need
to rename apache-nutch-1.5.job to nutch-1.5.job. Thanks again for your
help.
-------- Original Message --------
Subject: Re: Nutch 1.5 On Hadoop?
From: Lewis John Mcgibbney <[email protected]>
Date: Wed, June 20, 2012 4:13 pm
To: [email protected]
What Configuration bean settings are you using for plugin.includes?
Are there any unusual settings? Have you tried running a test crawl
without your custom plugins to ensure that the core Nutch
functionality is working OK?
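
For comparison, a plugin.includes override in conf/nutch-site.xml might
look roughly like the sketch below. This is an assumption, not your
actual config: the value is modelled on the stock 1.x default, and
myplugin-custom is a hypothetical placeholder for a custom plugin id.
An override that accidentally drops the stock entries is one thing
worth ruling out.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Hypothetical override; "myplugin-custom" is a placeholder id. -->
  <property>
    <name>plugin.includes</name>
    <!-- Stock-style entries kept, custom plugin appended at the end. -->
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)|myplugin-custom</value>
  </property>
</configuration>
```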
On Wed, Jun 20, 2012 at 9:04 PM, <[email protected]> wrote:
>
> Hi All,
>
> I am trying to get Nutch running with some custom plugins on top of
> HDFS.
>
> It seems like in the runtime/deploy directory there is only a single
> .job file and a bin/nutch. I renamed the job to nutch-1.5.job as
> suggested in sidbatra's post on 6/18/12, but now I am getting:
>
> Caused by: java.lang.RuntimeException: x point org.apache.nutch.net.URLNormalizer not found.
>     at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:122)
>     at org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java:70)
>     ... 22 more
>
> Shouldn't this class be on the class path for the job when I run
> "runtime/deploy/nutch crawl"? It's in the job file at
> org/apache/nutch/net. What am I doing wrong?
>
--
Lewis