Hi,

Nutch loads its configuration from nutch-site and nutch-default.xml and not
from hadoop conf files so the behavior is correct.

HTH,

Gal.


On 3/1/07, "Ricardo J. Méndez" <[EMAIL PROTECTED]> wrote:

Hi,

I'm using nutch-0.9, from the trunk.    I've noticed a behavior
difference on a plugin unit test if I set the plugin.folders property on
nutch-site.xml vs. hadoop-site.xml.  If I set it on nutch-site.xml, the
unit test works well, but an error is raised if it's on hadoop-site.xml

The error is:

   [junit]  WARN [main] (ParserFactory.java:196) - Canno initialize
parser parse-html (cause:
org.apache.nutch.plugin.PluginRuntimeException:
java.lang.ClassNotFoundException: org.apache.nutch.parse.html.HtmlParser


Is there a reason why the HtmlParser wouldn't be loaded when the
directory is specified on hadoop-site.xml?

Thanks in advance,




Ricardo J. Méndez
http://ricardo.strangevistas.net/

Reply via email to