Hi,
I'm using nutch-0.9, from the trunk. I've noticed a behavior
difference on a plugin unit test if I set the plugin.folders property on
nutch-site.xml vs. hadoop-site.xml. If I set it on nutch-site.xml, the
unit test works well, but an error is raised if it's on hadoop-site.xml
The error is:
[junit] WARN [main] (ParserFactory.java:196) - Canno initialize
parser parse-html (cause:
org.apache.nutch.plugin.PluginRuntimeException:
java.lang.ClassNotFoundException: org.apache.nutch.parse.html.HtmlParser
Is there a reason why the HtmlParser wouldn't be loaded when the
directory is specified on hadoop-site.xml?
Thanks in advance,
Ricardo J. Méndez
http://ricardo.strangevistas.net/