Hello,

I have a question about Nutch 2 plugin implementation.

I am implementing an image parser. It worked fine in Nutch 1.5, but
after I migrated the code to Nutch 2.0 I started getting errors. I have
spent several hours on them and have not been able to trace the cause. I
would appreciate any insight from the mailing list.

While parsing the fetched content, I got the following error in
logs/hadoop.log:
2012-08-03 18:28:25,304 ERROR parse.ParserFactory - PluginRuntimeException
org.apache.nutch.plugin.PluginRuntimeException: java.lang.ClassNotFoundException: <my plugin class name>
        at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:166)
        at org.apache.nutch.parse.ParserFactory.getFields(ParserFactory.java:209)
        at org.apache.nutch.parse.ParserJob.getFields(ParserJob.java:191)
        at org.apache.nutch.parse.ParserJob.run(ParserJob.java:243)
        at org.apache.nutch.parse.ParserJob.parse(ParserJob.java:257)
        at org.apache.nutch.parse.ParserJob.run(ParserJob.java:300)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.parse.ParserJob.main(ParserJob.java:304)
Caused by: java.lang.ClassNotFoundException: <my plugin class name>
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:423)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:356)
        at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:156)
        ... 7 more
2012-08-03 18:28:25,654 INFO  crawl.SignatureFactory - Using Signature impl: org.apache.nutch.crawl.MD5Signature

What I did was copy the minimal necessary files from other plugin
folders and modify them to suit my needs. Then I edited nutch-site.xml to
include my plugin and edited parse-plugins.xml to register the MIME type.
I added parse-image to the 2 packageset entries in
<nutch-source>/build.xml, added Ant targets under deploy and clean in
<nutch-source>/src/plugin/build.xml, and then rebuilt everything. (This is
what I did in Nutch 1.5, where it worked, but no luck with Nutch 2.)
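
Since the failure is a ClassNotFoundException, one check that might narrow
things down is whether the compiled class really ends up inside the built
plugin jar under the exact name the plugin descriptor refers to. Below is a
minimal sketch of such a check using only the JDK. The class JarClassCheck
and its method names are hypothetical helpers made up for illustration, not
part of Nutch, and the jar path and class name are whatever applies to the
plugin in question.

```java
import java.io.IOException;
import java.util.jar.JarFile;

// Hypothetical helper (not part of Nutch): reports whether a jar file
// contains the compiled .class entry for a fully qualified class name.
public class JarClassCheck {

    // Convert a fully qualified class name to its entry path inside a jar,
    // e.g. "a.b.C" becomes "a/b/C.class".
    static String toEntryName(String className) {
        return className.replace('.', '/') + ".class";
    }

    // Return true if the jar at jarPath contains an entry for className.
    static boolean jarContainsClass(String jarPath, String className) throws IOException {
        try (JarFile jar = new JarFile(jarPath)) {
            return jar.getJarEntry(toEntryName(className)) != null;
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: JarClassCheck <plugin.jar> <fully.qualified.ClassName>");
            System.exit(2);
        }
        boolean found = jarContainsClass(args[0], args[1]);
        System.out.println(args[1] + (found ? " found in " : " NOT found in ") + args[0]);
    }
}
```

If the class is reported as not found, the problem is likely in the build
or packaging step; if it is found, the mismatch is more likely in how the
plugin is declared or loaded at runtime.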

Could you advise what else I might be missing, or what further information
I should provide? Thank you very much!


Regards,
Ake Tangkananond

