HI,
I want to use my own html parser instead of nutch parse-html...
I created the plugin for my parser and then added to the entry
of appropriate mime type in parse-plugins.xml
I also added my plugin to plugin.includes .. while have removed parse-html
from the same...

The problem is that while crawling nutch gives an error that unable to find
any plugin for the mime type application/xhtml + xml and do not parse the
downloaded file.( I have added my parser for the above mime type in
parse-plugin.xml)

Can somebody please help me on this.. whether I have to make some other
changes??
I am using nutch 1.1

I did similar thing for rss parser and it worked successfully..
regards
Sourabh

Reply via email to