I did spend more time digging the code and I think the appropariate would be
have the custom Plugin implementation with Parser Extension point as
proposed in beginning.

All I have to do is 
1) Write a custom Plugin with Parser Extension point.
2) The getParse() of the custom Plugin must return null. The custom parser
plugin will be an additional parser being invoked appart from the the
configured one in this way.
The ParseUtil:;parse(...) code indicates that the multiple parsers can be
invoked if they continue returing null.
3) The getParse implementation should get the content from the WebPage and
parse it and produce the message anf finally push the message to the Kafka
Queue.

I am going to implement and test it.
Any comments and corrections are welcome.

Thanks,
Vicky




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Crawling-to-send-data-to-Kafka-tp4312320p4313952.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to