I am looking at something similar.

I would guess the place to put it is the indexer. As I understand it, the parser runs for just about everything fetched, whereas the indexer runs only for pages you want to index. I am also looking at having static objects (e.g. a connection) that are initialised when the plugin is loaded, ideally through the startup method.
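
For the static-object idea, here is a rough sketch in plain Java of what I have in mind. It is not taken from Nutch: `SharedResource` is a made-up helper name, and the factory passed to it stands in for whatever actually opens the connection. The point is only that the expensive object is created once, on first use, and reused afterwards.

```java
import java.util.function.Supplier;

/**
 * Hypothetical helper (not part of Nutch): holds one shared resource,
 * such as a database connection, and creates it on first use only.
 */
final class SharedResource<T> {
    private final Supplier<T> factory; // assumed: opens the real resource
    private T instance;                // guarded by "this"

    SharedResource(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Returns the shared instance, calling the factory on the first call only. */
    synchronized T get() {
        if (instance == null) {
            instance = factory.get();
        }
        return instance;
    }
}
```

A plugin class could keep one of these in a static field and call `get()` either from its startup hook or the first time it needs to write. For a real database the factory would wrap the JDBC call that opens the connection and handle its checked `SQLException`.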

Regards

John

Hey all,
I have written a custom HTML parser and indexer.  I would like to save some
information that I have gathered during the parse in a MySQL DB.  I imagine
there could be some performance hit here (e.g. connecting to the DB).  What's
the best place to add code to save this information - the parser or the
indexer?

-Mike
--
View this message in context: 
http://www.nabble.com/Saving-Metadata-to-Mysql-t1389216.html#a3732992
Sent from the Nutch - User forum at Nabble.com.

