I am looking at something similar.
I would guess the place to put it is the indexer. As I understand it the
parser runs for just about everything fetched, however the indexer is
only run for pages you want to index.
I am also looking at having static objects (Eg a connection) that is
initialise when the plugin is loaded, ideally through the startup method.
Regards
John
Hey all,
I have writen a custom HTML parser and indexer. I would like to save some
information that I have gathered during the parse in a Mysql DB. I imagine
there could be some performance hit here (e.g. connecting to db). What's
the best place to add code to save this information - the parser or the
indexer?
-Mike
--
View this message in context:
http://www.nabble.com/Saving-Metadata-to-Mysql-t1389216.html#a3732992
Sent from the Nutch - User forum at Nabble.com.