I am looking at something similar.
I would guess the place to put it is the indexer. As I understand it, the
parser runs for just about everything fetched, whereas the indexer only
runs for pages you want to index.
I am also looking at having static objects (e.g. a connection) that are
initialised when the plugin is loaded, ideally through the startup method.
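For what it's worth, here is a rough sketch of the static-object idea in plain Java. It assumes nothing about the Nutch plugin API: SharedResource is a hypothetical helper class, not something Nutch provides, and the factory you pass in stands in for whatever actually opens the connection. The object is created once, on first use, and every later call gets the same instance back.

```java
import java.util.function.Supplier;

/**
 * Hypothetical helper (not part of Nutch): builds one shared object
 * the first time it is requested and reuses it afterwards.
 */
final class SharedResource<T> {
    // Assumption: the factory wraps the expensive setup, e.g. opening a DB connection.
    private final Supplier<T> factory;

    // volatile so a fully constructed instance is visible to all threads.
    private volatile T instance;

    SharedResource(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Returns the shared instance, creating it on the first call only. */
    T get() {
        T local = instance;
        if (local == null) {
            synchronized (this) {
                local = instance;
                if (local == null) {
                    // Runs at most once, provided the factory returns non-null.
                    local = factory.get();
                    instance = local;
                }
            }
        }
        return local;
    }
}
```

In a plugin this could be held in a static field and given a factory that opens the MySQL connection, so the connection cost is paid once rather than once per page. If the plugin does turn out to have a usable startup hook, calling get() from there would simply make the initialisation eager.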
Regards
John
Hey all,
I have written a custom HTML parser and indexer. I would like to save some
information that I have gathered during the parse in a MySQL database. I
imagine there could be some performance hit here (e.g. connecting to the
database). What's the best place to add code to save this information: the
parser or the indexer?
-Mike