I am looking at something similar.

I would guess the place to put it is the indexer. As I understand it the parser runs for just about everything fetched, however the indexer is only run for pages you want to index. I am also looking at having static objects (Eg a connection) that is initialise when the plugin is loaded, ideally through the startup method.

Regards

John

Hey all,
I have writen a custom HTML parser and indexer.  I would like to save some
information that I have gathered during the parse in a Mysql DB.  I imagine
there could be some performance hit here (e.g. connecting to db).  What's
the best place to add code to save this information - the parser or the
indexer?

-Mike
--
View this message in context: 
http://www.nabble.com/Saving-Metadata-to-Mysql-t1389216.html#a3732992
Sent from the Nutch - User forum at Nabble.com.




-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to