Yes, it is an extension of Nutch. You could use its index filter code as an example. It does exactly what you want to do. I will send you the source off the list.
>-----Original Message----- >From: Savannah Beckett [mailto:[email protected]] >Sent: Tuesday, July 20, 2010 2:50 PM >To: [email protected] >Subject: Re: mysql > >I don't understand, it is some kind of a extension of nutch which seems >to have >nothing to do with my question. Even if it does, it seems to be a >overkill... > > > > >________________________________ >From: "[email protected]" <[email protected]> >To: [email protected] >Sent: Mon, July 19, 2010 9:46:03 PM >Subject: RE: mysql > >Hi Savannah, > >>-----Original Message----- >>From: Savannah Beckett [mailto:[email protected]] >>Sent: Tuesday, July 20, 2010 2:42 PM >>To: [email protected] >>Subject: mysql >> >>Hi, >> I use DriverManager.getConnection to connect to my mysql db and do >>query in my >>plugin that extend indexfilter. It seems that indexfilter is run for >>every url >>being indexed. It means that I have to open and close connection to >>mysql db >>each time a url is being indexed, it is unefficient. Is there a way to >>open and >>close connection to mysql db only once for all urls, not for each url? > >Yes, there is. See Arch index filter: > >http://www.atnf.csiro.au/computing/software/arch/ > >Regards, > >Arkadi > >> >>Thanks. >> >> >> > > > >

