Manish - to get a count for db_gone, Nutch readdb needs to check the whole CrawlDB to get stats, collecting additional information is no overhead, just a convenience. You get all the counts, or none, there is no other way and for good reason.
M. -----Original message----- > From:Manish Verma <[email protected]> > Sent: Wednesday 6th July 2016 1:00 > To: [email protected] > Subject: readdb get db_gone count > > Hi, > > We want to check db_gone count before issuing solr clean and if count is high > we don’t want to update solr and stop there itself. > I know we have readdb to pull this but it gives so many info, I just need > db_gone count. > > Regards, > MV

