I *think* newly injected URLs are ready to go next time bin/nutch generate is called. The place to dig is Generator and whichever class is specified as the reducer (the one that selects the best URLs based on their score).
Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: gabriele renzi <[EMAIL PROTECTED]> > To: [email protected] > Sent: Wednesday, April 30, 2008 6:15:50 AM > Subject: score of freshly injected urls > > hi everyone, > > I'm using nutch happily for crawling a small urldb, but I'd like to > know something before digging into the source code to find out: > if I run an inject at the beginning of a crawl loop, what will be the > score of the newly injected data? > > My interest is to have new urls pop up in the very next crawl, but I'm > not sure if this is how things work nor if I can use a setting > somewhere. > If this is not the case (this was my impression) do you have any hints > on where to look to tweak this behaviour? > > Thanks in advance to anyone. > > -- > blog it: http://riffraff.blogsome.com > blog en: http://www.riffraff.info >
