I *think* newly injected URLs are ready to go next time bin/nutch generate is 
called.
The place to dig is Generator and whichever class is specified as the reducer 
(the one that selects the best URLs based on their score).

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
> From: gabriele renzi <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Wednesday, April 30, 2008 6:15:50 AM
> Subject: score of freshly injected urls
> 
> hi everyone,
> 
> I'm using nutch happily for crawling a small urldb, but I'd like to
> know something before digging into the source code to find out:
> if I run an inject at the beginning of a crawl loop, what will be the
> score of the newly injected data?
> 
> My interest is to have new urls pop up in the very next crawl, but I'm
> not sure if this is how things work nor if I can use a setting
> somewhere.
> If this is not the case (this was my impression) do you have any hints
> on where to look to tweak this behaviour?
> 
> Thanks in advance to anyone.
> 
> -- 
> blog it: http://riffraff.blogsome.com
> blog en: http://www.riffraff.info
> 


Reply via email to