I was wondering about the same question. My guess is that scoring happens
when you run the fetch command.

This page may help: https://wiki.apache.org/nutch/NutchScoring

On Sat, Nov 12, 2016 at 10:07 PM, Michael Coffey <[email protected]>
wrote:

> When the generator is used with -topN, it is supposed to choose the
> highest-scoring urls. In my case, all the urls in my db have a score of
> zero, except the ones injected.
> How can I cause scores to be computed and stored? I am using the standard
> crawl script. Do I need to enable the various webgraph lines in the script?
>



-- 
Yongyao Jiang
https://www.linkedin.com/in/yongyao-jiang-42516164
Ph.D. Student in Earth Systems and GeoInformation Sciences
NSF Spatiotemporal Innovation Center
George Mason University
