I have been wondering about the same question. My guess is that scoring happens when you run the fetch command.
This page may help: https://wiki.apache.org/nutch/NutchScoring

On Sat, Nov 12, 2016 at 10:07 PM, Michael Coffey <[email protected]> wrote:

> When the generator is used with -topN, it is supposed to choose the
> highest-scoring urls. In my case, all the urls in my db have a score of
> zero, except the ones injected.
> How can I cause scores to be computed and stored? I am using the standard
> crawl script. Do I need to enable the various webgraph lines in the script?
>

--
Yongyao Jiang
https://www.linkedin.com/in/yongyao-jiang-42516164
Ph.D. Student in Earth Systems and GeoInformation Sciences
NSF Spatiotemporal Innovation Center
George Mason University

