Hi, one solution is to include the plugin scoring-opic in the property plugin.includes
That should work given that injected URLs have a non-zero score and are not redirects. OPIC is a good choice for frontier selection (via -topN). For Nutch 1.x the webgraph steps in combination with scoring-link are an alternative, esp. if you want to use the score mainly in the index to rank search results. Best, Sebastian On 11/13/2016 04:17 AM, Yongyao Jiang wrote: > I was thinking about the same question too. My guess is scoring happens > when you run fetch command. > > This page may help, https://wiki.apache.org/nutch/NutchScoring > > On Sat, Nov 12, 2016 at 10:07 PM, Michael Coffey <mcof...@yahoo.com.invalid> > wrote: > >> When the generator is used with -topN, it is supposed to choose the >> highest-scoring urls. In my case, all the urls in my db have a score of >> zero, except the ones injected. >> How can I cause scores to be computed and stored? I am using the standard >> crawl script. Do I need to enable the various webgraph lines in the script? >> > > >