RE: How can I Score?

2016-11-17 Thread Vladimir Loubenski
@nutch.apache.org Subject: Re: How can I Score? Hi, one solution is to include the plugin scoring-opic in the property plugin.includes That should work given that injected URLs have a non-zero score and are not redirects. OPIC is a good choice for frontier selection (via -topN). For Nutch 1.x the webgraph

Re: How can I Score?

2016-11-17 Thread Sebastian Nagel
Hi, one solution is to include the plugin scoring-opic in the property plugin.includes That should work given that injected URLs have a non-zero score and are not redirects. OPIC is a good choice for frontier selection (via -topN). For Nutch 1.x the webgraph steps in combination with

Re: How can I Score?

2016-11-16 Thread Furkan KAMACI
15 > > To: user@nutch.apache.org > > Subject: Re: How can I Score? > > > > Aha! I was wrong when I said I was using all default settings. I forgot > I had followed a tutorial that told mem to put |scoring-depth| instead of > |scoring-opic| into the plugin.includes pro

RE: How can I Score?

2016-11-16 Thread Markus Jelsma
er 2016 7:15 > To: user@nutch.apache.org > Subject: Re: How can I Score? > > Aha! I was wrong when I said I was using all default settings. I forgot I had > followed a tutorial that told mem to put |scoring-depth| instead of > |scoring-opic| into the plugin.includes property. No

Re: How can I Score?

2016-11-15 Thread Michael Coffey
ovember 15, 2016 12:09 AM Subject: Re: How can I Score? Hi Michael, Replies inline On Sat, Nov 12, 2016 at 7:10 PM, <user-digest-h...@nutch.apache.org> wrote: > From: Michael Coffey <mcof...@yahoo.com.invalid> > To: "user@nutch.apache.org" <user@nutch.apache.or

Re: How can I Score?

2016-11-15 Thread lewis john mcgibbney
Hi Michael, Replies inline On Sat, Nov 12, 2016 at 7:10 PM, <user-digest-h...@nutch.apache.org> wrote: > From: Michael Coffey <mcof...@yahoo.com.invalid> > To: "user@nutch.apache.org" <user@nutch.apache.org> > Cc: > Date: Sun, 13 Nov 2016 03:07:16 +

Re: How can I Score?

2016-11-12 Thread Yongyao Jiang
I was thinking about the same question too. My guess is scoring happens when you run fetch command. This page may help, https://wiki.apache.org/nutch/NutchScoring On Sat, Nov 12, 2016 at 10:07 PM, Michael Coffey wrote: > When the generator is used with -topN, it is

How can I Score?

2016-11-12 Thread Michael Coffey
When the generator is used with -topN, it is supposed to choose the highest-scoring urls. In my case, all the urls in my db have a score of zero, except the ones injected. How can I cause scores to be computed and stored? I am using the standard crawl script. Do I need to enable the various