@nutch.apache.org
Subject: Re: How can I Score?
Hi,
one solution is to include the plugin
scoring-opic
in the property
plugin.includes
That should work given that injected URLs have a non-zero score and are not
redirects. OPIC is a good choice for frontier selection (via -topN).
For Nutch 1.x the webgraph
Hi,
one solution is to include the plugin
scoring-opic
in the property
plugin.includes
That should work given that injected URLs have a non-zero score and are
not redirects. OPIC is a good choice for frontier selection (via -topN).
For Nutch 1.x the webgraph steps in combination with
15
> > To: user@nutch.apache.org
> > Subject: Re: How can I Score?
> >
> > Aha! I was wrong when I said I was using all default settings. I forgot
> I had followed a tutorial that told mem to put |scoring-depth| instead of
> |scoring-opic| into the plugin.includes pro
er 2016 7:15
> To: user@nutch.apache.org
> Subject: Re: How can I Score?
>
> Aha! I was wrong when I said I was using all default settings. I forgot I had
> followed a tutorial that told mem to put |scoring-depth| instead of
> |scoring-opic| into the plugin.includes property. No
ovember 15, 2016 12:09 AM
Subject: Re: How can I Score?
Hi Michael,
Replies inline
On Sat, Nov 12, 2016 at 7:10 PM, <user-digest-h...@nutch.apache.org> wrote:
> From: Michael Coffey <mcof...@yahoo.com.invalid>
> To: "user@nutch.apache.org" <user@nutch.apache.or
Hi Michael,
Replies inline
On Sat, Nov 12, 2016 at 7:10 PM, <user-digest-h...@nutch.apache.org> wrote:
> From: Michael Coffey <mcof...@yahoo.com.invalid>
> To: "user@nutch.apache.org" <user@nutch.apache.org>
> Cc:
> Date: Sun, 13 Nov 2016 03:07:16 +
I was thinking about the same question too. My guess is scoring happens
when you run fetch command.
This page may help, https://wiki.apache.org/nutch/NutchScoring
On Sat, Nov 12, 2016 at 10:07 PM, Michael Coffey
wrote:
> When the generator is used with -topN, it is
When the generator is used with -topN, it is supposed to choose the
highest-scoring urls. In my case, all the urls in my db have a score of zero,
except the ones injected.
How can I cause scores to be computed and stored? I am using the standard crawl
script. Do I need to enable the various
8 matches
Mail list logo