Hi Joe,

You don't need a scoring filter for Linkrank. Just follow the wiki and run the 
webgraph tool on your segments. Then you can run the linkrank tool on the 
webgraph you just created from your segments. Finally use the scoreupdater tool 
to write the scores back to your crawldb.

Cheers

https://wiki.apache.org/nutch/NewScoring
 
 
-----Original message-----
> From:Joe Zhang <[email protected]>
> Sent: Sun 16-Jun-2013 21:14
> To: user <[email protected]>
> Subject: Re: Nutch scoring question again
> 
> Is scoring-link preferred over scoring-opic? I saw some disucssion of
> deficiencies of opic.
> 
> 
> On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney <
> [email protected]> wrote:
> 
> > Yes Joe this is correct.
> >
> >
> > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]> wrote:
> >
> > > Thanks.
> > >
> > > with regards to (2), is this score the "boost" we see in solr index?
> > >
> > >
> > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ
> > > <[email protected]>wrote:
> > >
> > > > Note: I'm a newbie.
> > > >
> > > > As far as I know, new scoring and scoring-link corresponds to LinkRank.
> > > > It's implemented in the scoring.webgraph package. The code in the
> > > > scoring-link might be linking the scoring plugin system to the LinkRank
> > > > class in webgraph.
> > > >
> > > > 1) Yes it works for sorting the pages. The topN most important-seeming
> > > > pages are fetched in the next cycles according to this scoring.
> > > > 2) Relevance in retrieval is affected due to (1). It calculates the
> > > scores
> > > > and gives them to Solr. Solr will rank the search results according to
> > > > these scores and some other external custom scores.
> > > >
> > > >
> > > > ----- Orijinal Mesaj -----
> > > > Kimden: "Joe Zhang" <[email protected]>
> > > > Kime: "user" <[email protected]>
> > > > Gönderilenler: 15 Haziran Cumartesi 2013 23:41:33
> > > > Konu: Nutch scoring question again
> > > >
> > > > The plugins directory only contains two scoring plugs: scoring-link and
> > > > scoring-opic. What about the newscoring, linkrank, etc.? Where are they
> > > > available?
> > > >
> > > > Again, I'm confused about the nature/purpose of such scoring:
> > > >
> > > > 1. Does it work as a sorting function for the frontier of the crawling?
> > > -->
> > > > this seems reasonable.
> > > > 2. Or does it affect relevance in retrieval? If so, why is it handled
> > in
> > > > the crawler, but not solr?
> > > >
> > > > I'd greatly appreciate any enlightment.
> > > >
> > >
> >
> >
> >
> > --
> > *Lewis*
> >
> 

Reply via email to