So this is a stand-alone process after the crawling.
On Sun, Jun 16, 2013 at 12:17 PM, Markus Jelsma <[email protected]>wrote: > Hi Joe, > > You don't need a scoring filter for Linkrank. Just follow the wiki and run > the webgraph tool on your segments. Then you can run the linkrank tool on > the webgraph you just created from your segments. Finally use the > scoreupdater tool to write the scores back to your crawldb. > > Cheers > > https://wiki.apache.org/nutch/NewScoring > > > -----Original message----- > > From:Joe Zhang <[email protected]> > > Sent: Sun 16-Jun-2013 21:14 > > To: user <[email protected]> > > Subject: Re: Nutch scoring question again > > > > Is scoring-link preferred over scoring-opic? I saw some disucssion of > > deficiencies of opic. > > > > > > On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney < > > [email protected]> wrote: > > > > > Yes Joe this is correct. > > > > > > > > > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]> > wrote: > > > > > > > Thanks. > > > > > > > > with regards to (2), is this score the "boost" we see in solr index? > > > > > > > > > > > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ > > > > <[email protected]>wrote: > > > > > > > > > Note: I'm a newbie. > > > > > > > > > > As far as I know, new scoring and scoring-link corresponds to > LinkRank. > > > > > It's implemented in the scoring.webgraph package. The code in the > > > > > scoring-link might be linking the scoring plugin system to the > LinkRank > > > > > class in webgraph. > > > > > > > > > > 1) Yes it works for sorting the pages. The topN most > important-seeming > > > > > pages are fetched in the next cycles according to this scoring. > > > > > 2) Relevance in retrieval is affected due to (1). It calculates the > > > > scores > > > > > and gives them to Solr. Solr will rank the search results > according to > > > > > these scores and some other external custom scores. > > > > > > > > > > > > > > > ----- Orijinal Mesaj ----- > > > > > Kimden: "Joe Zhang" <[email protected]> > > > > > Kime: "user" <[email protected]> > > > > > Gönderilenler: 15 Haziran Cumartesi 2013 23:41:33 > > > > > Konu: Nutch scoring question again > > > > > > > > > > The plugins directory only contains two scoring plugs: > scoring-link and > > > > > scoring-opic. What about the newscoring, linkrank, etc.? Where are > they > > > > > available? > > > > > > > > > > Again, I'm confused about the nature/purpose of such scoring: > > > > > > > > > > 1. Does it work as a sorting function for the frontier of the > crawling? > > > > --> > > > > > this seems reasonable. > > > > > 2. Or does it affect relevance in retrieval? If so, why is it > handled > > > in > > > > > the crawler, but not solr? > > > > > > > > > > I'd greatly appreciate any enlightment. > > > > > > > > > > > > > > > > > > > > > -- > > > *Lewis* > > > > > >

