So this is a stand-alone process after the crawling.
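
For reference, the three steps Markus describes below (webgraph, then linkrank, then scoreupdater) could be driven by a small script like the one here. This is only a sketch: the flag names (`-segmentDir`, `-webgraphdb`, `-crawldb`), the `bin/nutch` location, and the `crawl/...` paths are assumptions to check against the usage output of your Nutch version and the NewScoring wiki page. The helper names `linkrank_commands` and `run_linkrank` are made up for illustration.

```python
import subprocess
from typing import List


def linkrank_commands(segments_dir: str, webgraphdb: str, crawldb: str,
                      nutch: str = "bin/nutch") -> List[List[str]]:
    """Build the three post-crawl commands in the order given in the thread.

    Flag names are assumptions; verify them against your Nutch install.
    """
    return [
        # 1. Build the web graph from the fetched segments.
        [nutch, "webgraph", "-segmentDir", segments_dir, "-webgraphdb", webgraphdb],
        # 2. Run LinkRank over the web graph just created.
        [nutch, "linkrank", "-webgraphdb", webgraphdb],
        # 3. Write the computed scores back into the crawldb.
        [nutch, "scoreupdater", "-crawldb", crawldb, "-webgraphdb", webgraphdb],
    ]


def run_linkrank(segments_dir: str, webgraphdb: str, crawldb: str,
                 dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in linkrank_commands(segments_dir, webgraphdb, crawldb):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sequence if any step fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical paths; the dry run only prints the commands.
    run_linkrank("crawl/segments", "crawl/webgraphdb", "crawl/crawldb")
```

With `dry_run=True` nothing is executed, so the commands can be reviewed before they touch the crawldb.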

On Sun, Jun 16, 2013 at 12:17 PM, Markus Jelsma
<[email protected]> wrote:

> Hi Joe,
>
> You don't need a scoring filter for Linkrank. Just follow the wiki and run
> the webgraph tool on your segments. Then you can run the linkrank tool on
> the webgraph you just created from your segments. Finally use the
> scoreupdater tool to write the scores back to your crawldb.
>
> Cheers
>
> https://wiki.apache.org/nutch/NewScoring
>
>
> -----Original message-----
> > From: Joe Zhang <[email protected]>
> > Sent: Sun 16-Jun-2013 21:14
> > To: user <[email protected]>
> > Subject: Re: Nutch scoring question again
> >
> > Is scoring-link preferred over scoring-opic? I saw some discussion of
> > deficiencies of opic.
> >
> >
> > On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney <
> > [email protected]> wrote:
> >
> > > Yes Joe this is correct.
> > >
> > >
> > > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]> wrote:
> > >
> > > > Thanks.
> > > >
> > > > With regard to (2), is this score the "boost" we see in the Solr index?
> > > >
> > > >
> > > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ
> > > > <[email protected]> wrote:
> > > >
> > > > > Note: I'm a newbie.
> > > > >
> > > > > As far as I know, new scoring and scoring-link correspond to LinkRank.
> > > > > It's implemented in the scoring.webgraph package. The code in
> > > > > scoring-link might be linking the scoring plugin system to the LinkRank
> > > > > class in webgraph.
> > > > >
> > > > > 1) Yes, it works for sorting the pages. The topN most important-seeming
> > > > > pages are fetched in the next cycles according to this scoring.
> > > > > 2) Relevance in retrieval is affected due to (1). It calculates the scores
> > > > > and gives them to Solr. Solr will rank the search results according to
> > > > > these scores and some other external custom scores.
> > > > >
> > > > >
> > > > > ----- Original Message -----
> > > > > From: "Joe Zhang" <[email protected]>
> > > > > To: "user" <[email protected]>
> > > > > Sent: Saturday, 15 June 2013 23:41:33
> > > > > Subject: Nutch scoring question again
> > > > >
> > > > > The plugins directory only contains two scoring plugins: scoring-link and
> > > > > scoring-opic. What about the newscoring, linkrank, etc.? Where are they
> > > > > available?
> > > > >
> > > > > Again, I'm confused about the nature/purpose of such scoring:
> > > > >
> > > > > 1. Does it work as a sorting function for the frontier of the crawling? -->
> > > > > this seems reasonable.
> > > > > 2. Or does it affect relevance in retrieval? If so, why is it handled in
> > > > > the crawler, but not solr?
> > > > >
> > > > > I'd greatly appreciate any enlightenment.
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > *Lewis*
> > >
> >
>
