Sorry for the multiple messages, Markus and Lewis.

I did read the wiki entry on NewScoring. It is described as a standalone
process AFTER crawling. So what does the scoring-link plugin do? Is it
equivalent to NewScoring or not?


On Sun, Jun 16, 2013 at 12:21 PM, Joe Zhang <[email protected]> wrote:

> and scoring-link != LinkRank?
>
>
> On Sun, Jun 16, 2013 at 12:20 PM, Joe Zhang <[email protected]> wrote:
>
>> So this is a stand-alone process after the crawling.
>>
>>
>> On Sun, Jun 16, 2013 at 12:17 PM, Markus Jelsma <
>> [email protected]> wrote:
>>
>>> Hi Joe,
>>>
>>> You don't need a scoring filter for Linkrank. Just follow the wiki and
>>> run the webgraph tool on your segments. Then you can run the linkrank tool
>>> on the webgraph you just created from your segments. Finally use the
>>> scoreupdater tool to write the scores back to your crawldb.
>>>
>>> Cheers
>>>
>>> https://wiki.apache.org/nutch/NewScoring
>>>
>>>
>>> -----Original message-----
>>> > From: Joe Zhang <[email protected]>
>>> > Sent: Sun 16-Jun-2013 21:14
>>> > To: user <[email protected]>
>>> > Subject: Re: Nutch scoring question again
>>> >
>>> > Is scoring-link preferred over scoring-opic? I saw some discussion of
>>> > deficiencies of opic.
>>> >
>>> >
>>> > On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney <
>>> > [email protected]> wrote:
>>> >
>>> > > Yes Joe this is correct.
>>> > >
>>> > >
>>> > > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]>
>>> wrote:
>>> > >
>>> > > > Thanks.
>>> > > >
>>> > > > with regards to (2), is this score the "boost" we see in solr
>>> index?
>>> > > >
>>> > > >
>>> > > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ
>>> > > > <[email protected]> wrote:
>>> > > >
>>> > > > > Note: I'm a newbie.
>>> > > > >
>>> > > > > As far as I know, new scoring and scoring-link correspond to
>>> LinkRank.
>>> > > > > It's implemented in the scoring.webgraph package. The code in the
>>> > > > > scoring-link might be linking the scoring plugin system to the
>>> LinkRank
>>> > > > > class in webgraph.
>>> > > > >
>>> > > > > 1) Yes it works for sorting the pages. The topN most
>>> important-seeming
>>> > > > > pages are fetched in the next cycles according to this scoring.
>>> > > > > 2) Relevance in retrieval is affected due to (1). It calculates
>>> the
>>> > > > scores
>>> > > > > and gives them to Solr. Solr will rank the search results
>>> according to
>>> > > > > these scores and some other external custom scores.
>>> > > > >
>>> > > > >
>>> > > > > ----- Original Message -----
>>> > > > > From: "Joe Zhang" <[email protected]>
>>> > > > > To: "user" <[email protected]>
>>> > > > > Sent: Saturday, 15 June 2013 23:41:33
>>> > > > > Subject: Nutch scoring question again
>>> > > > >
>>> > > > > The plugins directory only contains two scoring plugins:
>>> scoring-link and
>>> > > > > scoring-opic. What about the newscoring, linkrank, etc.? Where
>>> are they
>>> > > > > available?
>>> > > > >
>>> > > > > Again, I'm confused about the nature/purpose of such scoring:
>>> > > > >
>>> > > > > 1. Does it work as a sorting function for the frontier of the
>>> crawling?
>>> > > > -->
>>> > > > > this seems reasonable.
>>> > > > > 2. Or does it affect relevance in retrieval? If so, why is it
>>> handled
>>> > > in
>>> > > > > the crawler, but not solr?
>>> > > > >
>>> > > > > I'd greatly appreciate any enlightenment.
>>> > > > >
>>> > > >
>>> > >
>>> > >
>>> > >
>>> > > --
>>> > > *Lewis*
>>> > >
>>> >
>>>
>>
>>
>
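
For reference, the three post-crawl steps Markus describes in the quoted
message (webgraph on the segments, then linkrank on the resulting webgraph,
then scoreupdater to write scores back to the crawldb) could be scripted
roughly as below. This is only a sketch under assumptions: the `bin/nutch`
command aliases and the `-segmentDir`, `-webgraphdb` and `-crawldb` flags are
as I recall them from Nutch 1.x and the NewScoring wiki page, the `crawl/...`
paths and the helper names are made up for illustration, and the usage output
of each tool should be checked before relying on any of it. By default the
script only prints the commands.

```python
import shlex
import subprocess


def linkrank_commands(segments_dir, webgraphdb, crawldb, nutch="bin/nutch"):
    """Return the three Nutch invocations, in order, as argument lists.

    Command names and flags are assumptions based on Nutch 1.x.
    """
    return [
        # 1. Build the webgraph from the fetched segments.
        [nutch, "webgraph", "-segmentDir", segments_dir, "-webgraphdb", webgraphdb],
        # 2. Run the LinkRank analysis on that webgraph.
        [nutch, "linkrank", "-webgraphdb", webgraphdb],
        # 3. Write the computed scores back into the crawldb.
        [nutch, "scoreupdater", "-crawldb", crawldb, "-webgraphdb", webgraphdb],
    ]


def run_linkrank(segments_dir, webgraphdb, crawldb, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in linkrank_commands(segments_dir, webgraphdb, crawldb):
        print(" ".join(shlex.quote(arg) for arg in cmd))
        if not dry_run:
            # check=True stops the sequence if a step fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical paths; adjust to the actual crawl directory layout.
    run_linkrank("crawl/segments", "crawl/webgraphdb", "crawl/crawldb")
```

Calling `run_linkrank(..., dry_run=False)` from the Nutch runtime directory
would execute the steps in order. None of this involves a scoring plugin,
which matches Markus's point that LinkRank does not need a scoring filter.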
