Yes this is true (if the assumption is that by 'compar' you are referring to
'compare'), but it sounds like you are after some kind of similarity
anaysis.

As Markus said, this is not thr fetchers job but another element of the
Nutch architecture.

In general... I would aks you to understand how the plugin architecture
works in Nutch and what nature of data we can obtain at various stages in
the 'crawling process'. We have a link analysis feature in Nutch but we
currently do not support any similarity analysis.

Please correct me and redefine your problem if the above is not accurate.



On Mon, Sep 12, 2011 at 11:02 AM, Danicela nutch <[email protected]>wrote:

> I want to prioritize URLs containing "compar" for exemple to fetch them
> first.
>
>  Maybe I didn't understand how this is intended to work, but I thought
> pages with higher score meant they are pages first to fetch.
>
> ----- Original Message -----
> From: Markus Jelsma
> Sent: 09/12/11 11:55 AM
> To: [email protected]
> Subject: Re: Modifying fetch order with ScoringFilter
>
>  > Hi, > > I'm making a plugin implementing ScoringFilter. I want to modify
> the fetch > order of pages according to their URL. For that, I have to
> modify the > score. Why would you want to do that? The fetcher has nothing
> to do with scoring. What problem are you trying to solve? > > I tried to
> modify the score on various methods provided by the interface > but none of
> them modified the fetch order during this phase. > > Maybe I don't use this
> interface well. > > Can someone tell me how to use this interface / redirect
> me to a tutorial > ? > > Thanks.
>



-- 
*Lewis*

Reply via email to