[ 
https://issues.apache.org/jira/browse/NUTCH-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15816083#comment-15816083
 ] 

Roannel Fernández Hernández commented on NUTCH-2334:
----------------------------------------------------

Hi [~wastl-nagel]
It is not just a matter of deciding whether an URL is (re)fetched or not. It 
was just an example. The schedulers would work together in all the moments 
where the schedulers are involved.
On the other hand, I don't think it is possible to decide whether an URL is 
fecthed or not only using ScoringFilters. If the enabled scheduler returns 
false when the generator asks if an URL should be visited or not, it will not 
be generated, regardless of the value that the scorings might return, it would 
not even matter if it is less than generate.min.score.
What I know is that the value returned by the ScoringFilters is used to sort 
all the URL that eventually pass to Reduce task until the TopN is reached, like 
a URL's priority.

> Extension point for schedulers
> ------------------------------
>
>                 Key: NUTCH-2334
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2334
>             Project: Nutch
>          Issue Type: New Feature
>          Components: generator
>    Affects Versions: 1.12
>            Reporter: Roannel Fernández Hernández
>            Priority: Minor
>             Fix For: 1.13
>
>
> With an extension point for schedulers, the users should be able to create 
> new schedulers that meet to their own needs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to