[ 
https://issues.apache.org/jira/browse/NUTCH-556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

King Kong updated NUTCH-556:
----------------------------

    Description: 
The spider must could  find the new urls  in time.  and the new urls usually 
are included in some url like index page,list page.

but  the score of url can not reflect it Adequately.

Could we adjust the CrawlDatum.fetchInterval according to the number of newly 
outlinks.

  was:
Usually, the spider must could  find the new urls  in time.

but  the score of url can not reflect it Adequately.

Could we adjust the CrawlDatum.fetchInterval according to the number of newly 
outlinks.


> automatic adjust the CrawlDatum.fetchInterval according to the number of 
> newly outlinks
> ---------------------------------------------------------------------------------------
>
>                 Key: NUTCH-556
>                 URL: https://issues.apache.org/jira/browse/NUTCH-556
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>            Reporter: King Kong
>
> The spider must could  find the new urls  in time.  and the new urls usually 
> are included in some url like index page,list page.
> but  the score of url can not reflect it Adequately.
> Could we adjust the CrawlDatum.fetchInterval according to the number of newly 
> outlinks.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to