[
https://issues.apache.org/jira/browse/NUTCH-556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
King Kong updated NUTCH-556:
----------------------------
Description:
The spider must could find the new urls in time. and the new urls usually
are included in some url like index page,list page.
but the score of url can not reflect it Adequately.
Could we adjust the CrawlDatum.fetchInterval according to the number of newly
outlinks.
was:
Usually, the spider must could find the new urls in time.
but the score of url can not reflect it Adequately.
Could we adjust the CrawlDatum.fetchInterval according to the number of newly
outlinks.
> automatic adjust the CrawlDatum.fetchInterval according to the number of
> newly outlinks
> ---------------------------------------------------------------------------------------
>
> Key: NUTCH-556
> URL: https://issues.apache.org/jira/browse/NUTCH-556
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Reporter: King Kong
>
> The spider must could find the new urls in time. and the new urls usually
> are included in some url like index page,list page.
> but the score of url can not reflect it Adequately.
> Could we adjust the CrawlDatum.fetchInterval according to the number of newly
> outlinks.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.