Sebastian Nagel resolved NUTCH-813.

    Resolution: Duplicate

The described problem is identical to that of NUTCH-578. The provided patch 
(call setPageGoneSchedule when retry counter hits db.fetch.retry.max) is 
included in all patches of NUTCH-578.
> Repetitive crawl 403 status page
> --------------------------------
>                 Key: NUTCH-813
>                 URL: https://issues.apache.org/jira/browse/NUTCH-813
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.1
>            Reporter: Nguyen Manh Tien
>            Priority: Minor
>             Fix For: 1.7
>         Attachments: ASF.LICENSE.NOT.GRANTED--Patch
> When we crawl a page the return a 403 status. It will be crawl repetitively 
> each days with default schedule.
> Even when we restrict by paramter db.fetch.retry.max

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to