Markus Jelsma created NUTCH-1586:
------------------------------------

             Summary: Non-db_success records should have interval.max
                 Key: NUTCH-1586
                 URL: https://issues.apache.org/jira/browse/NUTCH-1586
             Project: Nutch
          Issue Type: Bug
          Components: crawldb
    Affects Versions: 1.7
            Reporter: Markus Jelsma
            Assignee: Markus Jelsma
             Fix For: 1.8


When your default interval is low (e.g. when you start low using adaptive 
scheduling), records with redirect or gone status keep the default interval. 
There should be a switch to force 404's and redirects to use the max interval 
instead because these are usually the least interesting records for recrawling.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to