sebastian-nagel opened a new pull request, #854: URL: https://github.com/apache/nutch/pull/854
Verify the sitemap additional values added from a sitemap to CrawlDB by SitemapProcessor: priority, modification time and change frequency. - use default priority if priority <= 0.0 (a CrawlDatum with score 0.0 is not eligible for fetch) - ensure that the fetch interval (from change frequency) is within db.fetch.schedule.adaptive.min_interval and db.fetch.schedule.adaptive.max_interval - ignore last-modified times in the future -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org