[ https://issues.apache.org/jira/browse/NUTCH-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel resolved NUTCH-2807. ------------------------------------ Resolution: Implemented > SitemapProcessor to warn that ignoring robots.txt affects detection of > sitemaps > ------------------------------------------------------------------------------- > > Key: NUTCH-2807 > URL: https://issues.apache.org/jira/browse/NUTCH-2807 > Project: Nutch > Issue Type: Improvement > Components: robots, sitemap > Reporter: Sebastian Nagel > Assignee: Sebastian Nagel > Priority: Minor > Labels: easytask > Fix For: 1.19 > > > Ignoring the robots.txt causes as a site effect that no sitemaps can be > detected via robots.txt. > SitemapProcessor should log a warning if robots.txt is ignored by > configuration (NUTCH-1927/NUTCH-2803). -- This message was sent by Atlassian Jira (v8.20.1#820001)