[ https://issues.apache.org/jira/browse/NUTCH-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17461288#comment-17461288 ]
ASF GitHub Bot commented on NUTCH-2807:
---------------------------------------

sebastian-nagel merged pull request #710:
URL: https://github.com/apache/nutch/pull/710

--
This is an automated message from the Apache Git Service.

> SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-2807
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2807
>             Project: Nutch
>          Issue Type: Improvement
>          Components: robots, sitemap
>            Reporter: Sebastian Nagel
>            Assignee: Sebastian Nagel
>            Priority: Minor
>              Labels: easytask
>             Fix For: 1.19
>
>
> Ignoring robots.txt has the side effect that no sitemaps can be detected
> via robots.txt.
> SitemapProcessor should log a warning if robots.txt is ignored by
> configuration (NUTCH-1927/NUTCH-2803).

--
This message was sent by Atlassian Jira (v8.20.1#820001)
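
The behaviour requested in the issue description could look roughly like the sketch below. It is not the code merged in pull request #710: the class name `RobotsIgnoreCheck`, the property key `example.robots.ignore`, and the wording of the warning are all hypothetical, and plain `java.util.Properties` stands in for the Nutch/Hadoop configuration object so that the example runs on its own.

```java
import java.util.Optional;
import java.util.Properties;
import java.util.logging.Logger;

// Hypothetical helper for illustration only; it is not part of Nutch.
final class RobotsIgnoreCheck {

    // Assumed property key. The real Nutch properties introduced by
    // NUTCH-1927/NUTCH-2803 are named differently and should be looked up
    // in nutch-default.xml.
    static final String IGNORE_ROBOTS_KEY = "example.robots.ignore";

    private static final Logger LOG = Logger.getLogger("RobotsIgnoreCheck");

    // Returns a warning text if the configuration says robots.txt is
    // ignored, otherwise an empty Optional.
    static Optional<String> sitemapWarning(Properties conf) {
        boolean ignored =
            Boolean.parseBoolean(conf.getProperty(IGNORE_ROBOTS_KEY, "false"));
        if (!ignored) {
            return Optional.empty();
        }
        return Optional.of("robots.txt is ignored by configuration ("
            + IGNORE_ROBOTS_KEY + "=true): sitemaps announced in robots.txt"
            + " will not be detected");
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty(IGNORE_ROBOTS_KEY, "true");
        // Log the warning once, before any sitemap processing would start.
        sitemapWarning(conf).ifPresent(LOG::warning);
    }
}
```

Keeping the check in a small function that only returns the message makes it easy to call once at start-up and to test without capturing log output.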