Chris A. Mattmann created NUTCH-1927:
----------------------------------------
Summary: Create a whitelist of IPs/hostnames to allow skipping of
RobotRules parsing
Key: NUTCH-1927
URL: https://issues.apache.org/jira/browse/NUTCH-1927
Project: Nutch
Issue Type: Bug
Reporter: Chris A. Mattmann
Based on the discussion on the dev list, there are valid security research
use cases for Nutch (DDoS, DNS, and other testing) where RobotRules parsing
should be skipped for specific hosts. I am going to create a patch that adds
a whitelist:
{code:xml}
<property>
<name>robot.rules.whitelist</name>
<value>132.54.99.22,hostname.apache.org,foo.jpl.nasa.gov</value>
<description>Comma-separated list of hostnames or IP addresses for which
robot rules parsing is skipped.
</description>
</property>
{code}
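As a rough illustration of how such a property value could be consumed, here
is a minimal sketch. It is not the patch itself: the class name
RobotRulesWhitelist and the method isWhitelisted are hypothetical, and it
assumes plain case-insensitive string matching on the host or IP with no DNS
resolution. The real implementation may differ.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Locale;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical helper, not part of Nutch: holds the parsed whitelist.
class RobotRulesWhitelist {
    private final Set<String> hosts;

    // Parses a comma-separated value such as the one in robot.rules.whitelist.
    RobotRulesWhitelist(String commaSeparated) {
        if (commaSeparated == null) {
            this.hosts = Collections.emptySet();
            return;
        }
        this.hosts = Arrays.stream(commaSeparated.split(","))
            .map(String::trim)                      // tolerate spaces after commas
            .filter(s -> !s.isEmpty())              // ignore empty entries
            .map(s -> s.toLowerCase(Locale.ROOT))   // hostnames are case-insensitive
            .collect(Collectors.toSet());
    }

    // Returns true if robot rules parsing would be skipped for this host.
    // Assumption: exact match only; an IP entry does not match a hostname
    // that resolves to it, and vice versa.
    boolean isWhitelisted(String host) {
        return host != null
            && hosts.contains(host.trim().toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        RobotRulesWhitelist wl = new RobotRulesWhitelist(
            "132.54.99.22,hostname.apache.org,foo.jpl.nasa.gov");
        System.out.println(wl.isWhitelisted("foo.jpl.nasa.gov"));
        System.out.println(wl.isWhitelisted("example.com"));
    }
}
```

A fetcher would call something like isWhitelisted(url.getHost()) before
fetching robots.txt and skip rule parsing when it returns true. Exact matching
keeps the whitelist explicit, so a host is never exempted by accident.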