Author: tejasp
Date: Fri Jan 24 17:46:05 2014
New Revision: 1561088
URL: http://svn.apache.org/r1561088
Log:
NUTCH-1715 RobotRulesParser adds additional * to the robots name
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java
Modified: nutch/branches/2.x/CHANGES.txt
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/CHANGES.txt?rev=1561088&r1=1561087&r2=1561088&view=diff
==============================================================================
--- nutch/branches/2.x/CHANGES.txt (original)
+++ nutch/branches/2.x/CHANGES.txt Fri Jan 24 17:46:05 2014
@@ -2,6 +2,8 @@ Nutch Change Log
Current Development
+* NUTCH-1715 RobotRulesParser adds additional '*' to the robots name (tejasp)
+
* NUTCH-356 Plugin repository cache can lead to memory leak (Enrico Triolo,
DoÄacan Güney via markus)
* NUTCH-1164 Write JUnit tests for protocol-http (Sertac TURKEL via tejasp)
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java?rev=1561088&r1=1561087&r2=1561088&view=diff
==============================================================================
--- nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java
(original)
+++ nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java
Fri Jan 24 17:46:05 2014
@@ -121,8 +121,6 @@ public abstract class RobotRulesParser i
combinedAgentsString.append(", " + agents.get(index));
}
- // always make sure "*" is included in the end
- combinedAgentsString.append(", *");
this.agentNames = combinedAgentsString.toString();
}
}