Author: tejasp
Date: Fri Jan 24 17:46:05 2014
New Revision: 1561088

URL: http://svn.apache.org/r1561088
Log:
NUTCH-1715 RobotRulesParser adds additional * to the robots name

Modified:
    nutch/branches/2.x/CHANGES.txt
    nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java

Modified: nutch/branches/2.x/CHANGES.txt
URL: http://svn.apache.org/viewvc/nutch/branches/2.x/CHANGES.txt?rev=1561088&r1=1561087&r2=1561088&view=diff
==============================================================================
--- nutch/branches/2.x/CHANGES.txt (original)
+++ nutch/branches/2.x/CHANGES.txt Fri Jan 24 17:46:05 2014
@@ -2,6 +2,8 @@ Nutch Change Log
 
 Current Development
 
+* NUTCH-1715 RobotRulesParser adds additional '*' to the robots name (tejasp)
+
 * NUTCH-356 Plugin repository cache can lead to memory leak (Enrico Triolo, Doğacan Güney via markus)
 
 * NUTCH-1164 Write JUnit tests for protocol-http (Sertac TURKEL via tejasp)

Modified: nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java
URL: http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java?rev=1561088&r1=1561087&r2=1561088&view=diff
==============================================================================
--- nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java (original)
+++ nutch/branches/2.x/src/java/org/apache/nutch/protocol/RobotRulesParser.java Fri Jan 24 17:46:05 2014
@@ -121,8 +121,6 @@ public abstract class RobotRulesParser i
         combinedAgentsString.append(", " + agents.get(index));
       }
 
-      // always make sure "*" is included in the end
-      combinedAgentsString.append(", *");
       this.agentNames = combinedAgentsString.toString();
     }
   }
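
For illustration, the effect of this change on the combined agent string can be sketched as follows. This is a minimal standalone sketch, not the actual RobotRulesParser code: the class AgentNamesSketch, the method combineAgentNames, the legacyWildcard flag, and the sample agent names are all hypothetical. Only the append/toString pattern and the removed ", *" suffix are taken from the diff above.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical standalone sketch; names here are not part of Nutch.
public class AgentNamesSketch {

  // Joins the primary agent name with any additional agents, mirroring the
  // append pattern in the diff. When legacyWildcard is true, the ", *" suffix
  // that this revision removes is appended, as the old code did.
  static String combineAgentNames(String agentName, List<String> agents,
                                  boolean legacyWildcard) {
    StringBuilder combinedAgentsString = new StringBuilder(agentName);
    for (int index = 0; index < agents.size(); index++) {
      combinedAgentsString.append(", " + agents.get(index));
    }
    if (legacyWildcard) {
      combinedAgentsString.append(", *");
    }
    return combinedAgentsString.toString();
  }

  public static void main(String[] args) {
    List<String> extra = Arrays.asList("otherbot");
    // Behaviour before r1561088 (wildcard always appended)
    System.out.println(combineAgentNames("mybot", extra, true));
    // Behaviour after r1561088 (no extra wildcard)
    System.out.println(combineAgentNames("mybot", extra, false));
  }
}
```

With the sample input, the first call yields "mybot, otherbot, *" and the second yields "mybot, otherbot", which is the robots name without the additional '*' described in NUTCH-1715.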

