I should not have brought that protocol status thing in here. ProtocolStatusCode = 2 means FAILED (see [0]) CrawlStatus = 2 means FETCHED (see [1])
Get the webdb dump and share it, your problem will get more clear. [0] : http://svn.apache.org/repos/asf/nutch/branches/2.x/src/java/org/apache/nutch/protocol/ProtocolStatusCodes.java [1] : http://svn.apache.org/repos/asf/nutch/branches/2.x/src/java/org/apache/nutch/crawl/CrawlStatus.java On Sat, May 4, 2013 at 12:20 PM, raviksingh <[email protected]>wrote: > Hi, > This link http://nlp.solutions.asia/?p=232 says that "2" means > "fetched". > Is this wrong? > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Nutch-Crawls-Again-and-again-tp4060834p4060842.html > Sent from the Nutch - User mailing list archive at Nabble.com. >

