Otis Gospodnetic wrote:
I don't know of an elegant way, but if you want to hack Nutch
sources, you could set its refetch time to some point in time
very far in the future, for example. Or introduce an additional
status.
This won't work, because the pages will be checked again after a
----- Original Message -----
From: Saurabh Suman saurabhsuman...@rediff.com
To: nutch-user@lucene.apache.org
Sent: Thursday, July 30, 2009 9:59:50 AM
Subject: Meaning of ProtocolStatus.ACCESS_DENIED
Hi
In Fetcher.java, if the protocol status of a URL
://www.nabble.com/Meaning-of-ProtocolStatus.ACCESS_DENIED-tp24739011p24739011.html
Sent from the Nutch - User mailing list archive at Nabble.com.