I personal prefer protocol-http.

Am 05.02.2006 um 18:26 schrieb Raghavendra Prabhu:

Hi Stefan

My bandwidth is limited .

But i am able to crawl other links with the same host (so he is not denying
i guess)

Is it because of the protocol-httpclient(shud i use protocol-http)

Rgds
Prabhu


On 2/5/06, Stefan Groschupf <[EMAIL PROTECTED]> wrote:

Is the host in your web-browser available?
Does this host block your ip, since he understand nutch as a DOS attack?
Is you bandwidth limited?

Am 05.02.2006 um 18:17 schrieb Raghavendra Prabhu:

Hi

I am running a crawl using protocol-httpclient

I get a
java.io.IOException: java.net.SocketTimeoutException: Read timed out

Can someone tell me the reason why i get the error

After that the crawl hangs and is simply in the same state

Rgds
Prabhu





-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to