Hi,
I am using Nutch 1.10 and can crawl from my local machine, but when I run the
crawler on a Linux server it fails with a socket timeout exception.
Port 80 is open on the Linux server; I checked with the nc command and it succeeded.
[searcht::ma1-searcht-lapp118::~/nutch/apache-nutch-1.10/runtime/local/logs]$ nc -z apple.com 80
Connection to apple.com 80 port [tcp/http] succeeded!
Please suggest what could be the reason.
Below is the stack trace from the log.
2016-01-05 21:25:20,179 ERROR http.Http - Failed to get protocol output
java.net.SocketTimeoutException: connect timed out
    at java.net.PlainSocketImpl.socketConnect(Native Method)
    at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
    at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)
    at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
    at java.net.Socket.connect(Socket.java:579)
    at org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:123)
    at org.apache.nutch.protocol.http.Http.getResponse(Http.java:70)
    at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:255)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:778)
2016-01-05 21:25:20,183 INFO fetcher.Fetcher - fetch of http://www.apple.com/choose-your-country/ failed with: java.net.SocketTimeoutException: connect timed out
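
The trace shows the failure inside a plain java.net.Socket.connect call, and
the nc check above was against apple.com while the failing fetch is for
www.apple.com. To narrow this down, the same connect could be tried from the
server's JVM outside Nutch. This is only a rough sketch: the class name
ConnectCheck, the helper tryConnect, and the 10000 ms timeout are made up for
illustration and are not part of Nutch.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.SocketTimeoutException;

// Hypothetical stand-alone check, not part of Nutch.
class ConnectCheck {

    // Opens a TCP connection with a connect timeout and reports the
    // outcome as text instead of throwing.
    static String tryConnect(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return "connected to " + host + ":" + port;
        } catch (SocketTimeoutException e) {
            // Same exception type as in the Nutch log above.
            return "timed out: " + e.getMessage();
        } catch (IOException e) {
            // Covers refused connections, unresolved hosts, and so on.
            return "failed: " + e;
        }
    }

    public static void main(String[] args) {
        // Defaults mirror the host from the failing fetch; the timeout
        // value is an assumption chosen for illustration.
        String host = args.length > 0 ? args[0] : "www.apple.com";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 80;
        System.out.println(tryConnect(host, port, 10000));
    }
}
```

If this also times out for www.apple.com while nc succeeds for the same host,
the problem is probably in how the JVM reaches the network on that server
rather than in Nutch itself.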
Thanks
Manish Verma
AML Search
+1 669 224 9924