try playing with http.timeout and http.max.delays , seems it is stuck on county page, can you try crawling some other website and see if still throw error ? However I did try to crawl didnt get such error ? I am using 1.11
thanks Rakesh Kumar On Wed, Jan 6, 2016 at 3:09 AM, Manish Verma <[email protected]> wrote: > Hi, > > I am using nutch 1.10 and able to crawl on my local machine, when I try > running crawler on linux server, its failing with socket tome out exception. > I got port 80 open on linux server and checked with nc command it succeed . > > [searcht::ma1-searcht-lapp118::~/nutch/apache-nutch-1.10/runtime/local/logs]$ > nc -z apple.com 80 > Connection to apple.com 80 port [tcp/http] succeeded! > > Please suggest what could be the reason. > > Below is stack trace from log. > > 2016-01-05 21:25:20,179 ERROR http.Http - Failed to get protocol output > java.net.SocketTimeoutException: connect timed out > at java.net.PlainSocketImpl.socketConnect(Native Method) > at > java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339) > at > java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200) > at > java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182) > at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) > at java.net.Socket.connect(Socket.java:579) > at > org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:123) > at org.apache.nutch.protocol.http.Http.getResponse(Http.java:70) > at > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:255) > at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:778) > 2016-01-05 21:25:20,183 INFO fetcher.Fetcher - fetch of > http://www.apple.com/choose-your-country/ failed with: > java.net.SocketTimeoutException: connect timed out > Thanks > Manish Verma > AML Search > +1 669 224 9924 > >

