try playing with http.timeout and http.max.delays , seems it is stuck on
county page,  can you try crawling some other website and see if still
throw error ? However I did try to crawl didnt get such error ? I am using
1.11

thanks
Rakesh Kumar

On Wed, Jan 6, 2016 at 3:09 AM, Manish Verma <[email protected]> wrote:

> Hi,
>
> I am using nutch 1.10 and able to crawl on my local machine, when I try
> running crawler on linux server, its failing with socket tome out exception.
> I got port 80 open on linux server and checked with nc command it succeed .
>
> [searcht::ma1-searcht-lapp118::~/nutch/apache-nutch-1.10/runtime/local/logs]$
> nc -z apple.com 80
> Connection to apple.com 80 port [tcp/http] succeeded!
>
> Please suggest what could be the reason.
>
> Below is stack trace from log.
>
> 2016-01-05 21:25:20,179 ERROR http.Http - Failed to get protocol output
> java.net.SocketTimeoutException: connect timed out
>         at java.net.PlainSocketImpl.socketConnect(Native Method)
>         at
> java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
>         at
> java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)
>         at
> java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
>         at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
>         at java.net.Socket.connect(Socket.java:579)
>         at
> org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:123)
>         at org.apache.nutch.protocol.http.Http.getResponse(Http.java:70)
>         at
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:255)
>         at
> org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:778)
> 2016-01-05 21:25:20,183 INFO  fetcher.Fetcher - fetch of
> http://www.apple.com/choose-your-country/ failed with:
> java.net.SocketTimeoutException: connect timed out
> Thanks
> Manish Verma
> AML Search
> +1 669 224 9924
>
>

Reply via email to