These are transient errors and occur all the time, increasing http.timeout will 
fix it. If this happens, the URL's is, by default, scheduled for recrawl 1 day 
later.
M.

 
-----Original message-----
> From:Zara Parst <[email protected]>
> Sent: Wednesday 6th January 2016 10:08
> To: [email protected]
> Subject: Re: Socket Time Out O Linux Server
> 
> try playing with http.timeout and http.max.delays , seems it is stuck on
> county page,  can you try crawling some other website and see if still
> throw error ? However I did try to crawl didnt get such error ? I am using
> 1.11
> 
> thanks
> Rakesh Kumar
> 
> On Wed, Jan 6, 2016 at 3:09 AM, Manish Verma <[email protected]> wrote:
> 
> > Hi,
> >
> > I am using nutch 1.10 and able to crawl on my local machine, when I try
> > running crawler on linux server, its failing with socket tome out exception.
> > I got port 80 open on linux server and checked with nc command it succeed .
> >
> > [searcht::ma1-searcht-lapp118::~/nutch/apache-nutch-1.10/runtime/local/logs]$
> > nc -z apple.com 80
> > Connection to apple.com 80 port [tcp/http] succeeded!
> >
> > Please suggest what could be the reason.
> >
> > Below is stack trace from log.
> >
> > 2016-01-05 21:25:20,179 ERROR http.Http - Failed to get protocol output
> > java.net.SocketTimeoutException: connect timed out
> >         at java.net.PlainSocketImpl.socketConnect(Native Method)
> >         at
> > java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
> >         at
> > java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)
> >         at
> > java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
> >         at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
> >         at java.net.Socket.connect(Socket.java:579)
> >         at
> > org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:123)
> >         at org.apache.nutch.protocol.http.Http.getResponse(Http.java:70)
> >         at
> > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:255)
> >         at
> > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:778)
> > 2016-01-05 21:25:20,183 INFO  fetcher.Fetcher - fetch of
> > http://www.apple.com/choose-your-country/ failed with:
> > java.net.SocketTimeoutException: connect timed out
> > Thanks
> > Manish Verma
> > AML Search
> > +1 669 224 9924
> >
> >
> 

Reply via email to