Hi,
I have been using nutch for quite sometime now. All had been working fine. I
crawl some sites once a fortnight. It worked fine till now, except i cant
seem to make it work for last couple of days. I am getting the following
exception when i run the  bin/nutch crawl command:

2012-04-10 11:03:38,783 INFO  api.RobotRulesParser - Couldn't get robots.txt
for http://sitetocrawl.html/: java.net.ConnectException: Connection refused
2012-04-10 11:03:38,784 ERROR http.Http - java.net.ConnectException:
Connection refused
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.socketConnect(Native Method)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.Socket.connect(Socket.java:529)
2012-04-10 11:03:38,784 ERROR http.Http - at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:97)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.protocol.http.Http.getResponse(Http.java:64)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:224)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:627)
2012-04-10 11:03:38,786 INFO  fetcher.Fetcher - fetch of
http://sitetocrawl.html/ failed with: java.net.ConnectException: Connection
refused

I really cant seem to find out why this has stopped working all of a sudden.
Any help?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Connection-refused-tp3898889p3898889.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to