This is just a normal TCP error. You have most likely been banned by
some firewall administrator.
On Tue, 10 Apr 2012 00:05:50 -0700 (PDT), "[email protected]"
<[email protected]> wrote:
Hi,
I have been using nutch for quite sometime now. All had been working
fine. I
crawl some sites once a fortnight. It worked fine till now, except i
cant
seem to make it work for last couple of days. I am getting the
following
exception when i run the bin/nutch crawl command:
2012-04-10 11:03:38,783 INFO api.RobotRulesParser - Couldn't get
robots.txt
for http://sitetocrawl.html/: java.net.ConnectException: Connection
refused
2012-04-10 11:03:38,784 ERROR http.Http - java.net.ConnectException:
Connection refused
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.socketConnect(Native Method)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
2012-04-10 11:03:38,784 ERROR http.Http - at
java.net.Socket.connect(Socket.java:529)
2012-04-10 11:03:38,784 ERROR http.Http - at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:97)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.protocol.http.Http.getResponse(Http.java:64)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:224)
2012-04-10 11:03:38,785 ERROR http.Http - at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:627)
2012-04-10 11:03:38,786 INFO fetcher.Fetcher - fetch of
http://sitetocrawl.html/ failed with: java.net.ConnectException:
Connection
refused
I really cant seem to find out why this has stopped working all of a
sudden.
Any help?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Connection-refused-tp3898889p3898889.html
Sent from the Nutch - User mailing list archive at Nabble.com.