Hello list,
Having some problems when attempting to index to Solr (experimenting with
running Solr.war on Tomcat instead of usual Jetty config as before). Using
Nutch in singular mode (no Hadoop this time). Exception is present due to input
path not existing, however as far as I am aware I have followed the usual
procedure for updating, fetching URL's etc.
Nutch 1.2
Windows XP
Tomcat 6.0.26
Attached excerpt from Hadoop.log
2011-01-12 18:42:18,929 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:18,929 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:19,835 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:19,835 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:20,835 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:20,835 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:22,851 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:22,851 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:23,851 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:23,851 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:24,867 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection refused:
connect
2011-01-12 18:42:24,867 INFO httpclient.HttpMethodDirector - Retrying request
2011-01-12 18:42:25,898 WARN mapred.LocalJobRunner - job_local_0001
java.io.IOException
at
org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:85)
at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:80)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
Caused by: org.apache.solr.client.solrj.SolrServerException:
java.net.ConnectException: Connection refused: connect
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:472)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:243)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:75)
... 4 more
Caused by: java.net.ConnectException: Connection refused: connect
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
at java.net.Socket.connect(Socket.java:529)
at java.net.Socket.connect(Socket.java:478)
at java.net.Socket.<init>(Socket.java:375)
at java.net.Socket.<init>(Socket.java:249)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
at
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnectionAdapter.open(MultiThreadedHttpConnectionManager.java:1361)
at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:416)
... 8 more
2011-01-12 18:42:26,836 ERROR solr.SolrIndexer - java.io.IOException: Job
failed!
Thank you
Lewis
Glasgow Caledonian University is a registered Scottish charity, number SC021474
Winner: Times Higher Education's Widening Participation Initiative of the Year
2009 and Herald Society's Education Initiative of the Year 2009
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html