Christian Johnsson created NUTCH-1348:
-----------------------------------------
Summary: Solrindexer fails with a java.io.IOException error.
Key: NUTCH-1348
URL: https://issues.apache.org/jira/browse/NUTCH-1348
Project: Nutch
Issue Type: Bug
Components: indexer
Affects Versions: 1.5
Environment: Debian Stable AMD64
Reporter: Christian Johnsson
Priority: Minor
I'm unable to reproduce this error but it happens from time to time when i run
solrindexer.
I use the same commands as i did with 1.4 and about the same configuration and
i haven't changed any solr settings.
Have the same plugins active just to be able to compare.
>From time to time the solrindexer throws an error. It happends like 1-2 times
>out of 5 and there is no information in the solr log about it.
Not sure if it's a bug but i though i might as well report it since i've been
running 1.4 since it was released and never came across this error in that
version.
2012-05-01 20:44:14,861 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.SocketException) caught when processing request: Connection reset
2012-05-01 20:44:14,861 INFO httpclient.HttpMethodDirector - Retrying request
2012-05-01 20:44:15,808 INFO solr.SolrWriter - Indexing 250 documents
2012-05-01 20:44:36,153 WARN mapred.LocalJobRunner - job_local_0001
java.io.IOException
at
org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:152)
at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:126)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:55)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:44)
at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:440)
at
org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:195)
at
org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:51)
at
org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
Caused by: org.apache.solr.client.solrj.SolrServerException:
org.apache.commons.httpclient.ProtocolException: Unbuffered entity enclosing
request can not be repeated.
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:475)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:124)
... 8 more
Caused by: org.apache.commons.httpclient.ProtocolException: Unbuffered entity
enclosing request can not be repeated.
at
org.apache.commons.httpclient.methods.EntityEnclosingMethod.writeRequestBody(EntityEnclosingMethod.java:487)
at
org.apache.commons.httpclient.HttpMethodBase.writeRequest(HttpMethodBase.java:2114)
at
org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java:1096)
at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:398)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
... 11 more
2012-05-01 20:44:37,074 ERROR solr.SolrIndexer - java.io.IOException: Job
failed!
It's running on a single machine and no hadoop.
It's indexing around 50-80 000 smaller documents. Worked flawless in 1.4
Thats about it :-)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira