Christian Johnsson created NUTCH-1348:
-----------------------------------------

             Summary: Solrindexer fails with a java.io.IOException error.
                 Key: NUTCH-1348
                 URL: https://issues.apache.org/jira/browse/NUTCH-1348
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.5
         Environment: Debian Stable AMD64
            Reporter: Christian Johnsson
            Priority: Minor


I'm unable to reproduce this error reliably, but it happens from time to time 
when I run solrindexer.
I use the same commands as I did with 1.4 and about the same configuration, and 
I haven't changed any Solr settings. 
I have the same plugins active, just to be able to compare.

From time to time the solrindexer throws an error. It happens about 1-2 times 
out of 5, and there is no information in the Solr log about it.
I'm not sure if it's a bug, but I thought I might as well report it, since I've 
been running 1.4 since it was released and never came across this error in that 
version.

2012-05-01 20:44:14,861 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.SocketException) caught when processing request: Connection reset
2012-05-01 20:44:14,861 INFO  httpclient.HttpMethodDirector - Retrying request
2012-05-01 20:44:15,808 INFO  solr.SolrWriter - Indexing 250 documents
2012-05-01 20:44:36,153 WARN  mapred.LocalJobRunner - job_local_0001
java.io.IOException
        at org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:152)
        at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:126)
        at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:55)
        at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:44)
        at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:440)
        at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:195)
        at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:51)
        at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
Caused by: org.apache.solr.client.solrj.SolrServerException: org.apache.commons.httpclient.ProtocolException: Unbuffered entity enclosing request can not be repeated.
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:475)
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
        at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
        at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:124)
        ... 8 more
Caused by: org.apache.commons.httpclient.ProtocolException: Unbuffered entity enclosing request can not be repeated.
        at org.apache.commons.httpclient.methods.EntityEnclosingMethod.writeRequestBody(EntityEnclosingMethod.java:487)
        at org.apache.commons.httpclient.HttpMethodBase.writeRequest(HttpMethodBase.java:2114)
        at org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java:1096)
        at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:398)
        at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
        at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
        at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
        ... 11 more
2012-05-01 20:44:37,074 ERROR solr.SolrIndexer - java.io.IOException: Job failed!
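
One plausible reading of the trace above: the connection to Solr was reset, 
the HTTP client tried to resend the request, and the resend failed because the 
request body had been streamed rather than buffered, so it could not be 
written a second time. The sketch below models only that mechanism using the 
JDK. Every type and method in it (Body, StreamedBody, BufferedBody, 
FlakyConnection, sendWithRetry) is hypothetical and is not Nutch, SolrJ, or 
Commons HttpClient code; it is an illustration under those assumptions, not a 
diagnosis or a fix for this issue.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;

/** Hypothetical model of retrying a request with a streamed vs. a buffered body. */
final class RepeatableBodyDemo {

    /** A request body that can be written to a connection. */
    interface Body {
        boolean isRepeatable();
        void writeTo(OutputStream out) throws IOException;
    }

    /** Streams from an InputStream; the bytes are gone after the first write. */
    static final class StreamedBody implements Body {
        private final InputStream in;
        private boolean consumed = false;

        StreamedBody(InputStream in) { this.in = in; }

        public boolean isRepeatable() { return !consumed; }

        public void writeTo(OutputStream out) throws IOException {
            if (consumed) {
                throw new IOException("unbuffered body cannot be repeated");
            }
            consumed = true;
            byte[] buf = new byte[4096];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
        }
    }

    /** Keeps the whole body in memory, so it can be written any number of times. */
    static final class BufferedBody implements Body {
        private final byte[] bytes;

        BufferedBody(byte[] bytes) { this.bytes = bytes.clone(); }

        public boolean isRepeatable() { return true; }

        public void writeTo(OutputStream out) throws IOException { out.write(bytes); }
    }

    /** Simulated connection that resets on the first few sends, after the body was written. */
    static final class FlakyConnection {
        private int remainingFailures;
        final ByteArrayOutputStream received = new ByteArrayOutputStream();

        FlakyConnection(int failures) { this.remainingFailures = failures; }

        void send(Body body) throws IOException {
            ByteArrayOutputStream attempt = new ByteArrayOutputStream();
            body.writeTo(attempt);
            if (remainingFailures > 0) {
                remainingFailures--;
                throw new IOException("Connection reset");
            }
            attempt.writeTo(received); // only a successful attempt is delivered
        }
    }

    /** Retries on I/O errors, but gives up as soon as the body cannot be replayed. */
    static boolean sendWithRetry(FlakyConnection conn, Body body, int maxAttempts) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                conn.send(body);
                return true;
            } catch (IOException e) {
                if (!body.isRepeatable()) {
                    return false;
                }
            }
        }
        return false;
    }

    public static void main(String[] args) {
        byte[] batch = "<add><doc>...</doc></add>".getBytes(StandardCharsets.UTF_8);
        boolean streamed = sendWithRetry(
                new FlakyConnection(1), new StreamedBody(new ByteArrayInputStream(batch)), 3);
        boolean buffered = sendWithRetry(
                new FlakyConnection(1), new BufferedBody(batch), 3);
        System.out.println("streamed body delivered: " + streamed);
        System.out.println("buffered body delivered: " + buffered);
    }
}
```

In this model a single simulated reset is enough to lose the streamed body, 
while the buffered body is delivered on the second attempt. Whether buffering 
is practical for real indexing batches depends on their size, which the sketch 
does not address.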

It's running on a single machine, without Hadoop.
It's indexing around 50,000-80,000 small documents. This worked flawlessly in 1.4.

That's about it :-)


        
