I'm using Perl to indirectly call the Solr ExtractingRequestHandler to stream remote documents into a Solr index. Every 100 URLs I process, I do a commit. I've got about 30K documents to index. I'm using a stock, out-of-the-box Solr 1.4.1 with the necessary schema changes for the fields I'm indexing.
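For reference, here's roughly what my indexing loop looks like (a simplified sketch, not my exact code; the host/port, core path, and the "id" literal field are placeholders for whatever your setup uses):

    #!/usr/bin/perl
    use strict;
    use warnings;
    use LWP::UserAgent;
    use URI;

    # Placeholders -- adjust host/port and the unique key field to your setup.
    my $extract_url = 'http://localhost:8080/solr/update/extract';
    my $update_url  = 'http://localhost:8080/solr/update';

    my $ua = LWP::UserAgent->new( timeout => 600 );

    # @urls holds the remote document URLs to index (~30K in my case).
    my @urls = @ARGV;

    my $count = 0;
    for my $doc_url (@urls) {
        # stream.url makes Solr fetch the remote document itself
        # (requires enableRemoteStreaming="true" in solrconfig.xml).
        my $req = URI->new($extract_url);
        $req->query_form(
            'stream.url' => $doc_url,
            'literal.id' => $doc_url,    # using the URL as the unique key
        );

        my $resp = $ua->get($req);
        warn 'Failed: ' . $resp->status_line . " for $doc_url\n"
            unless $resp->is_success;

        # Commit every 100 documents.
        if ( ++$count % 100 == 0 ) {
            $ua->post( $update_url,
                'Content-Type' => 'text/xml',
                Content        => '<commit/>' );
        }
    }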

I seem to be running into performance problems about 40 documents in. I start getting "Failed: 500 read timeout" errors that last about 4 minutes each, slowing processing to a crawl. I've tried a later version of Tika (0.8), but that didn't seem to help, and I'm not sure it's the problem anyway.

Given that I'm running a pretty much unaltered version of Solr, could the out-of-the-box configuration be my problem? I'm running everything under a typical Tomcat install on a Linux VM. I understand there are performance tweaks I can make to the Solr config, but I'd like to focus first on resolving this specific problem rather than blanket-tweaking the entire config.

Is there anything in particular I should look at? Can I provide any more information?


Thanks - Tod
