Hey guys, I am troubleshooting an issue on a 4.3.1 SolrCloud: 1 collection and 2 shards over 4 Solr instances, (which results in 1 core per Solr instance).
After some time in Production without issues, we are seeing errors related to the IndexWriter all over our logs and an infinite loop of failing replication from Leader on our 2 replicas. We see a flood of: "org.apache.lucene.store.AlreadyClosedException: this IndexWriter is closed" stacktraces, then the Solr replica tries to replicate/recover, then fails replication and then the following 2 errors show up: 1) "SolrIndexWriter was not closed prior to finalize(), indicates a bug -- POSSIBLE RESOURCE LEAK!!!" 2) "Error closing IndexWriter, trying rollback" (which results in a null-pointer exception). I'm guessing the best way forward would be to upgrade to latest, but that is an undertaking that will take significant time/testing. In the meantime, is there anything I can do to mitigate or understand the issue more? Does anyone know what the IndexWriter errors refer to? Below is a URL to a .txt file with summarized portions of my solr.log. Any help is really appreciated as always!! http://timvaillancourt.com.s3.amazonaws.com/tmp/solr.log-summarized.txt Thanks all, Tim