Hello, We are going to implement distributed indexing for Solr - without the use of SolrCloud (so it can be easily up-scaled). We have a deadline by February to get this done, so we need to get cracking ;)
So far, we've had a look at the solr classes and thought about distributed indexing on Solr, and we have come up with these ideas: 1. We plan to modify SimplePostTool to accommodate posting to specific shards. We are going to add an optional system property to allow the user to specify a list of shards to index to Solr. Example of this being "java -Durl=http://localhost:7574/solr/collection1/update -Dshards=localhost:8983/solr,localhost:7574/solr -jar post.jar <list of XML files>" We also plan to modify server request processing to handle distributed indexing. We are looking at CommonsHttpSolrServer.java for ways to accomplish this. With all these changes, we realise that we are only modifying the Java version, and that other languages need to be updated to accommodate our changes (e.g. perl). We were wondering if there was a simple way of applying these changes we wrote in Java across all the other languages. 2. We are going to make an interface to handle distributed writing. We plan for it to sit between the Solr server and the shards - if no shards are specified, then the post.jar tool will work exactly the same way it does now. However, if the user specifies shards for post.jar, then we want a class that has extended our interface to kick into action. 3. We plan to test our results by acceptance testing (we run Solr and see if it works ourselves) and writing a test class. Does anyone have any comments to share? Thanks, Soheb Mahmood --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org