Hello,

We are going to implement distributed indexing for Solr without using
SolrCloud, so that it can be scaled up easily. Our deadline is
February, so we need to get cracking ;)

So far, we have looked at the Solr classes, thought about how
distributed indexing could work, and come up with these ideas:

1. We plan to modify SimplePostTool to support posting to specific
shards. We are going to add an optional system property that lets the
user specify the list of shards to index to. For example:

    java -Durl=http://localhost:7574/solr/collection1/update
    -Dshards=localhost:8983/solr,localhost:7574/solr -jar post.jar <list of
    XML files>
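
A rough sketch of how the proposed "shards" property might be read,
assuming it is a plain comma-separated list as in the example above.
ShardListParser is a hypothetical helper name, not an existing Solr
class, and skipping blank entries is our own assumption:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper (not part of Solr): parses the proposed -Dshards value.
final class ShardListParser {
    private ShardListParser() {}

    /** Splits a comma-separated shard list; returns an empty list if unset or blank. */
    static List<String> parse(String raw) {
        if (raw == null || raw.trim().isEmpty()) {
            return Collections.emptyList();
        }
        List<String> shards = new ArrayList<>();
        for (String part : raw.split(",")) {
            String shard = part.trim();
            // Assumption: empty entries (e.g. from a trailing comma) are ignored.
            if (!shard.isEmpty()) {
                shards.add(shard);
            }
        }
        return shards;
    }

    /** Reads the proposed "shards" system property (set with -Dshards=...). */
    static List<String> fromSystemProperty() {
        return parse(System.getProperty("shards"));
    }
}
```

With the command line above, fromSystemProperty() would return the two
entries localhost:8983/solr and localhost:7574/solr; with no -Dshards
given it returns an empty list, so the tool can fall back to its
current behaviour.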

We also plan to modify server request processing to handle distributed
indexing. We are looking at CommonsHttpSolrServer.java for ways to
accomplish this.

With all these changes, we realise that we are only modifying the Java
client, and that the clients for other languages (e.g. Perl) would need
to be updated to match. We were wondering whether there is a simple way
to apply the changes we make in Java to the other language clients.

2. We are going to define an interface to handle distributed writing. We
plan for it to sit between the Solr server and the shards. If no shards
are specified, the post.jar tool will work exactly as it does now.
However, if the user specifies shards for post.jar, a class
implementing our interface will take over.
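
To make the idea concrete, here is a minimal sketch of what such an
interface could look like. All names (ShardSelector, HashShardSelector,
DistributedWriter) are hypothetical and do not exist in Solr. Routing by
a hash of the document id is only one possible policy that we picked for
illustration, and the "http://" + shard + "/update" URL layout is an
assumption based on the example command:

```java
import java.util.List;

// Hypothetical interface: decides which shard receives a document.
interface ShardSelector {
    /** Returns a 0-based shard index in the range [0, shardCount). */
    int selectShard(String docId, int shardCount);
}

// One possible policy (an assumption, not a decided design): hash the id.
final class HashShardSelector implements ShardSelector {
    @Override
    public int selectShard(String docId, int shardCount) {
        if (shardCount <= 0) {
            throw new IllegalArgumentException("shardCount must be positive");
        }
        // floorMod keeps the result non-negative even for negative hash codes.
        return Math.floorMod(docId.hashCode(), shardCount);
    }
}

// Hypothetical glue class: picks the update URL for each document.
final class DistributedWriter {
    private final String defaultUrl;
    private final List<String> shards;
    private final ShardSelector selector;

    DistributedWriter(String defaultUrl, List<String> shards, ShardSelector selector) {
        this.defaultUrl = defaultUrl;
        this.shards = shards;
        this.selector = selector;
    }

    /** Returns the URL a document should be posted to. */
    String targetFor(String docId) {
        if (shards.isEmpty()) {
            // No shards given: behave exactly like the current tool.
            return defaultUrl;
        }
        String shard = shards.get(selector.selectShard(docId, shards.size()));
        // Assumed URL layout; the real handler path may differ.
        return "http://" + shard + "/update";
    }
}
```

Keeping the routing policy behind its own interface would let us swap
in a different strategy later without touching the posting code.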

3. We plan to test our work through manual acceptance testing (running
Solr ourselves and checking that it works) and by writing a test class.

Does anyone have any comments to share?

Thanks,
Soheb Mahmood

