: If that is the case, I think that my settings are correct.   I still cannot
: explain why I see such growth on all the shards at the same time.

You are misunderstanding how SolrCloud works.

Unless you go out of your way to override the document routing, Solr will 
compute a logical shard to assign each doc to using a hash on the id -- it 
doesn't matter which physical node you send the doc to, Solr will 
internally forward it to the correct physical nodes of the logical shard 
it belongs to.  That is why all of your shards grow at the same time: docs 
from any given year get spread across every shard.
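
To make that concrete, here is a rough sketch of hash-range routing.  It is 
an illustration only: zlib.crc32 is a stand-in for Solr's real hash 
function, and assign_shard / index_via are hypothetical names, not Solr 
APIs, so the shard numbers it produces will not match a real cluster.

```python
import zlib


def assign_shard(doc_id: str, num_shards: int) -> int:
    """Map a document id to a logical shard by hash range.

    ASSUMPTION: zlib.crc32 is only a stand-in hash for illustration;
    Solr's default router uses a different hash function.
    """
    h = zlib.crc32(doc_id.encode("utf-8"))  # unsigned 32-bit value
    range_size = 2**32 // num_shards        # equal slice of hash space per shard
    # Clamp so the top of the hash space still lands in the last shard.
    return min(h // range_size, num_shards - 1)


def index_via(receiving_node: str, doc_id: str, num_shards: int = 4) -> int:
    """Hypothetical helper: the receiving node only forwards the doc,
    so it plays no part in choosing the shard."""
    return assign_shard(doc_id, num_shards)


if __name__ == "__main__":
    # Same doc sent to two different nodes ends up on the same shard.
    print(index_via("node1", "2013-000042"), index_via("node3", "2013-000042"))
    # Docs from a single year are spread over the shards.
    print(sorted({assign_shard(f"2013-{i}", 4) for i in range(1000)}))
```

The only input to the decision is the id, which is the whole point: picking 
a node to send to does not pick a shard.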

If it is important to you that a single shard represents a calendar year, 
then you need to override the shard assignment algorithm -- either that, 
or use a distinct *collection* per calendar year, and then do 
multi-collection queries when you want to execute queries across multiple 
years ... it all depends on what your "common case" queries are going to 
look like...

https://cwiki.apache.org/confluence/display/solr/Shards+and+Indexing+Data+in+SolrCloud
https://cwiki.apache.org/confluence/display/solr/Distributed+Requests
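
For the collection-per-year option, a multi-collection query is just a 
normal select request with a "collection" param listing the collections to 
search.  A minimal sketch of building such a URL follows; the host and the 
docs_2013 / docs_2014 collection names are made up for the example, and 
multi_collection_query_url is a hypothetical helper, not part of any Solr 
client.

```python
from typing import List
from urllib.parse import urlencode

# ASSUMPTION: hypothetical host; substitute any node of your cluster.
BASE = "http://localhost:8983/solr"


def multi_collection_query_url(collections: List[str], q: str) -> str:
    """Build a select URL that searches several collections at once.

    The request is addressed to the first collection; the "collection"
    param lists every collection the query should cover.
    """
    if not collections:
        raise ValueError("need at least one collection")
    params = {"q": q, "collection": ",".join(collections)}
    return f"{BASE}/{collections[0]}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Query two hypothetical per-year collections in one request.
    print(multi_collection_query_url(["docs_2013", "docs_2014"], "title:solr"))
```

If most of your queries only ever touch one year, this keeps them cheap 
(one collection), and you only pay the fan-out cost on the cross-year ones.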

: One thing I noticed is that three of them are leaders in the SolrCloud
: admin UI graph.  Is that normal?

If you have 4 shards, then there should be 4 leaders -- leaders are about 
coordinating the duplicate physical copies of each doc in each replica of 
the logical shard -- if you only have 1 physical replica of each logical 
shard, then every replica is its own leader.


-Hoss
