Re: Limit the documents for each shard in solr cloud

2015-05-08 Thread Jilani Shaik
Hi, Actually, we are facing a lot of issues with Solr shards in our environment. Our environment is fully loaded with around 150 million documents, where each document has 50+ stored fields, which have multiple values. We also have a lot of custom components in this environment which are

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Jilani Shaik
Hi Daniel, Thanks for the detailed explanation. My understanding is similar to yours: we should not impose a limit on the number of documents a shard can index. Usually it will depend on the shard routing provided by Solr, and I am not expecting any change to document routing

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Daniel Collins
Not sure I understand your problem. If you have 20m documents and 8 shards, then each shard is (broadly speaking) only going to have 2.5m docs, so I don't follow the 5m limit? That is with the default routing/hashing; obviously you can write your own hash algorithm or you can shard at your
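
The even split described above can be illustrated with a minimal Python sketch. It is not Solr's implementation: the `shard_for` and `distribution` helpers are hypothetical, and MD5 stands in for whatever hash the router actually uses (Solr's default compositeId router uses a MurmurHash3 variant). The document count is scaled down from 20m to keep the run short.

```python
import hashlib
from collections import Counter


def shard_for(doc_id: str, num_shards: int) -> int:
    """Map a document ID to a shard index by hashing.

    Assumption: MD5 is only a stand-in here, not Solr's real hash function.
    """
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    # Treat the first 4 bytes as an unsigned 32-bit value, then split the
    # 32-bit range into num_shards equal slices.
    value = int.from_bytes(digest[:4], "big")
    return value * num_shards // 2**32


def distribution(num_docs: int, num_shards: int) -> Counter:
    """Count how many synthetic documents land on each shard."""
    return Counter(shard_for(f"doc-{i}", num_shards) for i in range(num_docs))


counts = distribution(200_000, 8)
for shard, count in sorted(counts.items()):
    print(f"shard{shard}: {count} docs")
```

Each shard ends up with close to one eighth of the documents, which is why a per-shard cap is not something hash routing needs or offers: the share per shard follows from the total and the shard count.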

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Erick Erickson
bq: We will not be able to limit the documents per shard in Solr Cloud. As Solr will accept all the documents as long as space is there for it to index. True, end of story ;). How does Solr know it will run out of space? It hits an exception; there's really no "this doesn't look like it will fit"

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Jack Krupansky
Wait a minute, guys... aren't we in the 21st century, where disk is ultra-cheap and ultra-plentiful? So... what's the REAL problem here? Seriously, when multi-terabyte drives are so common on servers and Solr really doesn't work well with more than 100 to 250 million docs per server anyway, which

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Jack Krupansky
A leader is also a replica - SolrCloud is not a master/slave architecture. Any replica can be elected to be the leader, but that is only temporary and can change over time. You can place multiple shards on a single node, but was that really your intention? Generally, the number of nodes equals

Re: Limit the documents for each shard in solr cloud

2015-05-07 Thread Daniel Collins
Jilani, you did say "My team needs that option if at all possible"; my first response would be "why?". Why do they want to limit the number of documents per shard, what's the rationale/use case behind that requirement? Once we understand that, we can explain why it's a bad idea. :) I suspect I'm

Limit the documents for each shard in solr cloud

2015-05-06 Thread Jilani Shaik
Hi, Is it possible to restrict the number of documents per shard in Solr cloud? Let's say we have a Solr cloud with 4 nodes, and on each node we have one leader and one replica. Likewise, in total we have 8 shards, including replicas. Now I need to index my documents in such a way that each shard will