Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
be routed to shard1 while deu goes > to shard2, and esp and chs gets indexed in either of them. Or, all of them > can potentially end up getting indexed in the same shard, either 1 or 2, > leaving one shard under-utilized. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-Cloud-sharding-strategy-tp4262274p4262336.html > Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread shamik
will be routed to shard1 while deu goes to shard2, and esp and chs gets indexed in either of them. Or, all of them can potentially end up getting indexed in the same shard, either 1 or 2, leaving one shard under-utilized. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Cloud

Re: Solr Cloud sharding strategy

2016-03-07 Thread Walter Underwood
size >> point of view, the size can grow up to half a TB from its current state. >> Honestly, my perception of "big" index is still vague :-) . All I'm trying >> to make sure is that decision I take is scalable in the long term and will >> be able to sustain the growth without

Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
romising the performance. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-Cloud-sharding-strategy-tp4262274p4262304.html > Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread shamik
e.com/Solr-Cloud-sharding-strategy-tp4262274p4262304.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
20M docs is actually a very small collection by the "usual" Solr standards unless they're _really_ large documents, i.e. large books. Actually, I wouldn't even shard to begin with, it's unlikely that it's necessary and it adds inevitable overhead. If you _must_ shard, just go with <1>, but again

Solr Cloud sharding strategy

2016-03-07 Thread Shamik Bandopadhyay
Hi, I'm trying to figure the best way to design/allocate shards for our Solr Cloud environment.Our current index has around 20 million documents, in 10 languages. Around 25-30% of the content is in English. Rest are almost equally distributed among the remaining 13 languages. Till now, we had