Hi,
Actually we are facing lot of issues with Solr shards in our environment.
Our environment is fully loaded with around 150 million documents where
each document will have around 50+ stored fields which has multiple values.
And also we have lot of custom components in this environment which are
Hi Daniel,
Thanks for the detailed explanation.
My understanding is also similar to you that we should not provide limit
over the shard for number of documents that it can index. Usually it will
depend on shard routing provided by Solr and I am not expecting any change
to document routing
Not sure I understand your problem. If you have 20m documents, and 8
shards, then each shard is (broadly speaking) only going to have 2.5m docs
each, so I don't follow the 5m limit? That is with the default
routing/hashing, obviously you can write your own hash algorithm or you can
shard at your
bq: We will not be able to limit the documents per shard in Solr
Cloud. As Solr will accept all the documents as long as space is there
for it to index.
True, end of story ;).
How does Solr know it will run out of space? It hits an exception,
there's really no this doesn't look like it will fit
Wait a minute, guys... aren't we in the 21st century, where disk is
ultra-cheap and ultra plentiful? So... what's the REAL problem here?
Seriously, when multi-terabyte drives are so common on servers and Solr
really doesn't work well with more than 100 to 250 million docs per server
anyway, which
A leader is also a replica - SolrCloud is not a master/slave architecture.
Any replica can be elected to be the leader, but that is only temporary and
can change over time.
You can place multiple shards on a single node, but was that really your
intention?
Generally, number of nodes equals
Jilani, you did say My team needs that option if at all possible, my
first response would be why?. Why do they want to limit the number of
documents per shard, what's the rationale/use case behind that
requirement? Once we understand that, we can explain why its a bad idea. :)
I suspect I'm
Hi,
Is it possible to restrict number of documents per shard in Solr cloud?
Lets say we have Solr cloud with 4 nodes, and on each node we have one
leader and one replica. Like wise total we have 8 shards that includes
replicas. Now I need to index my documents in such a way that each shard
will