Shalin Shekhar Mangar created SOLR-12654:
--------------------------------------------
Summary: Improve split shard to split by document counts
Key: SOLR-12654
URL: https://issues.apache.org/jira/browse/SOLR-12654
Project: Solr
Issue Type: Improvement
Security Level: Public (Default Security Level. Issues are Public)
Components: SolrCloud
Reporter: Shalin Shekhar Mangar
Fix For: master (8.0), 7.5
Today, shard splitting works by partitioning the hash range in half (or user
defined hash ranges). This works fine when pseudo-random distribution is
assumed. However, composite ID routing can lead to uneven distribution of data
due to co-location requirements. In such cases, splitting hash range by half
may not create equal shards anymore. Although we cannot split the hash domains
across ranges but we may be able to select a suitable mid-point such that two
hash domains having semi-equal document counts go to separate ranges.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]