Shalin Shekhar Mangar created SOLR-12654:
--------------------------------------------

             Summary: Improve split shard to split by document counts
                 Key: SOLR-12654
                 URL: https://issues.apache.org/jira/browse/SOLR-12654
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: SolrCloud
            Reporter: Shalin Shekhar Mangar
             Fix For: master (8.0), 7.5


Today, shard splitting works by partitioning the hash range in half (or user 
defined hash ranges). This works fine when pseudo-random distribution is 
assumed. However, composite ID routing can lead to uneven distribution of data 
due to co-location requirements. In such cases, splitting hash range by half 
may not create equal shards anymore. Although we cannot split the hash domains 
across ranges but we may be able to select a suitable mid-point such that two 
hash domains having semi-equal document counts go to separate ranges.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to