Better key splitting in RegionSplitter
--------------------------------------

                 Key: HBASE-4489
                 URL: https://issues.apache.org/jira/browse/HBASE-4489
             Project: HBase
          Issue Type: Improvement
            Reporter: Dave Revell


The RegionSplitter utility allows users to create a pre-split table from the 
command line or do a rolling split on an existing table. It supports pluggable 
split algorithms that implement the SplitAlgorithm interface. The only/default 
SplitAlgorithm is one that assumes keys fall in the range from ASCII string 
"00000000" to ASCII string "7FFFFFFF". This is not a sane default, and seems 
useless to most users. Users are likely to be surprised by the fact that all 
the region splits occur in in the byte range of ASCII characters.

A better default split algorithm would be one that evenly divides the space of 
all bytes, which is what this patch does. Making a table with five regions 
would split at \x33\x33..., \x66\x66...., \x99\x99..., \xCC\xCC..., and 
\xFF\xFF.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to