[ 
https://issues.apache.org/jira/browse/HBASE-22285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16827535#comment-16827535
 ] 

Aman Poonia commented on HBASE-22285:
-------------------------------------

Few thoughts
 # If we have pre-split the table then the creation time of the table and the 
regions will be very near. This way we can roughly say that this is pre-split 
region.
 # If the store’s earliest flush time and the region creation time 
match(approx) then it is a pre-split scenario(and/or it is a new region)

 

I think 2nd is a cleaner approach here.

 

> A normalizer which merges very small size regions with adjacent 
> regions.(MergeToNormalize)
> ------------------------------------------------------------------------------------------
>
>                 Key: HBASE-22285
>                 URL: https://issues.apache.org/jira/browse/HBASE-22285
>             Project: HBase
>          Issue Type: New Feature
>          Components: regionserver
>            Reporter: Aman Poonia
>            Assignee: Aman Poonia
>            Priority: Minor
>
> There are scenarios where we have seen around 5% of total regions with a size 
> of 0 bytes and another 5-6 % regions with size in a few bytes. These kinds of 
> regions increase with time considering we have TTL over the rows. 
> After exploring the option of RegionNormalizer and doing some quick runs we 
> found that that is not suitable considering it also splits the regions and 
> merges to normalize. What we really want is to split as per Split policy and 
> merge very small regions with adjacent regions to make sure we reduce 0-byte 
> regions.
> We can plugin this normalizer using the property 
> "hbase.master.normalizer.class"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to