Charles Connell created HBASE-27496:
---------------------------------------

             Summary: Limit size of plans produced by SimpleRegionNormalizer
                 Key: HBASE-27496
                 URL: https://issues.apache.org/jira/browse/HBASE-27496
             Project: HBase
          Issue Type: Improvement
          Components: Normalizer
            Reporter: Charles Connell


My company (Hubspot) is starting to use {{{}SimpleRegionNormalizer{}}}. We turn 
the normalizer switch on for 30 minutes each day, when our database traffic is 
at a low point. We're using theĀ 
{{hbase.normalizer.throughput.max_bytes_per_sec}} setting to create a rate 
limit. I've found that while the {{SimpleRegionNormalizer}} only produces new 
plans for 30 minutes each day, the plans often take many hours to execute. This 
leds to region splits, merges, and moves occurring in our HBase clusters during 
hours we'd prefer them not to.{color:#067d17}
{color}

I propose two new settings:
 * {{hbase.normalizer.merge.plans_size_limit.mb}}
 * {{hbase.normalizer.split.plans_size_limit.mb}}

This will allow HBase administrators to limit the number of plans produced by a 
run of {{{}SimpleRegionNormalizer{}}}. This will give you a way to limit 
approximately how long it takes to execute the plans. Because the current limit 
to execute plans is primarily determined by a per-byte rate limit, I propose 
that the new settings also work on a similar basis. This will make it feasible 
to reason about how your rate limit and your size limits interact.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to