[GitHub] [hbase] bbeaudreault commented on a diff in pull request #4888: HBASE-27496 Optionally limit the cumulative size of normalization plans produced by SimpleRegionNormalizer

GitBox Mon, 21 Nov 2022 13:10:37 -0800


bbeaudreault commented on code in PR #4888:
URL: https://github.com/apache/hbase/pull/4888#discussion_r1028517667



##########
hbase-server/src/main/java/org/apache/hadoop/hbase/master/normalizer/RegionNormalizerWorker.java:
##########
@@ -207,14 +214,34 @@ private List<NormalizationPlan> calculatePlans(final 
TableName tableName) {
       return Collections.emptyList();
     }
 
-    final List<NormalizationPlan> plans = 
regionNormalizer.computePlansForTable(tblDesc);
+    List<NormalizationPlan> plans = 
regionNormalizer.computePlansForTable(tblDesc);
+
+    plans = truncateForSize(plans);
+
     if (CollectionUtils.isEmpty(plans)) {
       LOG.debug("No normalization required for table {}.", tableName);
       return Collections.emptyList();
     }
     return plans;
   }
 
+  private List<NormalizationPlan> truncateForSize(List<NormalizationPlan> 
plans) {

Review Comment:
   Sorry I think you misunderstood my request. I wouldn't normally push further 
on a logging change, but I think getting the right data will be helpful here.  
   
   We're adding a configurable limit on total bytes processed. If someone is 
ever going to want to tune this, it'll be helpful to know how many total bytes 
are necessary to normalize. Even logging this over time will be useful to have 
a sense for throughput over time. So in addition to the list sizes, I think we 
should include the size we truncated at and the total size calculated by 
`calculatePlans`. That's why I was saying we might need to remove the `break`, 
since it'll require iterating all plans. I think it's still worth it since this 
isn't a super cpu intensive process.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@hbase.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [hbase] bbeaudreault commented on a diff in pull request #4888: HBASE-27496 Optionally limit the cumulative size of normalization plans produced by SimpleRegionNormalizer

Reply via email to