[ https://issues.apache.org/jira/browse/HBASE-13965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14621457#comment-14621457 ]
Lei Chen commented on HBASE-13965: ---------------------------------- I have found a problem related to HBASE-5231(per-table load balancing). It seems that the balancing is done by an iteration of tables. https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java#L1219-L1228 This can be configured to be in per-table mode or ensemble mode. https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/master/RegionStates.java#L956-L962 In ensemble mode, all the tables are copied into an "ensemble" table for balancing. The configuration is set by {{hbase.master.loadbalancer.bytable}} My question is how to name the metrics if the balancing is in ensemble mode. For example, suppose we have two tables, "Table1" and "Table2", and N cost functions. In the per-table mode, each table will have an overall cost and one for each cost function. {{Table1_Overall}} {{Table1_costFunction}} x N {{Table2_Overall}} {{Table2_costFunction}} x N In the ensemble mode, there will be only one overall and one set of function costs. {{ensemble_Overall}} {{ensemble_costFunction}} x N Can we use a special name for the combined table, e.g. "ensemble"? The problem is that the user may have already created a table named "ensemble", which may cause confusion. Any idea on this problem? > Stochastic Load Balancer JMX Metrics > ------------------------------------ > > Key: HBASE-13965 > URL: https://issues.apache.org/jira/browse/HBASE-13965 > Project: HBase > Issue Type: Improvement > Components: Balancer, metrics > Reporter: Lei Chen > Assignee: Lei Chen > Attachments: HBASE-13965-v3.patch, HBASE-13965-v4.patch, > HBASE-13965-v5.patch, HBASE-13965-v6.patch, HBASE-13965_v2.patch, > HBase-13965-v1.patch, stochasticloadbalancerclasses_v2.png > > > Today’s default HBase load balancer (the Stochastic load balancer) is cost > function based. The cost function weights are tunable but no visibility into > those cost function results is directly provided. > A driving example is a cluster we have been tuning which has skewed rack size > (one rack has half the nodes of the other few racks). We are tuning the > cluster for uniform response time from all region servers with the ability to > tolerate a rack failure. Balancing LocalityCost, RegionReplicaRack Cost and > RegionCountSkew Cost is difficult without a way to attribute each cost > function’s contribution to overall cost. > What this jira proposes is to provide visibility via JMX into each cost > function of the stochastic load balancer, as well as the overall cost of the > balancing plan. -- This message was sent by Atlassian JIRA (v6.3.4#6332)