[
https://issues.apache.org/jira/browse/HBASE-14309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14721303#comment-14721303
]
Hudson commented on HBASE-14309:
--------------------------------
FAILURE: Integrated in HBase-TRUNK #6766 (See
[https://builds.apache.org/job/HBase-TRUNK/6766/])
HBASE-14309 Allow load balancer to operate when there is region in transition
by adding force flag (tedyu: rev df341c4299ea21e4e1ca09652f6126633f2307c5)
*
hbase-client/src/main/java/org/apache/hadoop/hbase/protobuf/RequestConverter.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
* hbase-protocol/src/main/protobuf/Master.proto
* hbase-shell/src/main/ruby/hbase/admin.rb
* hbase-client/src/main/java/org/apache/hadoop/hbase/client/Admin.java
*
hbase-protocol/src/main/java/org/apache/hadoop/hbase/protobuf/generated/MasterProtos.java
* hbase-client/src/main/java/org/apache/hadoop/hbase/client/HBaseAdmin.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/master/RegionStates.java
*
hbase-server/src/main/java/org/apache/hadoop/hbase/master/MasterRpcServices.java
* hbase-shell/src/main/ruby/shell/commands/balancer.rb
> Allow load balancer to operate when there is region in transition by adding
> force flag
> --------------------------------------------------------------------------------------
>
> Key: HBASE-14309
> URL: https://issues.apache.org/jira/browse/HBASE-14309
> Project: HBase
> Issue Type: Improvement
> Reporter: Ted Yu
> Assignee: Ted Yu
> Fix For: 2.0.0, 1.3.0
>
> Attachments: 14309-branch-1.1.txt, 14309-v1.txt, 14309-v2.txt,
> 14309-v3.txt, 14309-v4.txt, 14309-v5-branch-1.txt, 14309-v5.txt,
> 14309-v5.txt, 14309-v6.txt, 14309-v7-branch-1.txt, 14309-v7.txt
>
>
> This issue adds boolean parameter, force, to 'balancer' command so that admin
> can force region balancing even when there is region in transition - assuming
> RIT being transient.
> This enhancement was requested by some customer.
> The assumption of this change is that the operator has run hbck and has a
> reasonable idea why regions are stuck in transition before using the force
> flag.
> There was a recent event at the customer where a cluster ended up with a
> small number of regionservers hosting most of the regions on the cluster (one
> regionserver had 50% of the roughly 20,000 regions). The balancer couldn't be
> run due to the small number of regions that were stuck in transition. The
> admin ended up killing the regionservers so that reassignment would yield a
> more equitable distribution of the regions.
> On a different cluster, there was a single store file that had corrupt HDFS
> blocks (the SSDs on the cluster were known to lose data). However, since this
> single region (out of 10s of 1000s of regions on this cluster) was stuck in
> transition, the balancer couldn't run.
> While the state keeping in HBase isn't so good yet that the admin can kick
> off the balancer automatically in such scenarios knowing when it is safe to
> do so and when it is not, having this option available for the operator to
> use as he / she sees fit seems prudent.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)