[
https://issues.apache.org/jira/browse/HBASE-23958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17065191#comment-17065191
]
Nick Dimiduk commented on HBASE-23958:
--------------------------------------
[~ram_krish] give this a spin with the latest branch-2 or branch-2.3.
HBASE-23984 fixes a minor accounting bug in RIT tracking in the master.
> Balancer keeps balancing indefinitely
> --------------------------------------
>
> Key: HBASE-23958
> URL: https://issues.apache.org/jira/browse/HBASE-23958
> Project: HBase
> Issue Type: Bug
> Components: Region Assignment
> Affects Versions: 2.0.2
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Priority: Major
> Fix For: 2.3.0
>
>
> Before raising this issue - am not sure if this got fixed directly or
> indirectly in other latest versions of hbase.
> The steps are
> 1) Create a cluster and create some tables. (assume we have RS 1,2,3, 4 and
> 5)
> 2) After the table creation and some ops done, the cluster was restarted. Due
> to this some regions are in RIT. the RIT in progress was to be assigned to
> RS 3.
> 3) After the cluster comes back RS 3 and 4 are stopped. (RS 3 will have
> newer timestamp)
> 4) Now the master that comes up sees there are some RIT in place and tries to
> load the entries to process the procedures again. As part of this the
> RegionStateStore is populated with the old RS 3 hostname. (older timestamp).
> This adds to the ServerStateNode creating a RS 3 with old timestamp as one
> server.
> 5) Now after the master restarts and all regions assigned, the balancer
> infinitely tries to balance the region to the RS 3 (old timestamp server)
> thinking it is part of the cluster.
> 6)the other problem is the MoveProcedure has the target as RS 3 (with old
> timestamp) but the AM realizes that it is a down server and move it to the
> one of the active server. But this is not recorded anywhere.
> I will continue to check the latest code if this case is valid.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)