[ https://issues.apache.org/jira/browse/KUDU-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Song Jiacheng updated KUDU-3487:
--------------------------------
Description:
Function CheckCompleteReplace in replace rebalance tries to make the leader
step down if the replica that should be removed is the leader, but this may
get stuck for a while if the replication factor of the table is 1, since there
is no other voter to transfer leadership to.
So it should be enough to make sure the number of voters of the tablet is
greater than 1 before sending the LeaderStepDown request.
> Rebalancer: Balance for 1 replication factor tablet might stuck for leader
> step down too early
> ----------------------------------------------------------------------------------------------
>
> Key: KUDU-3487
> URL: https://issues.apache.org/jira/browse/KUDU-3487
> Project: Kudu
> Issue Type: Bug
> Affects Versions: 1.14.0
> Reporter: Song Jiacheng
> Priority: Major
> Attachments:
> Fix_a_bug_that_replace_balance_for_1_replication_factor_tablet_might_stuck_for_leader_step.patch
>
>
> Function CheckCompleteReplace in replace rebalance tries to make the
> leader step down if the replica that should be removed is the leader, but
> this may get stuck for a while if the replication factor of the table is 1,
> since there is no other voter to transfer leadership to.
> So it should be enough to make sure the number of voters of the tablet is
> greater than 1 before sending the LeaderStepDown request.
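
For illustration only, the guard described above could look roughly like the
sketch below. This is not the attached patch: the Replica struct, MemberType
enum, and both helper functions are hypothetical stand-ins, and the real
rebalancer code works with Kudu's own Raft config types. The sketch only shows
the idea of counting voters before asking the leader to step down.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical, simplified stand-ins for a tablet's Raft configuration.
// These names are assumptions made for this sketch, not Kudu's actual types.
enum class MemberType { kVoter, kNonVoter };

struct Replica {
  std::string uuid;
  MemberType type;
  bool is_leader;
};

// Counts the voter replicas in the given configuration.
int CountVoters(const std::vector<Replica>& config) {
  return static_cast<int>(std::count_if(
      config.begin(), config.end(),
      [](const Replica& r) { return r.type == MemberType::kVoter; }));
}

// Returns true only if the replica slated for removal is currently the leader
// and at least one other voter exists that could take over leadership.
// With a single voter, a step-down request has nowhere to transfer
// leadership, so the caller should skip it and retry later.
bool ShouldRequestLeaderStepDown(const std::vector<Replica>& config,
                                 const std::string& uuid_to_remove) {
  const auto it = std::find_if(
      config.begin(), config.end(),
      [&](const Replica& r) { return r.uuid == uuid_to_remove; });
  if (it == config.end() || !it->is_leader) {
    return false;
  }
  return CountVoters(config) > 1;
}
```

With a configuration of one voter (the leader) plus one non-voter replacement,
ShouldRequestLeaderStepDown returns false, so no step-down request is sent.
Once the configuration contains a second voter, it returns true.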