Mike Percy created KUDU-1724:
--------------------------------
Summary: Add ability to elect a leader with an "unsafe" number of votes
Key: KUDU-1724
URL: https://issues.apache.org/jira/browse/KUDU-1724
Project: Kudu
Issue Type: Bug
Components: consensus, ops-tooling, supportability
Reporter: Mike Percy
As a generalization of KUDU-1720, it would be useful to be able to start a
leader election that does not require a "safe" number of votes for the leader
to consider it successful.
Imagine that we have 5 replicas and 3 replicas are permanently offline. We know
that those 3 replicas will never come back online and there is currently no
leader, since the 2 remaining replicas cannot form a majority of 5. As part of
an emergency maintenance procedure, an administrator needs to get a leader back
online so that an emergency configuration change can be forced (see KUDU-1721).
The administrator potentially has a couple of options:
1. Choose a leader based on how up to date each replica's log is, based on
manual or automatic inspection
2. Run a leader election that only considers the remaining nodes as potential
voters: for example, even though the full config is 5 nodes, if there are only
2 nodes remaining then this particular election can consider there to be only 2
valid voters.
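
To illustrate option 2, here is a minimal Python sketch of the vote counting,
assuming a simple majority rule over whichever voter set is in effect for the
election. The function names and replica labels are hypothetical and are not
Kudu APIs.

```python
def majority_size(num_voters):
    """Smallest number of votes that forms a strict majority."""
    return num_voters // 2 + 1


def election_succeeds(yes_votes, voters):
    """True if the yes votes cast by members of `voters` reach a majority of `voters`."""
    counted = set(yes_votes) & set(voters)  # ignore votes from non-voters
    return len(counted) >= majority_size(len(voters))


# Hypothetical 5-node config in which only A and B are still alive.
full_config = {"A", "B", "C", "D", "E"}
remaining = {"A", "B"}

# "Safe" election: 2 votes cannot reach the majority of 5 (which is 3).
safe_result = election_succeeds({"A", "B"}, full_config)

# "Unsafe" election: the same 2 votes are a majority of the 2 remaining voters.
unsafe_result = election_succeeds({"A", "B"}, remaining)
```

With the voter set restricted to the 2 remaining nodes, both of them must
vote yes for the election to succeed.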
Requiring some form of election to take place, even if it is not a "safe"
election, is more likely to minimize data loss, since that is what the election
protocol is designed to do: only the most up-to-date remaining replica should
get elected.
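
As a rough illustration of why an election favors the most up-to-date replica,
the sketch below applies the Raft-style log comparison: a voter grants its vote
only if the candidate's last log entry is at least as recent as its own,
comparing term first and then index. The type and function names are
hypothetical and do not reflect Kudu's actual code.

```python
from typing import NamedTuple


class LastLogId(NamedTuple):
    """Term and index of the last entry in a replica's log."""
    term: int
    index: int


def would_grant_vote(voter_last, candidate_last):
    """Raft-style check: the candidate's log must be at least as up to date.

    NamedTuple comparison is lexicographic, so the term is compared first
    and the index only breaks ties.
    """
    return candidate_last >= voter_last


# Hypothetical last log entries of the two surviving replicas.
replica_a = LastLogId(term=3, index=10)
replica_b = LastLogId(term=3, index=12)

a_votes_for_b = would_grant_vote(replica_a, replica_b)  # B is ahead of A
b_votes_for_a = would_grant_vote(replica_b, replica_a)  # A is behind B
```

Under this rule, replica B can collect both votes while replica A cannot, so
only B can win an election restricted to the two remaining voters.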
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)