[
https://issues.apache.org/jira/browse/KUDU-2257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Grant Henke updated KUDU-2257:
------------------------------
Labels: supportability (was: )
> Add tool to recover a tablet that has lost a majority of replicas
> -----------------------------------------------------------------
>
> Key: KUDU-2257
> URL: https://issues.apache.org/jira/browse/KUDU-2257
> Project: Kudu
> Issue Type: Improvement
> Reporter: William Berkeley
> Priority: Major
> Labels: supportability
>
> In the unfortunate case where a tablet has lost a majority of its replicas,
> the way to recover is to rewrite the config of the remaining healthy members
> so the unhealthy members are excluded, then wait for normal re-replication.
> This involves k similar commands on the k remaining healthy servers. We could
> simplify the process by adding a simple tool "kudu tablet
> unsafe_promote_minority" that does the rewrite in one step.
> The tool could use ksck to try and identify the healthy minority, or it could
> rely on the user to do that.
> The tool should also make it very clear that it may cause data loss and that
> its use is a last resort.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)