Hexin created KUDU-2915:
---------------------------

             Summary: Support to delete dead tservers from CLI
                 Key: KUDU-2915
                 URL: https://issues.apache.org/jira/browse/KUDU-2915
             Project: Kudu
          Issue Type: Improvement
          Components: CLI
    Affects Versions: 1.10.0
            Reporter: Hexin


Sometimes the nodes in the cluster will crash due to machine problems such as 
disk corruption, which can be very common. However, if there are some dead 
tservers, ksck result will always show error (e.g. Not all Tablet Servers are 
reachable) although all tables have recovered to be healthy.

The only way now to get the healthy status of ksck is to restart all masters 
one by one. In some cases, for example, if the machine has completely 
corrupted, we hope to get healthy status of ksck without restarting, since 
after restarting masters the cluster will take some time to recover, during 
which it will have influence on scanning or upsetting to tables. The recovery 
time can be long which mainly depends on the scale of cluster. This problem can 
be serious and annoying especially tservers crashed with high-frequency in a 
large cluster.

It’s valuable if we have an easier way to delete dead tservers from master, I 
will support a kudu command to realize it.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to