[ 
https://issues.apache.org/jira/browse/HBASE-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16169938#comment-16169938
 ] 

Guangxu Cheng commented on HBASE-18131:
---------------------------------------

bq.The getClusterStatus can return the list of dead servers. Perhaps the 
listDeadServers is unnecessary. WDYT? Guangxu Cheng Thanks.
Sorry, I did not notice that before. Hmm, on the server side we may not need to 
implement a listDeadServers method, but on the client side I suggest we still 
support a shell command and an API, so that users can clearly see how to get 
the list of crashed servers. What do you think? [~chia7712] [[email protected]] 
[~carp84]
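
To make the suggestion concrete, here is a rough sketch of what a client-side-only listDeadServers could look like if it simply delegates to the existing cluster status call. The types AdminView, ClusterStatusView and DeadServerClient are hypothetical stand-ins written for this illustration, not the real HBase classes, and server names are modelled as plain strings.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical stand-in for the cluster status returned by the master. */
interface ClusterStatusView {
    /** Names of servers the master currently considers dead (may be null). */
    List<String> getDeadServerNames();
}

/** Hypothetical stand-in for the client-side admin handle. */
interface AdminView {
    ClusterStatusView getClusterStatus();
}

/** Client-side convenience wrapper; needs no new server-side RPC. */
final class DeadServerClient {
    private DeadServerClient() {}

    /**
     * Returns the dead server list by reading it from the cluster status.
     * The result is a read-only copy, so callers cannot mutate master state.
     */
    static List<String> listDeadServers(AdminView admin) {
        ClusterStatusView status = admin.getClusterStatus();
        List<String> dead = status.getDeadServerNames();
        if (dead == null) {
            return Collections.<String>emptyList();
        }
        return Collections.unmodifiableList(new ArrayList<>(dead));
    }
}
```

With this shape the shell command would only need to call the client helper, and the server side stays unchanged.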


> Add an hbase shell command to clear deadserver list in ServerManager
> --------------------------------------------------------------------
>
>                 Key: HBASE-18131
>                 URL: https://issues.apache.org/jira/browse/HBASE-18131
>             Project: HBase
>          Issue Type: New Feature
>          Components: Operability
>            Reporter: Yu Li
>            Assignee: Guangxu Cheng
>             Fix For: 1.4.0, 1.5.0, 2.0.0-alpha-3
>
>         Attachments: HBASE-18131.branch-1.v1.patch, 
> HBASE-18131.branch-1.v2.patch, HBASE-18131.master.v1.patch, 
> HBASE-18131.master.v2.patch, HBASE-18131.master.v3.patch, 
> HBASE-18131.master.v4.patch, HBASE-18131.master.v5.patch, 
> HBASE-18131.master.v6.patch, HBASE-18131.master.v6.patch, 
> HBASE-18131.master.v7.patch, HBASE-18131.patch
>
>
> Currently if a regionserver is aborted due to a fatal error or stopped by an 
> operator on purpose, it will be added to the {{ServerManager#deadservers}} list 
> and shown under "Dead Servers" in the master UI. This is a valid warning that 
> lets operators notice self-aborted servers and run a sanity check to 
> avoid further issues. However, after the necessary checks, even if the operator is 
> sure that the node is decommissioned (such as for repair), there's no way to 
> clear the dead server list except restarting the master. See more details in 
> [this 
> discussion|http://mail-archives.apache.org/mod_mbox/hbase-user/201705.mbox/%3CCAM7-19%2BD4MLu2b1R94%2BtWQDspjfny2sCy4Qit8JtCgjvTOZzzg%40mail.gmail.com%3E]
>  on the mailing list.
> Here we propose adding an hbase shell command that lets advanced users clear 
> the dead server list in {{ServerManager}}; the command should be 
> executed with caution.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
