[ 
https://issues.apache.org/jira/browse/HBASE-3259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934561#action_12934561
 ] 

Jean-Daniel Cryans commented on HBASE-3259:
-------------------------------------------

bq. Like you said, maybe this is bad for usability, not sure this is blocking 
or a bug.

I foresee that a majority of our new users will hit this issue if they have any 
sort of trouble setting up their cluster, so I think this is a blocker.

bq. You want to make it so you can just 'kill' without -9?

Not just kill, but also "hbase-daemon.sh stop regionserver" since it also 
hangs. Imagine a few machines in that state where you have to manually kill -9 
every one of them.

bq. Or you want to add timeout on RS on startup?

A timeout to the blocking, but that we retry until either the data is available 
or the region server is stopped. Like 1 or 2 seconds. 

I'm currently writing the patch.

> Can't kill the region servers when they wait on the master or the cluster 
> state znode
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-3259
>                 URL: https://issues.apache.org/jira/browse/HBASE-3259
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>            Priority: Blocker
>             Fix For: 0.90.0, 0.92.0
>
>
> With a situation like HBASE-3258, it's easy to have the region servers stuck 
> on waiting for either the master or the cluster state znode since it has no 
> timeout. You have to kill -9 them to have them shutting down. This is very 
> bad for usability.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to