[ 
https://issues.apache.org/jira/browse/HBASE-12450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14203145#comment-14203145
 ] 

Andrew Purtell commented on HBASE-12450:
----------------------------------------

Test failure seems unrelated to this change and Hadoop unit test zombie 
definitely is.

> Unbalance chaos monkey might kill all region servers without starting them 
> back
> -------------------------------------------------------------------------------
>
>                 Key: HBASE-12450
>                 URL: https://issues.apache.org/jira/browse/HBASE-12450
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Virag Kothari
>            Assignee: Virag Kothari
>            Priority: Minor
>             Fix For: 2.0.0, 0.98.8, 0.99.2
>
>         Attachments: HBASE-12450-0.98.patch, HBASE-12450.patch, 
> HBASE-12450.patch
>
>
> UnbalanceKillAndRebalanceAction does kill, balance and then start of region 
> servers. But if the balance fails exception is thrown causing the region 
> servers to not start. For me, the balance always kept on failing with socket 
> timeout (default 1 min) as master runs one iteration of balance for 5 mins 
> (default config). Eventually all servers are killed but never started back.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to