[ https://issues.apache.org/jira/browse/HBASE-16842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15576447#comment-15576447 ]
Dima Spivak commented on HBASE-16842:
-------------------------------------

We started seeing this behavior in-house at Cloudera starting with CDH 5.7, when we rebased onto HBase 1.2, though I should note we sometimes saw it with the {{calm}} monkey, as well. This would suggest stability problems with the HBase master, though I haven't opened a separate JIRA as we didn't get any useful logging to go along with the isolated failures we saw.

> Chaos policies can terminate all masters for extended periods of time
> ---------------------------------------------------------------------
>
>                 Key: HBASE-16842
>                 URL: https://issues.apache.org/jira/browse/HBASE-16842
>             Project: HBase
>          Issue Type: Bug
>          Components: integration tests
>            Reporter: Andrew Purtell
>
> Running ITBLL with the slowDeterministic monkey I observe our primary and
> backup masters in the test cluster can be both shut down by signals followed
> by no attempt to start replacements for an extended period of time. Meanwhile
> other actions continue to run that churn the regionserver fleet. Other
> monkeys may enter a similar state, but I haven't observed it. The outcome of
> the convergence of these behaviors is the eventual time out and termination
> of the running integration test, which is obvious and expected and unhelpful,
> so I believe unintentional.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)