[ https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876614#comment-14876614 ]
stack commented on HBASE-14420:
-------------------------------
On my test rig, got these failures in recent runs:
Failed tests:
TestStochasticLoadBalancer2.testRegionReplicasOnMidClusterHighReplication:73->BalancerTestBase.testWithCluster:422->BalancerTestBase.testWithCluster:450->BalancerTestBase.assertRegionReplicaPlacement:225
Two or more region replicas are hosted on the same host after balance
Got this again too...
TestHttpServerLifecycle.testStoppedServerIsNotAlive:97->HttpServerFunctionalTest.stop:195
» TestTimedOut
Otherwise stuff is generally passing.
> Zombie Stomping Session
> -----------------------
>
> Key: HBASE-14420
> URL: https://issues.apache.org/jira/browse/HBASE-14420
> Project: HBase
> Issue Type: Umbrella
> Components: test
> Reporter: stack
> Assignee: stack
> Priority: Critical
>
> Patch builds are now failing most of the time because we are dropping zombies.
> I confirm we are doing this on non-apache build boxes too.
> Left-over zombies consume resources on build boxes (OOME cannot create native
> threads). Having to do multiple test runs in the hope that we can get a
> non-zombie-making build or making (arbitrary) rulings that the zombies are
> 'not related' is a productivity sink. And so on...
> This is an umbrella issue for a zombie stomping session that started earlier
> this week. Will hang sub-issues of this one. Am running builds back-to-back
> on little cluster to turn out the monsters.