[
https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14945948#comment-14945948
]
stack commented on HBASE-14420:
-------------------------------
Looking at trunk builds, I see these failures in last twenty builds:
{code}
2 Hanging test : org.apache.hadoop.hbase.TestMovedRegionsCleaner
2 Hanging test : org.apache.hadoop.hbase.TestMultiVersions
2 Hanging test : org.apache.hadoop.hbase.TestPartialResultsFromClientSide
2 Hanging test : org.apache.hadoop.hbase.backup.TestHFileArchiving
10 Hanging test : org.apache.hadoop.hbase.client.TestFromClientSide
8 Hanging test :
org.apache.hadoop.hbase.client.TestFromClientSideWithCoprocessor
2 Hanging test : org.apache.hadoop.hbase.client.TestReplicasClient
2 Hanging test : org.apache.hadoop.hbase.http.TestHttpServer
2 Hanging test :
org.apache.hadoop.hbase.mapred.TestMultiTableSnapshotInputFormat
4 Hanging test : org.apache.hadoop.hbase.mapred.TestTableSnapshotInputFormat
2 Hanging test : org.apache.hadoop.hbase.mapreduce.TestHFileOutputFormat
4 Hanging test : org.apache.hadoop.hbase.mapreduce.TestHFileOutputFormat2
2 Hanging test : org.apache.hadoop.hbase.mapreduce.TestImportExport
2 Hanging test :
org.apache.hadoop.hbase.mapreduce.TestSecureLoadIncrementalHFilesSplitRecovery
2 Hanging test : org.apache.hadoop.hbase.master.TestDistributedLogSplitting
4 Hanging test : org.apache.hadoop.hbase.master.TestSplitLogManager
2 Hanging test : org.apache.hadoop.hbase.master.TestTableLockManager
2 Hanging test : org.apache.hadoop.hbase.master.TestWarmupRegion
2 Hanging test :
org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer
2 Hanging test :
org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer2
4 Hanging test :
org.apache.hadoop.hbase.master.procedure.TestMasterFailoverWithProcedures
1 Hanging test :
org.apache.hadoop.hbase.regionserver.TestDefaultCompactSelection
2 Hanging test : org.apache.hadoop.hbase.regionserver.TestMobStoreScanner
2 Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpoint
2 Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestReplicationWALReaderManager
2 Hanging test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDeletes
2 Hanging test : org.apache.hadoop.hbase.util.TestHBaseFsck
2 Hanging test : org.apache.hadoop.hbase.util.TestMiniClusterLoadEncoded
2 Hanging test : org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel
2 Hanging test : org.apache.hadoop.hbase.util.TestRegionSplitter
2 Hanging test : org.apache.hadoop.hbase.wal.TestWALFiltering
2 Hanging test : org.apache.hadoop.hbase.wal.TestWALSplit
2 Hanging test : org.apache.hadoop.hbase.zookeeper.TestHQuorumPeer
{code}
Here is branch-1.1 builds:
{code}
1 Hanging test : org.apache.hadoop.hbase.TestPartialResultsFromClientSide
1 Hanging test : org.apache.hadoop.hbase.client.TestAdmin1
1 Hanging test : org.apache.hadoop.hbase.client.TestCloneSnapshotFromClient
1 Hanging test :
org.apache.hadoop.hbase.client.TestFromClientSideWithCoprocessor
1 Hanging test : org.apache.hadoop.hbase.client.TestHCM
1 Hanging test : org.apache.hadoop.hbase.client.TestRestoreSnapshotFromClient
1 Hanging test : org.apache.hadoop.hbase.mapred.TestTableSnapshotInputFormat
1 Hanging test : org.apache.hadoop.hbase.quotas.TestQuotaAdmin
1 Hanging test : org.apache.hadoop.hbase.quotas.TestQuotaThrottle
1 Hanging test : org.apache.hadoop.hbase.regionserver.TestJoinedScanners
1 Hanging test : org.apache.hadoop.hbase.regionserver.TestTags
1 Hanging test :
org.apache.hadoop.hbase.regionserver.TestZKLessSplitOnCluster
1 Hanging test : org.apache.hadoop.hbase.regionserver.wal.TestLogRolling
1 Hanging test : org.apache.hadoop.hbase.replication.TestMasterReplication
1 Hanging test :
org.apache.hadoop.hbase.replication.TestReplicationSmallTests
1 Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpoint
1 Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestReplicationWALReaderManager
1 Hanging test : org.apache.hadoop.hbase.security.access.TestAccessController
1 Hanging test :
org.apache.hadoop.hbase.security.access.TestAccessController2
1 Hanging test : org.apache.hadoop.hbase.security.access.TestTablePermissions
1 Hanging test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsReplication
1 Hanging test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDefaultVisLabelService
2 Hanging test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDeletes
1 Hanging test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDistributedLogReplay
{code}
> Zombie Stomping Session
> -----------------------
>
> Key: HBASE-14420
> URL: https://issues.apache.org/jira/browse/HBASE-14420
> Project: HBase
> Issue Type: Umbrella
> Components: test
> Reporter: stack
> Assignee: stack
> Priority: Critical
> Attachments: hangers.txt, none_fix.txt
>
>
> Patch build are now failing most of the time because we are dropping zombies.
> I confirm we are doing this on non-apache build boxes too.
> Left-over zombies consume resources on build boxes (OOME cannot create native
> threads). Having to do multiple test runs in the hope that we can get a
> non-zombie-making build or making (arbitrary) rulings that the zombies are
> 'not related' is a productivity sink. And so on...
> This is an umbrella issue for a zombie stomping session that started earlier
> this week. Will hang sub-issues of this one. Am running builds back-to-back
> on little cluster to turn out the monsters.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)