[ 
https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14945948#comment-14945948
 ] 

stack commented on HBASE-14420:
-------------------------------

Looking at trunk builds, I see these failures in the last twenty builds:

{code}
   2 Hanging test : org.apache.hadoop.hbase.TestMovedRegionsCleaner
   2 Hanging test : org.apache.hadoop.hbase.TestMultiVersions
   2 Hanging test : org.apache.hadoop.hbase.TestPartialResultsFromClientSide
   2 Hanging test : org.apache.hadoop.hbase.backup.TestHFileArchiving
  10 Hanging test : org.apache.hadoop.hbase.client.TestFromClientSide
   8 Hanging test : org.apache.hadoop.hbase.client.TestFromClientSideWithCoprocessor
   2 Hanging test : org.apache.hadoop.hbase.client.TestReplicasClient
   2 Hanging test : org.apache.hadoop.hbase.http.TestHttpServer
   2 Hanging test : org.apache.hadoop.hbase.mapred.TestMultiTableSnapshotInputFormat
   4 Hanging test : org.apache.hadoop.hbase.mapred.TestTableSnapshotInputFormat
   2 Hanging test : org.apache.hadoop.hbase.mapreduce.TestHFileOutputFormat
   4 Hanging test : org.apache.hadoop.hbase.mapreduce.TestHFileOutputFormat2
   2 Hanging test : org.apache.hadoop.hbase.mapreduce.TestImportExport
   2 Hanging test : org.apache.hadoop.hbase.mapreduce.TestSecureLoadIncrementalHFilesSplitRecovery
   2 Hanging test : org.apache.hadoop.hbase.master.TestDistributedLogSplitting
   4 Hanging test : org.apache.hadoop.hbase.master.TestSplitLogManager
   2 Hanging test : org.apache.hadoop.hbase.master.TestTableLockManager
   2 Hanging test : org.apache.hadoop.hbase.master.TestWarmupRegion
   2 Hanging test : org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer
   2 Hanging test : org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer2
   4 Hanging test : org.apache.hadoop.hbase.master.procedure.TestMasterFailoverWithProcedures
   1 Hanging test : org.apache.hadoop.hbase.regionserver.TestDefaultCompactSelection
   2 Hanging test : org.apache.hadoop.hbase.regionserver.TestMobStoreScanner
   2 Hanging test : org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpoint
   2 Hanging test : org.apache.hadoop.hbase.replication.regionserver.TestReplicationWALReaderManager
   2 Hanging test : org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDeletes
   2 Hanging test : org.apache.hadoop.hbase.util.TestHBaseFsck
   2 Hanging test : org.apache.hadoop.hbase.util.TestMiniClusterLoadEncoded
   2 Hanging test : org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel
   2 Hanging test : org.apache.hadoop.hbase.util.TestRegionSplitter
   2 Hanging test : org.apache.hadoop.hbase.wal.TestWALFiltering
   2 Hanging test : org.apache.hadoop.hbase.wal.TestWALSplit
   2 Hanging test : org.apache.hadoop.hbase.zookeeper.TestHQuorumPeer
{code}

Here are the branch-1.1 builds:

{code}
   1 Hanging test : org.apache.hadoop.hbase.TestPartialResultsFromClientSide
   1 Hanging test : org.apache.hadoop.hbase.client.TestAdmin1
   1 Hanging test : org.apache.hadoop.hbase.client.TestCloneSnapshotFromClient
   1 Hanging test : org.apache.hadoop.hbase.client.TestFromClientSideWithCoprocessor
   1 Hanging test : org.apache.hadoop.hbase.client.TestHCM
   1 Hanging test : org.apache.hadoop.hbase.client.TestRestoreSnapshotFromClient
   1 Hanging test : org.apache.hadoop.hbase.mapred.TestTableSnapshotInputFormat
   1 Hanging test : org.apache.hadoop.hbase.quotas.TestQuotaAdmin
   1 Hanging test : org.apache.hadoop.hbase.quotas.TestQuotaThrottle
   1 Hanging test : org.apache.hadoop.hbase.regionserver.TestJoinedScanners
   1 Hanging test : org.apache.hadoop.hbase.regionserver.TestTags
   1 Hanging test : org.apache.hadoop.hbase.regionserver.TestZKLessSplitOnCluster
   1 Hanging test : org.apache.hadoop.hbase.regionserver.wal.TestLogRolling
   1 Hanging test : org.apache.hadoop.hbase.replication.TestMasterReplication
   1 Hanging test : org.apache.hadoop.hbase.replication.TestReplicationSmallTests
   1 Hanging test : org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpoint
   1 Hanging test : org.apache.hadoop.hbase.replication.regionserver.TestReplicationWALReaderManager
   1 Hanging test : org.apache.hadoop.hbase.security.access.TestAccessController
   1 Hanging test : org.apache.hadoop.hbase.security.access.TestAccessController2
   1 Hanging test : org.apache.hadoop.hbase.security.access.TestTablePermissions
   1 Hanging test : org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsReplication
   1 Hanging test : org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDefaultVisLabelService
   2 Hanging test : org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDeletes
   1 Hanging test : org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithDistributedLogReplay
{code}
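
For anyone wanting to reproduce tallies like the two above, here is a minimal sketch of one way to build them. It assumes each build's console log reports zombies on lines of the form "Hanging test : <class>", as in the listings; the function names tally_hanging_tests and format_tally are hypothetical and this is not the script actually used for the numbers in this comment.

```python
import re
from collections import Counter

# Assumed report format, copied from the listings above:
#   "Hanging test : <fully.qualified.TestClass>"
# \s* also matches a newline, so a class name wrapped onto the next line still counts.
HANGING = re.compile(r"Hanging test\s*:\s*(\S+)")


def tally_hanging_tests(logs):
    """Count how often each test class is reported as hanging.

    `logs` is an iterable of console-log texts, one string per build.
    """
    counts = Counter()
    for text in logs:
        counts.update(HANGING.findall(text))
    return counts


def format_tally(counts):
    """Render the tally sorted by class name, one 'N Hanging test : X' line each."""
    return "\n".join(
        f"{n:4d} Hanging test : {name}" for name, n in sorted(counts.items())
    )
```

Feeding it the console text of the last N builds and printing format_tally(...) gives a listing in the same layout as above.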



> Zombie Stomping Session
> -----------------------
>
>                 Key: HBASE-14420
>                 URL: https://issues.apache.org/jira/browse/HBASE-14420
>             Project: HBase
>          Issue Type: Umbrella
>          Components: test
>            Reporter: stack
>            Assignee: stack
>            Priority: Critical
>         Attachments: hangers.txt, none_fix.txt
>
>
> Patch builds are now failing most of the time because we are dropping zombies. 
> I confirm we are doing this on non-apache build boxes too.
> Left-over zombies consume resources on build boxes (OOME cannot create native 
> threads). Having to do multiple test runs in the hope that we can get a 
> non-zombie-making build or making (arbitrary) rulings that the zombies are 
> 'not related' is a productivity sink. And so on...
> This is an umbrella issue for a zombie stomping session that started earlier 
> this week. Will hang sub-issues of this one. Am running builds back-to-back 
> on a little cluster to turn out the monsters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
