[
https://issues.apache.org/jira/browse/HBASE-14678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979813#comment-14979813
]
stack commented on HBASE-14678:
-------------------------------
Let me find it [~chenheng]
1.2 builds just failed with this... its the surefire-killed issue:
$ python ./dev-support/findHangingTests.py
https://builds.apache.org/view/H-L/view/HBase/job/HBase-1.2/318/jdk=latest1.7,label=Hadoop/consoleText
Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpoint
Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestRegionReplicaReplicationEndpointNoMaster
Hanging test :
org.apache.hadoop.hbase.replication.regionserver.TestReplicationWALReaderManager
Hanging test : org.apache.hadoop.hbase.wal.TestWALSplit
.... and this:
kalashnikov:hbase.git.commit2 stack$ python ./dev-support/findHangingTests.py
https://builds.apache.org/view/H-L/view/HBase/job/HBase-1.2/319/jdk=latest1.8,label=Hadoop/consoleText
Fetching
https://builds.apache.org/view/H-L/view/HBase/job/HBase-1.2/319/jdk=latest1.8,label=Hadoop/consoleText
Building remotely on H9 (Mapreduce Falcon Hadoop Pig Zookeeper Tez Hdfs) in
workspace
/home/jenkins/jenkins-slave/workspace/HBase-1.2/jdk/latest1.8/label/Hadoop
Printing hanging tests
Hanging test : org.apache.hadoop.hbase.wal.TestWALSplitCompressed
Hanging test :
org.apache.hadoop.hbase.replication.multiwal.TestReplicationSyncUpToolWithMultipleWAL
Hanging test : org.apache.hadoop.hbase.wal.TestWALSplit
Printing Failing tests
Failing test :
org.apache.hadoop.hbase.security.visibility.TestVisibilityLabelsWithACL
Let me disable TestWALSplit.
> Experiment: Temporarily disable balancer and a few others to see if root of
> crashed/timedout JVMs
> -------------------------------------------------------------------------------------------------
>
> Key: HBASE-14678
> URL: https://issues.apache.org/jira/browse/HBASE-14678
> Project: HBase
> Issue Type: Sub-task
> Components: test
> Reporter: stack
>
> Looking at recent builds of 1.2, I see a few of the runs finishing with kills
> and notice that a JVM exited without reporting back state. Running the
> hanging test finder, I can see at least that in one case that the balancer
> tests seem to be outstanding; looking in test output, seems to be still going
> on.... A few others are reported as hung but they look like they have just
> started running and are just killed by surefire.
> This issue is about trying to disable a few of the problematics like balancer
> tests to see if our overall stability improves. If so, I can concentrate on
> stabilizing these few tests. Else will just undo the experiment and put the
> tests back on line.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)