[
https://issues.apache.org/jira/browse/HBASE-13266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14365810#comment-14365810
]
stack commented on HBASE-13266:
-------------------------------
Yeah. We've been seeing this. What would you recommend Mr. Regex?
Here is what we have up in jenkins to run post build:
{code}
ZOMBIE_TESTS_COUNT=`jps | grep surefirebooter | wc -l`
if [[ $ZOMBIE_TESTS_COUNT != 0 ]] ; then
#It seems sometimes the tests are not dying immediately. Let's give them 10s
echo "Suspicious java process found - waiting 10s to see if there are just
slow to stop"
sleep 10
ZOMBIE_TESTS_COUNT=`jps | grep surefirebooter | wc -l`
if [[ $ZOMBIE_TESTS_COUNT != 0 ]] ; then
echo "There are $ZOMBIE_TESTS_COUNT zombie tests, they should have been
killed by surefire but survived"
echo "************ BEGIN zombies jstack extract"
ZB_STACK=`jps | grep surefirebooter | cut -d ' ' -f 1 | xargs -n 1 jstack
| grep ".test" | grep "\.java"`
jps | grep surefirebooter | cut -d ' ' -f 1 | xargs -n 1 jstack
echo "************ END zombies jstack extract"
JIRA_COMMENT="$JIRA_COMMENT
{color:red}-1 core zombie tests{color}. There are ${ZOMBIE_TESTS_COUNT}
zombie test(s): ${ZB_STACK}"
BAD=1
jps | grep surefirebooter | cut -d ' ' -f 1 | xargs kill -9
else
echo "We're ok: there is no zombie test, but some tests took some time to
stop"
fi
else
echo "We're ok: there is no zombie test"
fi
{code}
... add a '-ei' and search for hbase?
> test-patch.sh can return false positives for zombie tests from tests running
> on the same host
> ---------------------------------------------------------------------------------------------
>
> Key: HBASE-13266
> URL: https://issues.apache.org/jira/browse/HBASE-13266
> Project: HBase
> Issue Type: Bug
> Reporter: Esteban Gutierrez
>
> Just saw this here
> https://builds.apache.org/job/PreCommit-HBASE-Build/13271//consoleFull
> {code}
> [INFO] BUILD SUCCESS
> [INFO]
> ------------------------------------------------------------------------
> [INFO] Total time: 01:27 h
> [INFO] Finished at: 2015-03-16T23:58:30+00:00
> [INFO] Final Memory: 93M/844M
> [INFO]
> ------------------------------------------------------------------------
> Suspicious java process found - waiting 30s to see if there are just slow to
> stop
> There are 1 zombie tests, they should have been killed by surefire but
> survived
> ************ BEGIN zombies jstack extract
> 2015-03-16 23:59:03
> Full thread dump Java HotSpot(TM) Server VM (23.25-b01 mixed mode):
> "Attach Listener" daemon prio=10 tid=0xaa400800 nid=0x17cc waiting on
> condition [0x00000000]
> java.lang.Thread.State: RUNNABLE
> "IPC Client (47) connection to 0.0.0.0/0.0.0.0:4324 from jenkins" daemon
> prio=10 tid=0xa8d03400 nid=0x1759 in Object.wait() [0xa9c7d000]
> java.lang.Thread.State: TIMED_WAITING (on object monitor)
> at java.lang.Object.wait(Native Method)
> - waiting on <0xde1987c8> (a org.apache.hama.ipc.Client$Connection)
> at org.apache.hama.ipc.Client$Connection.waitForWork(Client.java:533)
> - locked <0xde1987c8> (a org.apache.hama.ipc.Client$Connection)
> at org.apache.hama.ipc.Client$Connection.run(Client.java:577)
> ...
> java.lang.Thread.State: TIMED_WAITING (sleeping)
> at java.lang.Thread.sleep(Native Method)
> at
> org.apache.hama.bsp.TestBSPTaskFaults.tearDown(TestBSPTaskFaults.java:618)
> at junit.framework.TestCase.runBare(TestCase.java:140)
> at junit.framework.TestResult$1.protect(TestResult.java:110)
> at junit.framework.TestResult.runProtected(TestResult.java:128)
> at junit.framework.TestResult.run(TestResult.java:113)
> at junit.framework.TestCase.run(TestCase.java:124)
> at junit.framework.TestSuite.runTest(TestSuite.java:232)
> at junit.framework.TestSuite.run(TestSuite.java:227)
> {code}
> Which is getting a jstack from a test from Hama.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)