Barrett Oglesby created GEODE-7499:
--------------------------------------
Summary: CI failure: UpgradeTestOpenJDK8 Build 1299 timed out
Key: GEODE-7499
URL: https://issues.apache.org/jira/browse/GEODE-7499
Project: Geode
Issue Type: Bug
Components: lucene
Reporter: Barrett Oglesby
UpgradeTestOpenJDK8 Build 1299 timed out while executing lucene tests, although
I'm not sure it's a lucene issue.
https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1299
Test results show 852 tests completed, none of them in the
org.apache.geode.cache.lucene package:
http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0047/test-results/upgradeTest/1574709717/
The geode-lucene/build/upgradeTest/upgradeTest-progress.txt file shows 108
tests completed successfully including:
20 RollingUpgradeReindexShouldBeSuccessfulWhenAllServersRollToCurrentVersion
20 RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPersistentPartitionRegion
20 RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
20 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOver
20 RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPartitionRegion
8 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRestartedFromCurrentVersion
The builds before and after build 1299 show 980 total tests executed including
128 tests in org.apache.geode.cache.lucene package.
Build 1298:
https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1298
http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0046/test-results/upgradeTest/1574707554/
Build 1300:
https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1300
http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0047/test-results/upgradeTest/1574715336/
In addition to the 108 tests above, builds 1298 and 1300 also ran:
20 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOverAllBucketsCreated
So this test didn't run at all in build 1299.
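As a sanity check, the counts quoted in this report are consistent with each other: the per-class counts sum to 108, adding the 20 tests of the class that never ran gives the 128 lucene tests seen in builds 1298 and 1300, and 980 minus 128 gives the 852 tests reported for build 1299. A minimal Java sketch of that arithmetic follows. The class and method names are made up for illustration, and the numbers are copied from this report rather than read from the CI artifacts.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative helper only; not part of Geode or its build.
class LuceneUpgradeTestTally {
    static final int TOTAL_TESTS_IN_NEIGHBOR_BUILDS = 980; // builds 1298 and 1300
    static final int TESTS_OF_CLASS_THAT_NEVER_RAN = 20;   // ...AllBucketsCreated

    // Per-class counts of completed tests, as listed from upgradeTest-progress.txt.
    static Map<String, Integer> completedInBuild1299() {
        Map<String, Integer> counts = new LinkedHashMap<>();
        counts.put("RollingUpgradeReindexShouldBeSuccessfulWhenAllServersRollToCurrentVersion", 20);
        counts.put("RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPersistentPartitionRegion", 20);
        counts.put("RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled", 20);
        counts.put("RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOver", 20);
        counts.put("RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPartitionRegion", 20);
        counts.put("RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRestartedFromCurrentVersion", 8);
        return counts;
    }

    // Sums the per-class counts.
    static int total(Map<String, Integer> counts) {
        int sum = 0;
        for (int count : counts.values()) {
            sum += count;
        }
        return sum;
    }

    // Lucene tests expected in a complete run: completed ones plus the missing class.
    static int expectedLuceneTests() {
        return total(completedInBuild1299()) + TESTS_OF_CLASS_THAT_NEVER_RAN;
    }

    public static void main(String[] args) {
        int completed = total(completedInBuild1299());
        int expectedLucene = expectedLuceneTests();
        int nonLucene = TOTAL_TESTS_IN_NEIGHBOR_BUILDS - expectedLucene;
        System.out.println("lucene tests completed in build 1299: " + completed);   // 108
        System.out.println("lucene tests in a complete run: " + expectedLucene);    // 128
        System.out.println("non-lucene tests: " + nonLucene);                       // 852
    }
}
```

The last figure matches the 852 tests in the published results for build 1299, which suggests that only the lucene results are absent from that report.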
All three callstacks for this run were stuck in the same place:
One thread was starting a locator in a member:
{noformat}
"Test worker" #25 prio=5 os_prio=0 tid=0x00007f4c90997000 nid=0x5e runnable
[0x00007f4c0fffc000]
java.lang.Thread.State: RUNNABLE
at java.net.SocketInputStream.socketRead0(Native Method)
at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
at java.net.SocketInputStream.read(SocketInputStream.java:171)
at java.net.SocketInputStream.read(SocketInputStream.java:141)
at java.io.BufferedInputStream.fill(BufferedInputStream.java:246)
at java.io.BufferedInputStream.read(BufferedInputStream.java:265)
- locked <0x00000000fa0c7c00> (a java.io.BufferedInputStream)
at java.io.DataInputStream.readByte(DataInputStream.java:265)
at sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:222)
at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:161)
at java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:227)
at java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:179)
at com.sun.proxy.$Proxy57.executeMethodOnObject(Unknown Source)
at org.apache.geode.test.dunit.internal.DUnitLauncher.startLocator(DUnitLauncher.java:293)
at org.apache.geode.test.dunit.internal.DUnitLauncher.launch(DUnitLauncher.java:222)
at org.apache.geode.test.dunit.internal.DUnitLauncher.launchIfNeeded(DUnitLauncher.java:160)
at org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.initializeDistributedTestCase(JUnit4DistributedTestCase.java:103)
{noformat}
That member was stuck here:
{noformat}
"RMI TCP Connection(1)-172.17.0.4" #32 daemon prio=5 os_prio=0
tid=0x00007f5e74001800 nid=0xcf in Object.wait() [0x00007f5e897c8000]
java.lang.Thread.State: RUNNABLE
at org.apache.geode.distributed.internal.AbstractDistributionConfig.<clinit>(AbstractDistributionConfig.java:931)
at org.apache.geode.distributed.internal.InternalLocator.<init>(InternalLocator.java:487)
at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:289)
- locked <0x00000000f9b6b8a8> (a java.lang.Object)
at org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:367)
at org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:330)
at org.apache.geode.distributed.Locator.startLocator(Locator.java:252)
at org.apache.geode.distributed.Locator.startLocatorAndDS(Locator.java:139)
at org.apache.geode.test.dunit.internal.DUnitLauncher$1.call(DUnitLauncher.java:311)
{noformat}
So, it looks like that test hung right at the beginning attempting to start a
locator.
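The second trace reports the thread as "in Object.wait()" while its state is RUNNABLE, with AbstractDistributionConfig.<clinit> as the top frame. On HotSpot, that combination is what a thread dump typically shows when a thread is waiting for a class initialization that another thread has in progress. Whether that is what happened here is not established by the traces quoted in this report, since the other thread, if any, is not shown. Purely as an illustration of the pattern, and not as a reproduction of the Geode code path, here is a minimal Java sketch in which two hypothetical classes (ConfigA and ConfigB, names invented for this example) have static initializers that depend on each other:

```java
import java.util.concurrent.CountDownLatch;

// Illustrative only: ConfigA and ConfigB are hypothetical, not Geode classes.
class ClassInitDeadlockSketch {
    // Makes each static initializer wait until the other one has started.
    static final CountDownLatch BOTH_STARTED = new CountDownLatch(2);

    static void awaitOtherInitializer() {
        BOTH_STARTED.countDown();
        try {
            BOTH_STARTED.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    static class ConfigA {
        static final int VALUE;
        static {
            awaitOtherInitializer();
            VALUE = ConfigB.VALUE + 1; // blocks: ConfigB is being initialized by the other thread
        }
    }

    static class ConfigB {
        static final int VALUE;
        static {
            awaitOtherInitializer();
            VALUE = ConfigA.VALUE + 1; // blocks: ConfigA is being initialized by the other thread
        }
    }

    // Starts one thread per class and reports whether both are still alive
    // after waiting up to waitMillis for each of them.
    static boolean bothThreadsStuck(long waitMillis) {
        Thread a = new Thread(() -> { int ignored = ConfigA.VALUE; }, "init-A");
        Thread b = new Thread(() -> { int ignored = ConfigB.VALUE; }, "init-B");
        a.setDaemon(true); // daemon threads let the JVM exit despite the deadlock
        b.setDaemon(true);
        a.start();
        b.start();
        try {
            a.join(waitMillis);
            b.join(waitMillis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return a.isAlive() && b.isAlive();
    }

    public static void main(String[] args) {
        System.out.println("both threads stuck: " + bothThreadsStuck(1000));
    }
}
```

Each thread holds the initialization of its own class and then needs the other class, so neither can finish. Taking a thread dump of this sketch while it waits is one way to compare the resulting frames with the trace above.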