Barrett Oglesby created GEODE-7499:
--------------------------------------

             Summary: CI failure: UpgradeTestOpenJDK8 Build 1299 timed out
                 Key: GEODE-7499
                 URL: https://issues.apache.org/jira/browse/GEODE-7499
             Project: Geode
          Issue Type: Bug
          Components: lucene
            Reporter: Barrett Oglesby


UpgradeTestOpenJDK8 Build 1299 times out executing lucene tests, although I'm 
not sure its a lucene issue.

https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1299

Test results show 852 tests completed with no tests in 
org.apache.geode.cache.lucene package:

http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0047/test-results/upgradeTest/1574709717/

The geode-lucene/build/upgradeTest/upgradeTest-progress.txt file shows 108 
tests completed successfully including:

20 RollingUpgradeReindexShouldBeSuccessfulWhenAllServersRollToCurrentVersion
20 
RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPersistentPartitionRegion
20 
RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
20 RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOver
20 RollingUpgradeQueryReturnsCorrectResultsAfterServersRollOverOnPartitionRegion
8 
RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRestartedFromCurrentVersion

The builds before and after build 1299 show 980 total tests executed including 
128 tests in org.apache.geode.cache.lucene package.

Build 1298:

https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1298

http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0046/test-results/upgradeTest/1574707554/

Build 1300:

https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/1300

http://files.apachegeode-ci.info/builds/apache-develop-main/1.12.0-SNAPSHOT.0047/test-results/upgradeTest/1574715336/

In addtition to the 108 tests above, there are:

20 
RollingUpgradeQueryReturnsCorrectResultsAfterClientAndServersAreRolledOverAllBucketsCreated

So, this test didn't run at all in build 1299

All three callstacks for this run were stuck in the same place:

One thread was starting a locator in a member:
{noformat}
"Test worker" #25 prio=5 os_prio=0 tid=0x00007f4c90997000 nid=0x5e runnable 
[0x00007f4c0fffc000]
   java.lang.Thread.State: RUNNABLE
        at java.net.SocketInputStream.socketRead0(Native Method)
        at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
        at java.net.SocketInputStream.read(SocketInputStream.java:171)
        at java.net.SocketInputStream.read(SocketInputStream.java:141)
        at java.io.BufferedInputStream.fill(BufferedInputStream.java:246)
        at java.io.BufferedInputStream.read(BufferedInputStream.java:265)
        - locked <0x00000000fa0c7c00> (a java.io.BufferedInputStream)
        at java.io.DataInputStream.readByte(DataInputStream.java:265)
        at 
sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:222)
        at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:161)
        at 
java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:227)
        at 
java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:179)
        at com.sun.proxy.$Proxy57.executeMethodOnObject(Unknown Source)
        at 
org.apache.geode.test.dunit.internal.DUnitLauncher.startLocator(DUnitLauncher.java:293)
        at 
org.apache.geode.test.dunit.internal.DUnitLauncher.launch(DUnitLauncher.java:222)
        at 
org.apache.geode.test.dunit.internal.DUnitLauncher.launchIfNeeded(DUnitLauncher.java:160)
        at 
org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.initializeDistributedTestCase(JUnit4DistributedTestCase.java:103)
{noformat}
That member was stuck here:
{noformat}
"RMI TCP Connection(1)-172.17.0.4" #32 daemon prio=5 os_prio=0 
tid=0x00007f5e74001800 nid=0xcf in Object.wait() [0x00007f5e897c8000]
   java.lang.Thread.State: RUNNABLE
        at 
org.apache.geode.distributed.internal.AbstractDistributionConfig.<clinit>(AbstractDistributionConfig.java:931)
        at 
org.apache.geode.distributed.internal.InternalLocator.<init>(InternalLocator.java:487)
        at 
org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:289)
        - locked <0x00000000f9b6b8a8> (a java.lang.Object)
        at 
org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:367)
        at 
org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:330)
        at org.apache.geode.distributed.Locator.startLocator(Locator.java:252)
        at 
org.apache.geode.distributed.Locator.startLocatorAndDS(Locator.java:139)
        at 
org.apache.geode.test.dunit.internal.DUnitLauncher$1.call(DUnitLauncher.java:311)
{noformat}
So, it looks like that test hung right at the beginning attempting to start a 
locator.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to