[
https://issues.apache.org/jira/browse/GEODE-6646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16820333#comment-16820333
]
ASF subversion and git services commented on GEODE-6646:
--------------------------------------------------------
Commit 0e90b5ac2f3ada0e2813d6b518c7c70cf1986b45 in geode's branch
refs/heads/develop from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=0e90b5a ]
GEODE-6646 - CI failure in serverRestarsAfterLocatorReconnects
In this test there is a locator and there are two servers. The
locator and second servers are forced out of the cluster and
auto-reconnect. In the failure the first server becomes membership
coordinator but the other processes fail to join its cluster when
they restart. They are misconfigured to have a max-wait-time-reconnect
that is too short (5 seconds instead of 60) to allow the first server to detect
failures
before the failed server and locator start trying to reconnect.
The fix is in InternalDistributedSystem and ensures that the
waiting period before attempting to reconnect is a sufficient
multiple of the member-timeout setting.
> CI:
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
> > serverRestartsAfterLocatorReconnects FAILED
> -----------------------------------------------------------------------------------------------------------------------------------------
>
> Key: GEODE-6646
> URL: https://issues.apache.org/jira/browse/GEODE-6646
> Project: Geode
> Issue Type: Bug
> Components: gfsh, membership
> Affects Versions: 1.10.0
> Reporter: Shelley Lynn Hughes-Godfrey
> Assignee: Bruce Schuchardt
> Priority: Major
> Labels: CI
> Time Spent: 40m
> Remaining Estimate: 0h
>
> https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/617
> {noformat}
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
> > serverRestartsAfterLocatorReconnects FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition
> defined as a lambda expression in
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
>
> Expecting:
> <["locator-0", "server-2", "server-3"]>
> to contain only:
> <["locator-0", "server-1", "server-2", "server-3"]>
> but could not find the following elements:
> <["server-1"]>
> within 300 seconds.
> at
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:145)
> at
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:122)
> at
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:32)
> at
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:902)
> at
> org.awaitility.core.ConditionFactory.untilAsserted(ConditionFactory.java:723)
> at
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartsAfterLocatorReconnects(ClusterConfigLocatorRestartDUnitTest.java:81)
> Caused by:
> java.lang.AssertionError:
> Expecting:
> <["locator-0", "server-2", "server-3"]>
> to contain only:
> <["locator-0", "server-1", "server-2", "server-3"]>
> but could not find the following elements:
> <["server-1"]>
> at
> org.apache.geode.test.junit.assertions.CommandResultAssert.tableHasColumnOnlyWithValues(CommandResultAssert.java:308)
> at
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.lambda$serverRestartsAfterLocatorReconnects$0(ClusterConfigLocatorRestartDUnitTest.java:82)
> {noformat}
> Artifacts available here:
> {noformat}
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-= Test Results URI
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.10.0-SNAPSHOT.0177/test-results/distributedTest/1555101232/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.10.0-SNAPSHOT.0177/test-artifacts/1555101232/distributedtestfiles-OpenJDK8-1.10.0-SNAPSHOT.0177.tgz
> {noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)