[ 
https://issues.apache.org/jira/browse/GEODE-9920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Hanson updated GEODE-9920:
-------------------------------
    Description: 
StopLocatorCommandDUnitTest.testWithInvalidMemberID failured with 
AssertionError and RegionReliabilityDistNoAckDUnitTest > testLimitedAccess 
failed with a suspicious string with a failure to respond to heartbeats. They 
are in the same CI run so it seems like this is a port conflict where there is 
overlap between the two tests as one is shutting down and the other is starting 
up.

 

Updated: This is part of the long standing problem with port binding and the 
imperfection in handling default ports in tests. In this case 41000.
{code:java}
org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest > 
testWithInvalidMemberID FAILED
    java.lang.AssertionError: 
    Expecting:
     <"Member Count : 1
      Name    | Id
    --------- | --------------------------------------------------------------
    locator-0 | 172.17.0.20(locator-0:108:locator)<ec><v0>:41000 [Coordinator]
    ">
    to contain:
     <"locatorToStop"> 
        at 
org.apache.geode.test.junit.assertions.CommandResultAssert.containsOutput(CommandResultAssert.java:87)
        at 
org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest.testWithInvalidMemberID(StopLocatorCommandDUnitTest.java:240)
{code}
{code:java}
org.apache.geode.cache30.RegionReliabilityDistNoAckDUnitTest > 
testLimitedAccess FAILED
    org.apache.geode.test.dunit.RMIException: While invoking 
org.apache.geode.cache30.RegionReliabilityTestCase$7.run in VM 0 running on 
Host 07d663f91562 with 4 VMs

        Caused by:
        org.apache.geode.distributed.DistributedSystemDisconnectedException: 
This connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests

            Caused by:
            org.apache.geode.ForcedDisconnectException: Member isn't responding 
to heartbeat requests

    java.lang.AssertionError: Suspicious strings were written to the log during 
this run.
    Fix the strings or use IgnoredException.addIgnoredException to ignore.
    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1125

    [fatal 2022/01/04 01:04:33.305 GMT <unicast receiver,07d663f91562-47705> 
tid=100] Membership service failure: Member isn't responding to heartbeat 
requests
    
org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
 Member isn't responding to heartbeat requests
      at 
org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2016)
      at 
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1083)
      at 
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:686)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1325)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1264)
      at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
      at org.jgroups.JChannel.up(JChannel.java:741)
      at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
      at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
      at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
      at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
      at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
      at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
      at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
      at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
      at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
      at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
      at org.jgroups.protocols.TP.receive(TP.java:1714)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
      at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
      at java.lang.Thread.run(Thread.java:748)

    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1191

    [error 2022/01/04 01:04:34.715 GMT <RMI TCP Connection(1)-172.17.0.12> 
tid=33] Cache initialization for GemFireCache[id = 1852143676; isClosing = 
false; isShutDownAll = false; created = Tue Jan 04 01:04:20 GMT 2022; server = 
false; copyOnRead = false; lockLease = 120; lockTimeout = 60] failed because: 
org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests

    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1195

    [error 2022/01/04 01:04:34.739 GMT <RMI TCP Connection(1)-172.17.0.12> 
tid=33] org.apache.geode.distributed.DistributedSystemDisconnectedException: 
This connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests
{code}

  was:
StopLocatorCommandDUnitTest.testWithInvalidMemberID failured with 
AssertionError and RegionReliabilityDistNoAckDUnitTest > testLimitedAccess 
failed with a suspicious string with a failure to respond to heartbeats. They 
are in the same CI run so it seems like this is a port conflict where there is 
overlap between the two tests as one is shutting down and the other is starting 
up.


{code:java}
org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest > 
testWithInvalidMemberID FAILED
    java.lang.AssertionError: 
    Expecting:
     <"Member Count : 1
      Name    | Id
    --------- | --------------------------------------------------------------
    locator-0 | 172.17.0.20(locator-0:108:locator)<ec><v0>:41000 [Coordinator]
    ">
    to contain:
     <"locatorToStop"> 
        at 
org.apache.geode.test.junit.assertions.CommandResultAssert.containsOutput(CommandResultAssert.java:87)
        at 
org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest.testWithInvalidMemberID(StopLocatorCommandDUnitTest.java:240)
{code}

{code:java}
org.apache.geode.cache30.RegionReliabilityDistNoAckDUnitTest > 
testLimitedAccess FAILED
    org.apache.geode.test.dunit.RMIException: While invoking 
org.apache.geode.cache30.RegionReliabilityTestCase$7.run in VM 0 running on 
Host 07d663f91562 with 4 VMs

        Caused by:
        org.apache.geode.distributed.DistributedSystemDisconnectedException: 
This connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests

            Caused by:
            org.apache.geode.ForcedDisconnectException: Member isn't responding 
to heartbeat requests

    java.lang.AssertionError: Suspicious strings were written to the log during 
this run.
    Fix the strings or use IgnoredException.addIgnoredException to ignore.
    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1125

    [fatal 2022/01/04 01:04:33.305 GMT <unicast receiver,07d663f91562-47705> 
tid=100] Membership service failure: Member isn't responding to heartbeat 
requests
    
org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
 Member isn't responding to heartbeat requests
      at 
org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2016)
      at 
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1083)
      at 
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:686)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1325)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1264)
      at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
      at org.jgroups.JChannel.up(JChannel.java:741)
      at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
      at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
      at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
      at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
      at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
      at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
      at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
      at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
      at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
      at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
      at org.jgroups.protocols.TP.receive(TP.java:1714)
      at 
org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
      at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
      at java.lang.Thread.run(Thread.java:748)

    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1191

    [error 2022/01/04 01:04:34.715 GMT <RMI TCP Connection(1)-172.17.0.12> 
tid=33] Cache initialization for GemFireCache[id = 1852143676; isClosing = 
false; isShutDownAll = false; created = Tue Jan 04 01:04:20 GMT 2022; server = 
false; copyOnRead = false; lockLease = 120; lockTimeout = 60] failed because: 
org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests

    -----------------------------------------------------------------------
    Found suspect string in log4j at line 1195

    [error 2022/01/04 01:04:34.739 GMT <RMI TCP Connection(1)-172.17.0.12> 
tid=33] org.apache.geode.distributed.DistributedSystemDisconnectedException: 
This connection to a distributed system has been disconnected., caused by 
org.apache.geode.ForcedDisconnectException: Member isn't responding to 
heartbeat requests
{code}



> CI Failure: StopLocatorCommandDUnitTest > testWithInvalidMemberID and 
> RegionReliabilityDistNoAckDUnitTest > testLimitedAccess failed with port 
> conflict
> -------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: GEODE-9920
>                 URL: https://issues.apache.org/jira/browse/GEODE-9920
>             Project: Geode
>          Issue Type: Bug
>          Components: tests
>    Affects Versions: 1.12.8
>            Reporter: Hale Bales
>            Assignee: Mark Hanson
>            Priority: Major
>              Labels: CI, needsTriage
>
> StopLocatorCommandDUnitTest.testWithInvalidMemberID failured with 
> AssertionError and RegionReliabilityDistNoAckDUnitTest > testLimitedAccess 
> failed with a suspicious string with a failure to respond to heartbeats. They 
> are in the same CI run so it seems like this is a port conflict where there 
> is overlap between the two tests as one is shutting down and the other is 
> starting up.
>  
> Updated: This is part of the long standing problem with port binding and the 
> imperfection in handling default ports in tests. In this case 41000.
> {code:java}
> org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest 
> > testWithInvalidMemberID FAILED
>     java.lang.AssertionError: 
>     Expecting:
>      <"Member Count : 1
>       Name    | Id
>     --------- | --------------------------------------------------------------
>     locator-0 | 172.17.0.20(locator-0:108:locator)<ec><v0>:41000 [Coordinator]
>     ">
>     to contain:
>      <"locatorToStop"> 
>         at 
> org.apache.geode.test.junit.assertions.CommandResultAssert.containsOutput(CommandResultAssert.java:87)
>         at 
> org.apache.geode.management.internal.cli.commands.StopLocatorCommandDUnitTest.testWithInvalidMemberID(StopLocatorCommandDUnitTest.java:240)
> {code}
> {code:java}
> org.apache.geode.cache30.RegionReliabilityDistNoAckDUnitTest > 
> testLimitedAccess FAILED
>     org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.cache30.RegionReliabilityTestCase$7.run in VM 0 running on 
> Host 07d663f91562 with 4 VMs
>         Caused by:
>         org.apache.geode.distributed.DistributedSystemDisconnectedException: 
> This connection to a distributed system has been disconnected., caused by 
> org.apache.geode.ForcedDisconnectException: Member isn't responding to 
> heartbeat requests
>             Caused by:
>             org.apache.geode.ForcedDisconnectException: Member isn't 
> responding to heartbeat requests
>     java.lang.AssertionError: Suspicious strings were written to the log 
> during this run.
>     Fix the strings or use IgnoredException.addIgnoredException to ignore.
>     -----------------------------------------------------------------------
>     Found suspect string in log4j at line 1125
>     [fatal 2022/01/04 01:04:33.305 GMT <unicast receiver,07d663f91562-47705> 
> tid=100] Membership service failure: Member isn't responding to heartbeat 
> requests
>     
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Member isn't responding to heartbeat requests
>       at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2016)
>       at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1083)
>       at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:686)
>       at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1325)
>       at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1264)
>       at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>       at org.jgroups.JChannel.up(JChannel.java:741)
>       at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>       at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>       at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>       at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>       at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>       at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>       at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>       at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>       at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>       at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>       at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>       at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>       at org.jgroups.protocols.TP.receive(TP.java:1714)
>       at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
>       at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>       at java.lang.Thread.run(Thread.java:748)
>     -----------------------------------------------------------------------
>     Found suspect string in log4j at line 1191
>     [error 2022/01/04 01:04:34.715 GMT <RMI TCP Connection(1)-172.17.0.12> 
> tid=33] Cache initialization for GemFireCache[id = 1852143676; isClosing = 
> false; isShutDownAll = false; created = Tue Jan 04 01:04:20 GMT 2022; server 
> = false; copyOnRead = false; lockLease = 120; lockTimeout = 60] failed 
> because: org.apache.geode.distributed.DistributedSystemDisconnectedException: 
> This connection to a distributed system has been disconnected., caused by 
> org.apache.geode.ForcedDisconnectException: Member isn't responding to 
> heartbeat requests
>     -----------------------------------------------------------------------
>     Found suspect string in log4j at line 1195
>     [error 2022/01/04 01:04:34.739 GMT <RMI TCP Connection(1)-172.17.0.12> 
> tid=33] org.apache.geode.distributed.DistributedSystemDisconnectedException: 
> This connection to a distributed system has been disconnected., caused by 
> org.apache.geode.ForcedDisconnectException: Member isn't responding to 
> heartbeat requests
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to