[ https://issues.apache.org/jira/browse/GEODE-5429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mark Hanson resolved GEODE-5429.
--------------------------------
    Resolution: Cannot Reproduce

This is a hardware issue where the tests were being run on a heavily loaded machine.
{noformat}
[vm2] [warn 2018/07/10 05:32:50.614 UTC <Thread-308 StatSampler> tid=0x38e] Statistics sampling thread detected a wakeup delay of 36,307 ms, indicating a possible resource issue. Check the GC, memory, and CPU statistics.
[vm2] [info 2018/07/10 05:32:50.616 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] No longer suspecting 172.17.0.4(57:locator)<ec><v0>:32769
[vm2] [info 2018/07/10 05:32:50.618 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] received suspect message from 172.17.0.4(57:locator)<ec><v0>:32769 for 172.17.0.4(158)<v58>:32771: Member isn't responding to heartbeat requests
[vm2] [info 2018/07/10 05:32:50.977 UTC <P2P message reader for 172.17.0.4(57:locator)<ec><v0>:32769 shared unordered uid=31 port=59452> tid=0x389] Performing final check for suspect member 172.17.0.4(57:locator)<ec><v0>:32769 reason=member unexpectedly shut down shared, unordered connection
[vm2] [info 2018/07/10 05:32:51.024 UTC <P2P message reader for 172.17.0.4(57:locator)<ec><v0>:32769 shared unordered uid=31 port=59452> tid=0x389] Final check passed for suspect member 172.17.0.4(57:locator)<ec><v0>:32769
[vm2] [info 2018/07/10 05:32:51.094 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] received suspect message from 172.17.0.4(57:locator)<ec><v0>:32769 for 172.17.0.4(158)<v58>:32771: Member isn't responding to heartbeat requests
[vm2] [info 2018/07/10 05:32:51.108 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] received suspect message from 172.17.0.4(57:locator)<ec><v0>:32769 for 172.17.0.4(153)<v57>:32770: Member isn't responding to heartbeat requests
[vm2] [info 2018/07/10 05:32:51.116 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] received suspect message from 172.17.0.4(57:locator)<ec><v0>:32769 for 172.17.0.4(153)<v57>:32770: Member isn't responding to heartbeat requests
[vm2] [info 2018/07/10 05:32:51.116 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] Membership received a request to remove 172.17.0.4(165)<v59>:32772 from 172.17.0.4(57:locator)<ec><v0>:32769 reason=Failed to acknowledge a new membership view and then failed tcp/ip connection attempt
[vm2] [fatal 2018/07/10 05:32:51.117 UTC <unicast receiver,4bc9c1f29425-44222> tid=0x37a] Membership service failure: Failed to acknowledge a new membership view and then failed tcp/ip connection attempt
[vm2] org.apache.geode.ForcedDisconnectException: Failed to acknowledge a new membership view and then failed tcp/ip connection attempt
[vm2]     at org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.forceDisconnect(GMSMembershipManager.java:2543)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1044)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processRemoveRequest(GMSJoinLeave.java:657)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1790)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1283)
[vm2]     at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
[vm2]     at org.jgroups.JChannel.up(JChannel.java:741)
[vm2]     at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
[vm2]     at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
[vm2]     at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
[vm2]     at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
[vm2]     at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
[vm2]     at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:73)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:72)
[vm2]     at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
[vm2]     at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
[vm2]     at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
[vm2]     at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
[vm2]     at org.jgroups.protocols.TP.receive(TP.java:1714)
[vm2]     at org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:152)
[vm2]     at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
[vm2]     at java.lang.Thread.run(Thread.java:748)
{noformat}

> SUPERFLAKY: Multiple failures from RestAPIsWithSSLDUnitTest
> -----------------------------------------------------------
>
>                 Key: GEODE-5429
>                 URL: https://issues.apache.org/jira/browse/GEODE-5429
>             Project: Geode
>          Issue Type: Task
>            Reporter: Dan Smith
>            Assignee: Mark Hanson
>            Priority: Major
>              Labels: pull-request-available, swat
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Several of the tests in this class failed multiple times in a pass over 200
> runs of DistributedTest.
> {noformat}
> 16 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testSimpleSSLWithMultiKey_KeyStore_WithInvalidClientKey
> (94.02985074626866% success rate)
> 7 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testSSLWithTLSv12ProtocolLegacy
> (97.38805970149254% success rate)
> 5 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testSSLWithoutKeyStoreTypeLegacy
> (98.13432835820896% success rate)
> 2 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testWithMultipleProtocol
> (99.25373134328358% success rate)
> 2 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testSSLWithTLSv11Protocol
> (99.25373134328358% success rate)
> 2 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testSSLWithMultipleCipherSuite
> (99.25373134328358% success rate)
> 1 failures
> org.apache.geode.rest.internal.web.controllers.RestAPIsWithSSLDUnitTest.testWithMultipleProtocolLegacy
> (99.6268656716418% success rate)
> {noformat}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
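
For anyone triaging similar failures: the resolution rests on the StatSampler warning in the [vm2] log, which reports a wakeup delay of 36,307 ms just before the forced disconnect. Below is a minimal sketch of how that warning could be picked out of a DUnit log to separate load-induced failures from real ones. The class name `WakeupDelayScanner`, the method `wakeupDelayMillis`, and the 5,000 ms threshold are all hypothetical and are not part of Geode. The pattern assumes the message wording and comma-grouped number shown in the log above.

```java
import java.util.OptionalLong;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Geode: extracts the delay reported by the
// "Statistics sampling thread detected a wakeup delay" warning.
public class WakeupDelayScanner {

    // Assumes the wording seen in the log above, with comma digit grouping.
    private static final Pattern WAKEUP_DELAY =
        Pattern.compile("wakeup delay of ([\\d,]+) ms");

    // Returns the delay in milliseconds, or empty if the line is not a
    // wakeup-delay warning.
    public static OptionalLong wakeupDelayMillis(String logLine) {
        Matcher m = WAKEUP_DELAY.matcher(logLine);
        if (!m.find()) {
            return OptionalLong.empty();
        }
        // Strip the grouping commas before parsing, e.g. "36,307" -> 36307.
        return OptionalLong.of(Long.parseLong(m.group(1).replace(",", "")));
    }

    public static void main(String[] args) {
        String line = "[vm2] [warn 2018/07/10 05:32:50.614 UTC <Thread-308 StatSampler> "
            + "tid=0x38e] Statistics sampling thread detected a wakeup delay of "
            + "36,307 ms, indicating a possible resource issue.";

        // Arbitrary illustrative threshold; tune it for the environment.
        long thresholdMillis = 5_000;

        OptionalLong delay = wakeupDelayMillis(line);
        if (delay.isPresent() && delay.getAsLong() > thresholdMillis) {
            System.out.println("Likely overloaded host: wakeup delay "
                + delay.getAsLong() + " ms");
        }
    }
}
```

A delay this far above the sampling interval suggests the JVM was starved long enough for heartbeats and view acknowledgements to time out, which matches the ForcedDisconnectException that follows in the log.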