Hello Team,
I faced a strange issue today where suspected member was forcefully
disconnected from distributed system due to lack of heartbeats.
But later on after so many attempts,
Ø Suspected member completely disconnected [DistributionManager closed]
Ø Attempted to reconnect to distributed system by starting membership
services, Jgroup channel, Gemfire P2P listener
Ø Failed due to following exception
org.apache.geode.security.AuthenticationRequiredException: Failed to find
credentials from [host001(event-server-1:3525)<ec>:1026]]
This looks very strange or may be issue. It was part of distributed system
using username/password and security manager from beginning. But after force
disconnection, when it attempted to join distributed system back, it was unable
to find credentials itself.
Could someone help me to validate it?
More,
Even though server was no more part of distributed system after all these
events, spring boot geode app was still running.
Shouldn't that also be stopped? I had to manually kill that for rectifying this.
I have attached detailed logs as well.
Topology
Version: Geode 1.1.1
Java: JDK 8
Platform: Red Hat Enterprise Linux
Host1:
Locator1
EventServer1 [Group = Events]
HopServer1 [Group = Hops]
Host2:
Locatro2
EventServer2 [Group = Events]
HopServer2 [Group = Hops]
Thanks,
Dharam
This message is confidential and subject to terms at:
http://www.jpmorgan.com/emaildisclaimer including on confidentiality, legal
privilege, viruses and monitoring of electronic messages. If you are not the
intended recipient, please delete this message and notify the sender
immediately. Any unauthorized use is strictly prohibited.
[info 2018/04/10 02:39:19.324 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] received suspect message from
host002(event-server-2:1487)<ec><v31>:1026 for host001(Locator1:1134
:locator)<ec><v16>:1024: Member isn't responding to heartbeat requests
[warning 2018/04/10 02:39:20.688 EDT event-server-1 <Management Task> tid=0x58]
Attempting TCP/IP reconnect to host002(Locator2:3605:locator)<ec><v29>:1024
[warning 2018/04/10 02:39:20.707 EDT event-server-1 <P2P message reader for
host002(event-server-2:1487)<ec><v31>:1026 shared ordered uid=8 port=43352>
tid=0x189] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[warning 2018/04/10 02:39:33.778 EDT event-server-1 <Cache Server Load Polling
Thread> tid=0xf5] Attempting TCP/IP reconnect to
host002(Locator2:3605:locator)<ec><v29>:1024
[warning 2018/04/10 02:39:33.980 EDT event-server-1 <Queue Removal Thread>
tid=0x102] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[warning 2018/04/10 02:39:35.790 EDT event-server-1 <ServerConnection on port
40404 Thread 10> tid=0x124] 15 seconds have elapsed while waiting for replies:
<DistributedCacheOperation$CacheOperationReplyProcessor 513164 waiting for 1
replies from [host002(event-server-2:1487)<ec><v31>:1026]> on
host001(event-server-1:3525)<ec><v18>:1026 whose current membership list is:
[[host001(event-server-1:3525)<ec><v18>:1026,
host001(Locator1:1134:locator)<ec><v16>:1024,
host002(event-server-2:1487)<ec><v31>:1026,
host002(hp-server-2:1418)<ec><v30>:1025,
host002(Locator2:3605:locator)<ec><v29>:1024]]
[info 2018/04/10 02:39:36.837 EDT event-server-1 <P2P message reader for
host002(Locator2:3605:locator)<ec><v29>:1024 shared unordered uid=1 port=42338>
tid=0x17c] Performing final check for suspect member
host002(Locator2:3605:locator)<ec><v29>:1024 reason=member unexpectedly shut
down shared, unordered connection
[info 2018/04/10 02:39:36.851 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] Membership received a request to remove
host001(Locator1:1134:locator)<ec><v16>:1024 from
host002(Locator2:3605:locator)<ec><v29>:1024 reason=Member isn't responding to
heartbeat requests
[info 2018/04/10 02:39:36.852 EDT event-server-1 <ServerConnection on port
40404 Thread 2> tid=0x103] Server connection from
[identity(host002(1634:loner):34942:6531c5a4,connection=1; port=44006]:
connection disconnect detected by EOF.
[info 2018/04/10 02:39:37.365 EDT event-server-1 <P2P message reader for
host002(hp-server-2:1418)<ec><v30>:1025 shared unordered uid=1 port=43112>
tid=0x181] Performing final check for suspect member
host002(hp-server-2:1418)<ec><v30>:1025 reason=member unexpectedly shut down
shared, unordered connection
[info 2018/04/10 02:39:37.301 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] This member is becoming the membership
coordinator with address host001(event-server-1:3525)<ec><v18>:1026
[info 2018/04/10 02:39:37.281 EDT event-server-1 <Cache Server Load Polling
Thread> tid=0xf5] Successfully reconnected to member
host002(Locator2:3605:locator)<ec><v29>:1024
[info 2018/04/10 02:39:37.276 EDT event-server-1 <Management Task> tid=0x58]
Successfully reconnected to member host002(Locator2:3605:locator)<ec><v29>:1024
[info 2018/04/10 02:39:37.772 EDT event-server-1 <P2P message reader for
host002(event-server-2:1487)<ec><v31>:1026 shared ordered uid=8 port=43352>
tid=0x189] Successfully reconnected to member
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:37.820 EDT event-server-1 <P2P message reader for
host002(event-server-2:1487)<ec><v31>:1026 shared unordered uid=1 port=43268>
tid=0x186] Performing final check for suspect member
host002(event-server-2:1487)<ec><v31>:1026 reason=member unexpectedly shut down
shared, unordered connection
[info 2018/04/10 02:39:37.821 EDT event-server-1 <Geode Failure Detection
thread 19> tid=0x207] Performing final check for suspect member
host001(Locator1:1134:locator)<ec><v16>:1024 reason=Member isn't responding to
heartbeat requests
[warning 2018/04/10 02:39:37.930 EDT event-server-1 <ServerConnection on port
40404 Thread 2> tid=0x103] ClientHealthMonitor: Unregistering client with
member id identity(host002(1634:loner):34942:6531c5a4,connection=1 due to: The
connection has been reset while reading the header
[info 2018/04/10 02:39:37.930 EDT event-server-1 <Queue Removal Thread>
tid=0x102] Successfully reconnected to member
host002(event-server-2:1487)<ec><v31>:1026
[warning 2018/04/10 02:39:38.039 EDT event-server-1 <Queue Removal Thread>
tid=0x102] Attempting TCP/IP reconnect to
host002(hp-server-2:1418)<ec><v30>:1025
[info 2018/04/10 02:39:38.725 EDT event-server-1 <Queue Removal Thread>
tid=0x102] Successfully reconnected to member
host002(hp-server-2:1418)<ec><v30>:1025
[info 2018/04/10 02:39:38.769 EDT event-server-1 <Geode Failure Detection
thread 16> tid=0x1fa] received suspect message from
host001(event-server-1:3525)<ec><v18>:1026 for
host002(event-server-2:1487)<ec><v31>:1026: Member isn't responding to
heartbeat requests
[info 2018/04/10 02:39:38.830 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] Server connection from
[identity(host001(8115:loner):50832:190583a4,connection=1; port=47752]:
connection disconnect detected by EOF.
[warning 2018/04/10 02:39:38.831 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] ClientHealthMonitor: Unregistering client with
member id identity(host001(8115:loner):50832:190583a4,connection=1 due to: The
connection has been reset while reading the header
[warning 2018/04/10 02:39:39.215 EDT event-server-1 <Management Task> tid=0x58]
Attempting TCP/IP reconnect to host002(Locator2:3605:locator)<ec><v29>:1024
[info 2018/04/10 02:39:39.265 EDT event-server-1 <P2P message reader for
host002(event-server-2:1487)<ec><v31>:1026 shared unordered uid=1 port=43268>
tid=0x186] Final check passed for suspect member
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:39.269 EDT event-server-1 <P2P message reader for
host002(Locator2:3605:locator)<ec><v29>:1024 shared unordered uid=1 port=42338>
tid=0x17c] Final check passed for suspect member
host002(Locator2:3605:locator)<ec><v29>:1024
[info 2018/04/10 02:39:39.265 EDT event-server-1 <P2P message reader for
host002(hp-server-2:1418)<ec><v30>:1025 shared unordered uid=1 port=43112>
tid=0x181] Final check passed for suspect member
host002(hp-server-2:1418)<ec><v30>:1025
[warning 2018/04/10 02:39:39.469 EDT event-server-1 <Cache Server Load Polling
Thread> tid=0xf5] Attempting TCP/IP reconnect to
host002(Locator2:3605:locator)<ec><v29>:1024
[warning 2018/04/10 02:39:39.484 EDT event-server-1 <Queue Removal Thread>
tid=0x102] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:39.484 EDT event-server-1 <Geode Failure Detection
thread 19> tid=0x207] Final check passed for suspect member
host001(Locator1:1134:locator)<ec><v16>:1024
[warning 2018/04/10 02:39:43.118 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[severe 2018/04/10 02:39:43.301 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] This member is no longer in the membership
view. My ID is host001(event-server-1:3525)<ec><v18>:1026 and the new view is
View[host002(Locator2:3605:locator)<ec><v29>:1024|38] members:
[host002(Locator2:3605:locator)<ec><v29>:1024,
host002(hp-server-2:1418)<ec><v30>:1025{lead},
host002(event-server-2:1487)<ec><v31>:1026] crashed:
[host001(Locator1:1134:locator)<ec><v16>:1024,
host001(event-server-1:3525)<ec><v18>:1026]
[info 2018/04/10 02:39:43.325 EDT event-server-1 <Geode Membership View
Creator> tid=0x20d] View Creator thread is starting
[warning 2018/04/10 02:39:43.434 EDT event-server-1 <Management Task> tid=0x58]
Attempting TCP/IP reconnect to host002(Locator2:3605:locator)<ec><v29>:1024
[info 2018/04/10 02:39:43.544 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] Successfully reconnected to member
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:43.614 EDT event-server-1 <Geode Membership View
Creator> tid=0x20d] preparing new view
View[host001(event-server-1:3525)<ec><v18>:1026|46] members:
[host001(event-server-1:3525)<ec><v18>:1026{lead},
host002(Locator2:3605:locator)<ec><v29>:1024,
host002(hp-server-2:1418)<ec><v30>:1025,
host002(event-server-2:1487)<ec><v31>:1026] crashed:
[host001(Locator1:1134:locator)<ec><v16>:1024]
failure detection ports: 12988 44464 27210 21079
[warning 2018/04/10 02:39:43.658 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:43.841 EDT event-server-1 <ServerConnection on port
40404 Thread 6> tid=0x10c] Successfully reconnected to member
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:43.993 EDT event-server-1 <Management Task> tid=0x58]
Successfully reconnected to member host002(Locator2:3605:locator)<ec><v29>:1024
[warning 2018/04/10 02:39:43.992 EDT event-server-1 <ServerConnection on port
40404 Thread 7> tid=0x10d] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:44.016 EDT event-server-1 <ServerConnection on port
40404 Thread 7> tid=0x10d] Ending reconnect attempt to
host002(event-server-2:1487)<ec><v31>:1026 because shutdown has started.
[warning 2018/04/10 02:39:44.089 EDT event-server-1 <ServerConnection on port
40404 Thread 2> tid=0x103] Attempting TCP/IP reconnect to
host002(event-server-2:1487)<ec><v31>:1026
[info 2018/04/10 02:39:44.089 EDT event-server-1 <ServerConnection on port
40404 Thread 2> tid=0x103] Ending reconnect attempt to
host002(event-server-2:1487)<ec><v31>:1026 because shutdown has started.
[info 2018/04/10 02:39:44.199 EDT event-server-1 <Queue Removal Thread>
tid=0x102] The QueueRemovalThread is done.
[severe 2018/04/10 02:39:44.379 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] Membership service failure: This node is no
longer in the membership view
org.apache.geode.ForcedDisconnectException: This node is no longer in the
membership view
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.forceDisconnect(GMSMembershipManager.java:2520)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:998)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processViewMessage(GMSJoinLeave.java:984)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1690)
at
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1286)
at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
at org.jgroups.JChannel.up(JChannel.java:741)
at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1070)
at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:785)
at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:426)
at
org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:74)
at
org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:72)
at org.jgroups.protocols.TP.passMessageUp(TP.java:1601)
at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1817)
at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1729)
at org.jgroups.protocols.TP.receive(TP.java:1654)
at
org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
at java.lang.Thread.run(Thread.java:745)
[info 2018/04/10 02:39:44.683 EDT event-server-1 <unicast
receiver,host001-11901> tid=0x33] CacheServer configuration saved
[info 2018/04/10 02:39:44.848 EDT event-server-1 <Geode Membership View
Creator> tid=0x20d] finished waiting for responses to view preparation
[info 2018/04/10 02:39:45.815 EDT event-server-1 <DisconnectThread> tid=0x217]
Stopping membership services
[info 2018/04/10 02:39:45.966 EDT event-server-1 <DisconnectThread> tid=0x217]
GMSHealthMonitor server socket is closed in stopServices().
[warning 2018/04/10 02:39:46.148 EDT event-server-1 <ServerConnection on port
40404 Thread 2> tid=0x103] Server connection from
[identity(host001(8115:loner):50832:190583a4,connection=2; port=49136]:
Unexpected Exception
org.apache.geode.distributed.DistributedSystemDisconnectedException:
DistributedSystem is shutting down, caused by
org.apache.geode.ForcedDisconnectException: This node is no longer in the
membership view
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.directChannelSend(GMSMembershipManager.java:1700)
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.send(GMSMembershipManager.java:1875)
at
org.apache.geode.distributed.internal.DistributionChannel.send(DistributionChannel.java:82)
at
org.apache.geode.distributed.internal.DistributionManager.sendOutgoing(DistributionManager.java:3416)
at
org.apache.geode.distributed.internal.DistributionManager.sendMessage(DistributionManager.java:3453)
at
org.apache.geode.distributed.internal.DistributionManager.putOutgoing(DistributionManager.java:1832)
at
org.apache.geode.internal.cache.DistributedCacheOperation.distribute(DistributedCacheOperation.java:505)
at
org.apache.geode.internal.cache.DistributedRegion.distributeDestroy(DistributedRegion.java:1701)
at
org.apache.geode.internal.cache.DistributedRegion.basicDestroyPart3(DistributedRegion.java:1692)
at
org.apache.geode.internal.cache.AbstractRegionMap.destroy(AbstractRegionMap.java:1519)
at
org.apache.geode.internal.cache.LocalRegion.mapDestroy(LocalRegion.java:6778)
at
org.apache.geode.internal.cache.LocalRegion.mapDestroy(LocalRegion.java:6755)
at
org.apache.geode.internal.cache.LocalRegionDataView.destroyExistingEntry(LocalRegionDataView.java:55)
at
org.apache.geode.internal.cache.LocalRegion.basicDestroy(LocalRegion.java:6717)
at
org.apache.geode.internal.cache.DistributedRegion.basicDestroy(DistributedRegion.java:1662)
at
org.apache.geode.internal.cache.LocalRegion.basicBridgeDestroy(LocalRegion.java:5614)
at
org.apache.geode.internal.cache.tier.sockets.command.Destroy65.cmdExecute(Destroy65.java:239)
at
org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:141)
at
org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMsg(ServerConnection.java:783)
at
org.apache.geode.internal.cache.tier.sockets.ServerConnection.doOneMessage(ServerConnection.java:914)
at
org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1138)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at
org.apache.geode.internal.cache.tier.sockets.AcceptorImpl$1$1.run(AcceptorImpl.java:519)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.geode.ForcedDisconnectException: This node is no longer
in the membership view
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.forceDisconnect(GMSMembershipManager.java:2520)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:998)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processViewMessage(GMSJoinLeave.java:984)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1690)
at
org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1286)
at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
at org.jgroups.JChannel.up(JChannel.java:741)
at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1070)
at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:785)
at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:426)
at
org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:74)
at
org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:72)
at org.jgroups.protocols.TP.passMessageUp(TP.java:1601)
at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1817)
at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1729)
at org.jgroups.protocols.TP.receive(TP.java:1654)
at
org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
... 1 more
[info 2018/04/10 02:39:46.400 EDT event-server-1 <Geode Failure Detection
Server thread 0> tid=0x37] GMSHealthMonitor server thread exiting
[info 2018/04/10 02:39:46.419 EDT event-server-1 <DisconnectThread> tid=0x217]
GMSHealthMonitor serverSocketExecutor is terminated
[info 2018/04/10 02:39:48.577 EDT event-server-1 <Cache Server Selector
/0.0.0.0:40404 local port: 40404> tid=0xfa] Cache server on port 40,404 is
shutting down.
[info 2018/04/10 02:39:49.200 EDT event-server-1 <ReconnectThread> tid=0x217]
Disconnecting old DistributedSystem to prepare for a reconnect attempt
[info 2018/04/10 02:39:49.757 EDT event-server-1 <ReconnectThread> tid=0x217]
GemFireCache[id = 1710483461; isClosing = true; isShutDownAll = false; created
= Sun Apr 08 05:04:20 EDT 2018; server = false; copyOnRead = false; lockLease =
120; lockTimeout = 60]: Now closing.
[info 2018/04/10 02:39:56.831 EDT event-server-1 <ReconnectThread> tid=0x217]
Created oplog#2 krf for disk store events_disk_store.
[info 2018/04/10 02:39:56.847 EDT event-server-1 <ReconnectThread> tid=0x217]
Created oplog#2 krf for disk store pdx_metadata_diskstore.
[info 2018/04/10 02:39:58.157 EDT event-server-1 <ReconnectThread> tid=0x217]
Rest Server on port 9,301 is shutting down
[info 2018/04/10 02:39:58.176 EDT event-server-1 <ReconnectThread> tid=0x217]
Stopping the HTTP service...
[info 2018/04/10 02:39:59.229 EDT event-server-1 <ReconnectThread> tid=0x217]
Shutting down DistributionManager host001(event-server-1:3525)<ec><v18>:1026.
At least one Exception occurred.
[info 2018/04/10 02:39:59.420 EDT event-server-1 <ReconnectThread> tid=0x217]
Now closing distribution for host001(event-server-1:3525)<ec><v18>:1026
[info 2018/04/10 02:39:59.423 EDT event-server-1 <ReconnectThread> tid=0x217]
DistributionManager stopped in 193ms.
[info 2018/04/10 02:39:59.423 EDT event-server-1 <ReconnectThread> tid=0x217]
Marking DistributionManager host001(event-server-1:3525)<ec><v18>:1026 as
closed.
[info 2018/04/10 02:40:59.470 EDT event-server-1 <ReconnectThread> tid=0x217]
Startup Configuration:
### GemFire Properties defined with api ###
ack-severe-alert-threshold=0
ack-wait-threshold=15
archive-disk-space-limit=0
archive-file-size-limit=0
async-distribution-timeout=0
async-max-queue-size=8
async-queue-timeout=60000
bind-address=
cache-xml-file=cache.xml
cluster-configuration-dir=
cluster-ssl-ciphers=any
cluster-ssl-enabled=false
cluster-ssl-keystore=
cluster-ssl-keystore-password=********
cluster-ssl-keystore-type=
cluster-ssl-protocols=any
cluster-ssl-require-authentication=true
cluster-ssl-truststore=
cluster-ssl-truststore-password=********
conflate-events=server
conserve-sockets=true
delta-propagation=true
deploy-working-dir=/home/geode/deploy
disable-auto-reconnect=false
disable-tcp=false
distributed-system-id=-1
distributed-transactions=false
durable-client-id=
durable-client-timeout=300
enable-cluster-configuration=true
enable-network-partition-detection=false
enable-time-statistics=false
enforce-unique-host=false
gateway-ssl-ciphers=any
gateway-ssl-enabled=false
gateway-ssl-keystore=
gateway-ssl-keystore-password=********
gateway-ssl-keystore-type=
gateway-ssl-protocols=any
gateway-ssl-require-authentication=true
gateway-ssl-truststore=
gateway-ssl-truststore-password=********
groups=INSTRUMENTS
http-service-bind-address=host001
http-service-port=9301
http-service-ssl-ciphers=any
http-service-ssl-enabled=false
http-service-ssl-keystore=
http-service-ssl-keystore-password=********
http-service-ssl-keystore-type=
http-service-ssl-protocols=any
http-service-ssl-require-authentication=false
http-service-ssl-truststore=
http-service-ssl-truststore-password=********
jmx-manager=false
jmx-manager-access-file=
jmx-manager-bind-address=
jmx-manager-hostname-for-clients=
jmx-manager-http-port=9301
jmx-manager-password-file=********
jmx-manager-port=1099
jmx-manager-ssl-ciphers=any
jmx-manager-ssl-enabled=false
jmx-manager-ssl-keystore=
jmx-manager-ssl-keystore-password=********
jmx-manager-ssl-keystore-type=
jmx-manager-ssl-protocols=any
jmx-manager-ssl-require-authentication=true
jmx-manager-ssl-truststore=
jmx-manager-ssl-truststore-password=********
jmx-manager-start=false
jmx-manager-update-rate=2000
load-cluster-configuration-from-dir=false
locator-wait-time=0
locators=host001.company.com[10334],host002.company.com[10334]
lock-memory=false
log-disk-space-limit=2048
log-file=/home/logs/current/event-server-1/event-server-1.log
log-file-size-limit=100
log-level=info
max-num-reconnect-tries=3
max-wait-time-reconnect=60000
mcast-address=239.192.81.1
mcast-flow-control=1048576, 0.25, 5000
mcast-port=0
mcast-recv-buffer-size=1048576
mcast-send-buffer-size=65535
mcast-ttl=32
member-timeout=5000
membership-port-range=1024-65535
memcached-bind-address=
memcached-port=0
memcached-protocol=ASCII
name=event-server-1
off-heap-memory-size=
redis-bind-address=
redis-password=********
redis-port=0
redundancy-zone=
remote-locators=
remove-unresponsive-client=false
roles=
security-client-accessor=********
security-client-accessor-pp=********
security-client-auth-init=********
security-client-authenticator=********
security-client-dhalgo=********
security-log-file=********
security-log-level=********
security-manager=********
security-peer-auth-init=********
security-peer-authenticator=********
security-peer-verifymember-timeout=********
security-post-processor=********
security-udp-dhalgo=********
server-bind-address=
server-ssl-ciphers=any
server-ssl-enabled=false
server-ssl-keystore=
server-ssl-keystore-password=********
server-ssl-keystore-type=
server-ssl-protocols=any
server-ssl-require-authentication=true
server-ssl-truststore=
server-ssl-truststore-password=********
socket-buffer-size=32768
socket-lease-time=60000
ssl-ciphers=any
ssl-cluster-alias=
ssl-default-alias=
ssl-enabled-components=
ssl-gateway-alias=
ssl-jmx-alias=
ssl-keystore=
ssl-keystore-password=********
ssl-keystore-type=
ssl-locator-alias=
ssl-protocols=any
ssl-require-authentication=true
ssl-server-alias=
ssl-truststore=
ssl-truststore-password=********
ssl-web-alias=
ssl-web-require-authentication=false
start-dev-rest-api=true
start-locator=
statistic-archive-file=
statistic-sample-rate=1000
statistic-sampling-enabled=true
tcp-port=0
udp-fragment-size=60000
udp-recv-buffer-size=1048576
udp-send-buffer-size=65535
use-cluster-configuration=true
user-command-packages=
[info 2018/04/10 02:40:59.426 EDT event-server-1 <ReconnectThread> tid=0x217]
Attempting to reconnect to the distributed system. This is attempt #1.
[info 2018/04/10 02:40:59.489 EDT event-server-1 <ReconnectThread> tid=0x217]
Starting membership services
[info 2018/04/10 02:40:59.497 EDT event-server-1 <ReconnectThread> tid=0x217]
JGroups channel reinitialized (took 8ms)
[info 2018/04/10 02:40:59.499 EDT event-server-1 <ReconnectThread> tid=0x217]
GemFire P2P Listener started on host001.company.com/169.87.179.47:34532
[info 2018/04/10 02:40:59.505 EDT event-server-1 <Geode Failure Detection
Server thread 0> tid=0x232] Started failure detection server thread on
/169.87.179.47:62294.
[info 2018/04/10 02:40:59.516 EDT event-server-1 <ReconnectThread> tid=0x217]
Attempting to join the distributed system through coordinator
host002(Locator2:3605:locator)<ec><v29>:1024 using address
host001(event-server-1:3525)<ec>:1026
[info 2018/04/10 02:40:59.522 EDT event-server-1 <ReconnectThread> tid=0x217]
Stopping membership services
[info 2018/04/10 02:40:59.534 EDT event-server-1 <ReconnectThread> tid=0x217]
GMSHealthMonitor server socket is closed in stopServices().
[info 2018/04/10 02:40:59.534 EDT event-server-1 <Geode Failure Detection
Server thread 0> tid=0x232] GMSHealthMonitor server thread exiting
[info 2018/04/10 02:40:59.534 EDT event-server-1 <ReconnectThread> tid=0x217]
GMSHealthMonitor serverSocketExecutor is terminated
[warning 2018/04/10 02:40:59.541 EDT event-server-1 <ReconnectThread>
tid=0x217] Exception occurred while trying to connect the system during
reconnect
org.apache.geode.security.AuthenticationRequiredException: Failed to find
credentials from [host001(event-server-1:3525)<ec>:1026]
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.attemptToJoin(GMSJoinLeave.java:424)
at
org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.join(GMSJoinLeave.java:318)
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.join(GMSMembershipManager.java:656)
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.joinDistributedSystem(GMSMembershipManager.java:745)
at
org.apache.geode.distributed.internal.membership.gms.Services.start(Services.java:181)
at
org.apache.geode.distributed.internal.membership.gms.GMSMemberFactory.newMembershipManager(GMSMemberFactory.java:102)
at
org.apache.geode.distributed.internal.membership.MemberFactory.newMembershipManager(MemberFactory.java:89)
at
org.apache.geode.distributed.internal.DistributionManager.<init>(DistributionManager.java:1112)
at
org.apache.geode.distributed.internal.DistributionManager.<init>(DistributionManager.java:1160)
at
org.apache.geode.distributed.internal.DistributionManager.create(DistributionManager.java:531)
at
org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:687)
at
org.apache.geode.distributed.internal.InternalDistributedSystem.newInstance(InternalDistributedSystem.java:299)
at
org.apache.geode.distributed.DistributedSystem.connect(DistributedSystem.java:202)
at
org.apache.geode.distributed.internal.InternalDistributedSystem.reconnect(InternalDistributedSystem.java:2675)
at
org.apache.geode.distributed.internal.InternalDistributedSystem.tryReconnect(InternalDistributedSystem.java:2508)
at
org.apache.geode.distributed.internal.InternalDistributedSystem.disconnect(InternalDistributedSystem.java:983)
at
org.apache.geode.distributed.internal.DistributionManager$MyListener.membershipFailure(DistributionManager.java:4307)
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.uncleanShutdown(GMSMembershipManager.java:1530)
at
org.apache.geode.distributed.internal.membership.gms.mgr.GMSMembershipManager.lambda$forceDisconnect$0(GMSMembershipManager.java:2550)
at java.lang.Thread.run(Thread.java:745)