Surya Mudundi created GEODE-9910:
------------------------------------

             Summary: Failure to auto-reconnect upon network partition
                 Key: GEODE-9910
                 URL: https://issues.apache.org/jira/browse/GEODE-9910
             Project: Geode
          Issue Type: Bug
    Affects Versions: 1.14.0
            Reporter: Surya Mudundi


In a two-node cluster with embedded locators, node-1 experienced a network 
outage for a couple of minutes. After node-1 recovered from the outage, 
node-2 failed to auto-reconnect.

node-2 attempted to reconnect to node-1 and logged:

[org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #1.
[org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #2.
[org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #3.

After 3 attempts, it finally reported the following error:

INFO  [org.apache.geode.logging.internal.LoggingProviderLoader]-[ReconnectThread] [] Using org.apache.geode.logging.internal.SimpleLoggingProvider for service org.apache.geode.logging.internal.spi.LoggingProvider
INFO  [org.apache.geode.internal.InternalDataSerializer]-[ReconnectThread] [] initializing InternalDataSerializer with 0 services
INFO  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] performing a quorum check to see if location services can be started early
INFO  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Quorum check passed - allowing location services to start early
WARN  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Exception occurred while trying to connect the system during reconnect
java.lang.IllegalStateException: A locator can not be created because one already exists in this JVM.
        at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:298) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:273) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.startInitLocator(InternalDistributedSystem.java:916) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:768) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.reconnect(InternalDistributedSystem.java:2605) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.tryReconnect(InternalDistributedSystem.java:2424) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.InternalDistributedSystem.disconnect(InternalDistributedSystem.java:1275) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.ClusterDistributionManager$DMListener.membershipFailure(ClusterDistributionManager.java:2326) ~[geode-core-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.membership.gms.GMSMembership.uncleanShutdown(GMSMembership.java:1187) ~[geode-membership-1.14.0.jar:?]
        at org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.lambda$forceDisconnect$0(GMSMembership.java:1811) ~[geode-membership-1.14.0.jar:?]
        at java.lang.Thread.run(Thread.java:829) [?:?]

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
