Shashikant Banerjee created HDDS-5062:
-----------------------------------------

             Summary: Add a config to bypass clusterId validation for 
bootstrapping SCM
                 Key: HDDS-5062
                 URL: https://issues.apache.org/jira/browse/HDDS-5062
             Project: Apache Ozone
          Issue Type: Bug
          Components: SCM HA
            Reporter: Shashikant Banerjee
            Assignee: Shashikant Banerjee
             Fix For: 1.2.0


In SCM HA, the primary node starts the Ratis server, and the other 
bootstrapping nodes are then added to the Ratis group. If all the 
bootstrapped SCMs are stopped, the primary node steps down from 
leadership because it loses its majority. When those nodes are 
bootstrapped again, each one first tries to validate its persisted 
cluster id against the cluster id held by the leader SCM, but since no 
leader exists, bootstrapping keeps failing and retrying until the node 
shuts down.
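
The loss of leadership follows from simple majority arithmetic. The sketch below is illustrative only: the class and method names are hypothetical, and the group size of three (one primary plus two bootstrapped SCMs) is an assumption, not something stated in this issue.

```java
public class QuorumSketch {
    // Smallest number of peers that forms a strict majority of the group.
    static int majority(int groupSize) {
        return groupSize / 2 + 1;
    }

    // A leader can only be elected, or stay leader, while a majority is reachable.
    static boolean canHaveLeader(int groupSize, int livePeers) {
        return livePeers >= majority(groupSize);
    }

    public static void main(String[] args) {
        int groupSize = 3;      // assumed: one primary plus two bootstrapped SCMs
        int liveAfterStop = 1;  // only the primary is still running
        // majority(3) is 2, so a single live peer cannot hold leadership.
        System.out.println(canHaveLeader(groupSize, liveAfterStop)); // prints "false"
    }
}
```

With no leader, any step that has to ask the leader for the cluster id cannot complete, which is why the restarted nodes keep retrying.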

The issue is easy to reproduce in Kubernetes deployments, where the 
bootstrap and init commands are run again on every restart.

This Jira aims to add a config that bypasses the cluster id validation 
when a bootstrapping node already has a persisted cluster id.
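
A minimal sketch of the intended decision, assuming a boolean bypass flag and an optional persisted cluster id. None of the names below come from the Ozone code base; the flag, the enum, and the method are hypothetical and only illustrate the rule described above.

```java
import java.util.Optional;

public class BootstrapSketch {
    // Hypothetical outcome of the bootstrap decision.
    enum Action { SKIP_VALIDATION, VALIDATE_WITH_LEADER }

    // bypassEnabled stands in for the new config (hypothetical name).
    // persistedClusterId is empty on a node that has never been bootstrapped.
    static Action decide(boolean bypassEnabled, Optional<String> persistedClusterId) {
        if (bypassEnabled && persistedClusterId.isPresent()) {
            // The node already belongs to a cluster, so it does not need
            // a live leader to confirm the cluster id.
            return Action.SKIP_VALIDATION;
        }
        // First-time bootstrap, or bypass disabled: ask the leader SCM.
        return Action.VALIDATE_WITH_LEADER;
    }

    public static void main(String[] args) {
        // "CID-example" is a made-up cluster id used only for illustration.
        System.out.println(decide(true, Optional.of("CID-example")));
        System.out.println(decide(true, Optional.empty()));
        System.out.println(decide(false, Optional.of("CID-example")));
    }
}
```

In this sketch a node without a persisted cluster id still validates against the leader, so a first-time bootstrap behaves as before even when the bypass is enabled.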



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

