[ 
https://issues.apache.org/jira/browse/HDDS-13980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sumit Agrawal updated HDDS-13980:
---------------------------------
    Description: 
Follower SCM never comes out of safe mode, as RPC server for DN HB is not 
started at SCM. Its started if certain action like leader change happens, Or 
leader is doing some update to SCM.

 

So, DatanodeProtocolServer can be started immediately when scm starts so that 
DN register/re-register can happen immediately

 

Safemode exit still depends on trigger of notifyTermIndex()



 

Changes for SCM to have quick Safemode rule exist as soon satisfied,
1) DatanodeProtocolServer can be started immediately when scm starts so that DN 
register/re-register can happen immediately on startup.

3) StateMachineReadyRule triggered by StateMachine using refreshAndvalidate() 
all rules, ensure all rules are re-checked, and exit safemode status after 
start machine has flushed / applied all raft log on startup to avoid 
in-progress raft log transaction issue as referred by 
[HDDS-5263](https://issues.apache.org/jira/browse/HDDS-5263).

  was:
Follower SCM never comes out of safe mode, as RPC server for DN HB is not 
started at SCM. Its started if certain action like leader change happens, Or 
leader is doing some update to SCM.

 

So, DatanodeProtocolServer can be started immediately when scm starts so that 
DN register/re-register can happen immediately

 

Safemode exit still depends on trigger of notifyTermIndex()


> SCM start DN protocol server during startup
> -------------------------------------------
>
>                 Key: HDDS-13980
>                 URL: https://issues.apache.org/jira/browse/HDDS-13980
>             Project: Apache Ozone
>          Issue Type: Improvement
>            Reporter: Sumit Agrawal
>            Assignee: Sumit Agrawal
>            Priority: Major
>
> Follower SCM never comes out of safe mode, as RPC server for DN HB is not 
> started at SCM. Its started if certain action like leader change happens, Or 
> leader is doing some update to SCM.
>  
> So, DatanodeProtocolServer can be started immediately when scm starts so that 
> DN register/re-register can happen immediately
>  
> Safemode exit still depends on trigger of notifyTermIndex()
>  
> Changes for SCM to have quick Safemode rule exist as soon satisfied,
> 1) DatanodeProtocolServer can be started immediately when scm starts so that 
> DN register/re-register can happen immediately on startup.
> 3) StateMachineReadyRule triggered by StateMachine using refreshAndvalidate() 
> all rules, ensure all rules are re-checked, and exit safemode status after 
> start machine has flushed / applied all raft log on startup to avoid 
> in-progress raft log transaction issue as referred by 
> [HDDS-5263](https://issues.apache.org/jira/browse/HDDS-5263).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to