Xu Shao Hong created HDDS-5976:
----------------------------------

             Summary: add raftlog IOException notifier for SCM HA
                 Key: HDDS-5976
                 URL: https://issues.apache.org/jira/browse/HDDS-5976
             Project: Apache Ozone
          Issue Type: Bug
            Reporter: Xu Shao Hong
            Assignee: Xu Shao Hong


Currently, in SCM HA, an IOException in the Raft log leaves the leader stuck in 
an endless loop of timeouts at 
server.submitClientRequestAsync(raftClientRequest). 

Reason:

Once the Ratis server hits an IOException while appending logs, it completes 
the append-log task exceptionally, notifies the state machine through the 
"notifyLogFailed" event, and then stops the tasks that follow in the queue. The 
task is asynchronous and nobody calls "get()" on it, so the exception is never 
observed. SCM only sees timeout exceptions, endlessly.
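
The failure mode above can be reproduced in a small, self-contained Java 
sketch. It does not use the Ratis or Ozone APIs: LogWorker, submitAndWait and 
the 50 ms timeout are hypothetical stand-ins chosen for illustration, and the 
Consumer callback only plays the role that "notifyLogFailed" plays in the 
description. It is one possible shape for a notifier, not the actual fix.

```java
import java.io.IOException;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

public class RaftLogFailureSketch {

  /** Hypothetical stand-in for the log worker; not a Ratis class. */
  static final class LogWorker {
    private final Consumer<Throwable> onLogFailed;

    LogWorker(Consumer<Throwable> onLogFailed) {
      this.onLogFailed = onLogFailed;
    }

    /** Simulates an append that hits a disk error. */
    CompletableFuture<String> submit(String entry) {
      CompletableFuture<Void> appendTask = new CompletableFuture<>();
      IOException cause = new IOException("disk failure while appending " + entry);
      // The append task fails, but nobody ever calls get() on it.
      appendTask.completeExceptionally(cause);
      // Plays the role of the notifyLogFailed event.
      onLogFailed.accept(cause);
      // The queue has stopped, so the client-visible reply never completes.
      return new CompletableFuture<>();
    }
  }

  /** Submits an entry and reports what the caller is able to observe. */
  static String submitAndWait(LogWorker worker,
      AtomicReference<Throwable> failure, String entry) {
    if (failure.get() != null) {
      // A recorded log failure lets later requests fail fast.
      return "rejected: " + failure.get().getMessage();
    }
    CompletableFuture<String> reply = worker.submit(entry);
    if (failure.get() != null) {
      return "failed: " + failure.get().getMessage();
    }
    try {
      return reply.get(50, TimeUnit.MILLISECONDS);
    } catch (TimeoutException e) {
      // Without a notifier this is all the caller ever learns.
      return "timeout";
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return "interrupted";
    } catch (ExecutionException e) {
      return "error: " + e.getCause();
    }
  }

  public static void main(String[] args) {
    // Case 1: the failure event is dropped, so every request times out.
    AtomicReference<Throwable> unused = new AtomicReference<>();
    LogWorker silent = new LogWorker(t -> { });
    System.out.println(submitAndWait(silent, unused, "e1"));
    System.out.println(submitAndWait(silent, unused, "e2"));

    // Case 2: the failure event is recorded, so the real cause surfaces.
    AtomicReference<Throwable> failure = new AtomicReference<>();
    LogWorker notifying = new LogWorker(failure::set);
    System.out.println(submitAndWait(notifying, failure, "e1"));
    System.out.println(submitAndWait(notifying, failure, "e2"));
  }
}
```

In the first case the caller only ever sees "timeout", which matches the 
behaviour described above. In the second case the recorded cause is reported 
on the failing request and later requests are rejected immediately instead of 
waiting for another timeout.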

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
