[ 
https://issues.apache.org/jira/browse/HDDS-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17620304#comment-17620304
 ] 

mingchao zhao edited comment on HDDS-7341 at 10/19/22 12:09 PM:
----------------------------------------------------------------

This looks like a necessary bug fix for EC. I suggest we cherry-pick this to 
[release branch|https://github.com/apache/ozone/tree/ozone-1.3]. Let me do this.


was (Author: micahzhao):
This looks like a necessary bug fix for EC. I suggest we cherry-pick this to 
[release branch|https://github.com/apache/ozone/tree/ozone-1.3].

> EC: Close pipelines with unregistered nodes
> -------------------------------------------
>
>                 Key: HDDS-7341
>                 URL: https://issues.apache.org/jira/browse/HDDS-7341
>             Project: Apache Ozone
>          Issue Type: Sub-task
>          Components: SCM
>            Reporter: Stephen O'Donnell
>            Assignee: Stephen O'Donnell
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.4.0
>
>
> A datanode is stopped and before the stale node handler is triggered, SCM is 
> restarted. When SCM restarts its loads all the only pipelines and nodes from 
> RocksDB, and then all the nodes will register again.
> In the case of EC pipelines, there is nothing to trigger the close of a 
> pipeline (and the containers on it) except:
> 1. The Container getting full and the DN triggering the close
> 2. The stale / dead node handlers noticing a node on it has gone dead.
> In the case above, the EC pipeline will sit forever in an Open state, but any 
> attempt to write to it will likely result in errors on the client due to one 
> of the nodes not being available. These errors still will not trigger it to 
> close.
> A solution to this problem, is to add logic to the pipeline scrubber to close 
> any pipelines that have unregistered nodes. Stale / Dead nodes should be 
> handled by the existing stale / dead node handlers.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to