[ 
https://issues.apache.org/jira/browse/HDDS-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HDDS-7738:
---------------------------------
    Labels: pull-request-available  (was: )

> SCM terminates when adding container to a closed pipeline
> ---------------------------------------------------------
>
>                 Key: HDDS-7738
>                 URL: https://issues.apache.org/jira/browse/HDDS-7738
>             Project: Apache Ozone
>          Issue Type: Bug
>            Reporter: Duong
>            Assignee: Duong
>            Priority: Critical
>              Labels: pull-request-available
>
> This is similar to HDDS-5843, but in a different scenario.
>  
> An Ozone customer encountered this issue after a container (c1) is allocated 
> with a newly created pipeline (p1). The chain of events is as follows:
>  # SCM processes pipeline creation transaction *p1* => *p1* is {*}created{*}.
>  # SCM received a request to close p1 from a data node (see the previous 
> comment)
> => *p1* is {*}closed{*}.
> => SCM also tried to find and close relevant containers, at this point, 
> container *c1* doesn't *exist* yet, so it {*}can't be closed{*}.
>  # SCM processes the container *c1* allocation transaction => failed because 
> *p1* is *closed* already.
> => SCM terminates and both transactions #1 and #3 are not committed (as Ratis 
> commits transactions in chunks).
> Because the transactions are not committed, whenever SCM restarts, it got 
> through the same step #1 and #3 and terminates again.
> Solution: SCM should not terminate when adding a container with a closed 
> pipeline. The fix is similar to HDDS-5843.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to