Nandakumar created HDDS-9479:
--------------------------------

             Summary: Pipeline close doesn't wait for containers to be closed
                 Key: HDDS-9479
                 URL: https://issues.apache.org/jira/browse/HDDS-9479
             Project: Apache Ozone
          Issue Type: Improvement
          Components: SCM
    Affects Versions: 1.3.0
            Reporter: Nandakumar
            Assignee: Nandakumar


Whenever we close a pipeline, we have the option of allowing some grace time 
for its containers to be closed. The grace time is configured using 
_ozone.scm.pipeline.destroy.timeout_.  
We wait for this timeout to elapse before we go ahead and delete the pipeline. 
This gives the datanodes enough time to close the containers gracefully and 
prevents the containers from moving to the QUASI_CLOSED state.

This functionality is broken: we don't wait for the timeout to elapse before 
we delete the Pipeline. This creates a lot of QUASI_CLOSED containers in the 
cluster when a node goes stale or when a datanode is being decommissioned.

This has to be fixed: we should wait for the configured amount of time 
before we delete the Pipeline, so that datanodes have enough time to CLOSE 
the containers on that pipeline.
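
The intended behaviour can be sketched roughly as follows. This is only an 
illustration of the "close first, delete after the grace period" idea, not the 
actual SCM code: the class PipelineCloseSketch and its methods closePipeline, 
scrub and exists are hypothetical names, and the timeout is passed in directly 
rather than read from _ozone.scm.pipeline.destroy.timeout_.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch; not the actual SCM pipeline manager code. */
public class PipelineCloseSketch {
  // Grace period, standing in for ozone.scm.pipeline.destroy.timeout.
  private final Duration destroyTimeout;
  // Pipeline id -> time at which the pipeline was marked closed.
  private final Map<String, Instant> closedAt = new HashMap<>();

  public PipelineCloseSketch(Duration destroyTimeout) {
    this.destroyTimeout = destroyTimeout;
  }

  /**
   * Step 1: mark the pipeline closed but keep it around, so datanodes
   * still have time to close its containers. Repeated calls keep the
   * original close time.
   */
  public void closePipeline(String pipelineId, Instant now) {
    closedAt.putIfAbsent(pipelineId, now);
  }

  /**
   * Step 2 (run periodically): delete only those closed pipelines whose
   * grace period has fully elapsed. Returns the ids that were deleted.
   */
  public List<String> scrub(Instant now) {
    List<String> removed = new ArrayList<>();
    closedAt.entrySet().removeIf(entry -> {
      Instant deadline = entry.getValue().plus(destroyTimeout);
      boolean expired = !now.isBefore(deadline);
      if (expired) {
        removed.add(entry.getKey());
      }
      return expired;
    });
    return removed;
  }

  /** True while the pipeline is closed but not yet deleted. */
  public boolean exists(String pipelineId) {
    return closedAt.containsKey(pipelineId);
  }
}
```

With an arbitrary 60 second timeout, a scrub 30 seconds after closePipeline 
leaves the pipeline in place, and a scrub at 60 seconds or later deletes it. 
The bug described above corresponds to deleting the pipeline immediately, 
without this deadline check.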





--
This message was sent by Atlassian Jira
(v8.20.10#820010)

