[ 
https://issues.apache.org/jira/browse/HDDS-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bharat Viswanadham resolved HDDS-3072.
--------------------------------------
    Fix Version/s: 0.5.0
       Resolution: Fixed

> SCM scrub pipeline should be started after coming out of safe mode
> ------------------------------------------------------------------
>
>                 Key: HDDS-3072
>                 URL: https://issues.apache.org/jira/browse/HDDS-3072
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>            Reporter: Bharat Viswanadham
>            Assignee: Bharat Viswanadham
>            Priority: Blocker
>              Labels: pull-request-available
>             Fix For: 0.5.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Pipeline scrubbing should start only after SCM is out of safe mode.
> Reasons to do this:
>  # Right now, pipelines are scrubbed as part of triggerPipelineCreation. When 
> we scrub pipelines that have been in the allocated state for longer than 
> "ozone.scm.pipeline.allocated.timeout", some of them may be closed, and SCM 
> might then be unable to come out of safe mode, because the SafeModeRules 
> read the pipeline count from pipelineDB during initialization.
> Example scenario:
>  # Stop 3 Datanodes.
>  # Restart SCM.
>  # Start the Datanodes after 6 minutes. SCM will never come out of safe 
> mode, because the pipeline in the allocated state will already have met the 
> scrubber timeout condition.
> To avoid such scenarios, pipelines should be scrubbed only after SCM is out 
> of safe mode.
>  
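
The ordering described above can be sketched roughly as follows. This is only a toy model, not the actual SCM code or the patch for this issue: the class and method names (ScrubAfterSafeMode, PipelineManager, scrub, exitSafeMode) are hypothetical, and the 5 minute timeout is an assumed value chosen so that the 6 minute scenario trips it. The point is that the scrubber is a no-op while safe mode is on, so a pipeline waiting for its Datanodes is not closed before the safe mode rules can count it.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;

/** Toy model of "scrub only after safe mode exit"; all names are hypothetical. */
public class ScrubAfterSafeMode {

    enum State { ALLOCATED, OPEN, CLOSED }

    static final class Pipeline {
        final String id;
        final Instant created;
        State state = State.ALLOCATED;

        Pipeline(String id, Instant created) {
            this.id = id;
            this.created = created;
        }
    }

    static final class PipelineManager {
        private final List<Pipeline> pipelines = new ArrayList<>();
        private final Duration allocatedTimeout;
        private boolean inSafeMode = true; // SCM starts in safe mode

        PipelineManager(Duration allocatedTimeout) {
            this.allocatedTimeout = allocatedTimeout;
        }

        Pipeline add(String id, Instant created) {
            Pipeline p = new Pipeline(id, created);
            pipelines.add(p);
            return p;
        }

        void exitSafeMode() {
            inSafeMode = false;
        }

        /**
         * Closes pipelines that stayed ALLOCATED longer than the timeout.
         * Does nothing while safe mode is on; returns the number closed.
         */
        int scrub(Instant now) {
            if (inSafeMode) {
                return 0;
            }
            int closed = 0;
            for (Pipeline p : pipelines) {
                boolean stale =
                    Duration.between(p.created, now).compareTo(allocatedTimeout) > 0;
                if (p.state == State.ALLOCATED && stale) {
                    p.state = State.CLOSED;
                    closed++;
                }
            }
            return closed;
        }
    }

    public static void main(String[] args) {
        // Assumed timeout of 5 minutes, for illustration only.
        PipelineManager manager = new PipelineManager(Duration.ofMinutes(5));
        Instant restart = Instant.parse("2020-02-25T00:00:00Z");
        Pipeline waiting = manager.add("pipeline-1", restart);
        Pipeline orphan = manager.add("pipeline-2", restart);

        // Datanodes come back 6 minutes after the SCM restart.
        Instant later = restart.plus(Duration.ofMinutes(6));
        System.out.println("closed in safe mode: " + manager.scrub(later));

        // Datanodes report, the pipeline opens, and safe mode can exit.
        waiting.state = State.OPEN;
        manager.exitSafeMode();
        System.out.println("closed after safe mode: " + manager.scrub(later));
        System.out.println(waiting.id + " -> " + waiting.state);
        System.out.println(orphan.id + " -> " + orphan.state);
    }
}
```

In this model the first scrub call closes nothing because safe mode is still on, so the waiting pipeline survives until its Datanodes report. Only the pipeline that is still allocated after safe mode exit gets closed.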



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
