[ 
https://issues.apache.org/jira/browse/HDFS-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13050761#comment-13050761
 ] 

Todd Lipcon commented on HDFS-2077:
-----------------------------------

This was in fact true prior to HDFS-1073.. the difference is that it would 
rarely happen, since the failure state of edits logs and image dirs was 
coupled. Now, if an edit log fails to write, that doesn't cause the image dir 
to immediately be marked failed, so it's more likely that it will be 
"discovered" at checkpoint time by the GetImageServlet.

> 1073: address checkpoint upload when one of the storage dirs is failed
> ----------------------------------------------------------------------
>
>                 Key: HDFS-2077
>                 URL: https://issues.apache.org/jira/browse/HDFS-2077
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: name-node
>    Affects Versions: Edit log branch (HDFS-1073)
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>             Fix For: Edit log branch (HDFS-1073)
>
>
> This JIRA addresses the following case:
> - NN is running with 2 storage dirs
> - 1 of the dirs fails
> - 2NN makes a checkpoint
> Currently, if GetImageServlet fails to open _any_ of the local files to 
> receive a checkpoint, it will fail the entire checkpoint upload process. 
> Instead, it should continue to receive checkpoints in the non-failed 
> directories.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to