[jira] [Commented] (YARN-90) NodeManager should identify failed disks becoming good back again

Ravi Prakash (JIRA) Fri, 01 Nov 2013 15:41:46 -0700

    [ 
https://issues.apache.org/jira/browse/YARN-90?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13811724#comment-13811724
 ]


Ravi Prakash commented on YARN-90:
----------------------------------

Apart from DirectoryCollection changes, I think we should also update 
LocalDirAllocation.AllocatorPerContext. Maybe we should handle that in a 
separate JIRA.

Anyway. I noticed that after this patch, although DirectoryCollection recovered 
the repaired directories, they were not actually used. I wonder if its 
something wrong with my test procedure or we need more changes.

> NodeManager should identify failed disks becoming good back again
> -----------------------------------------------------------------
>
>                 Key: YARN-90
>                 URL: https://issues.apache.org/jira/browse/YARN-90
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>            Reporter: Ravi Gummadi
>         Attachments: YARN-90.1.patch, YARN-90.patch
>
>
> MAPREDUCE-3121 makes NodeManager identify disk failures. But once a disk goes 
> down, it is marked as failed forever. To reuse that disk (after it becomes 
> good), NodeManager needs restart. This JIRA is to improve NodeManager to 
> reuse good disks(which could be bad some time back).



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (YARN-90) NodeManager should identify failed disks becoming good back again

Reply via email to