[ 
https://issues.apache.org/jira/browse/YARN-2902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14902808#comment-14902808
 ] 

Varun Saxena commented on YARN-2902:
------------------------------------

Ok. Then what I will do is NOT wait for completion of running tasks which have 
been cancelled. 
Localizer will try to only delete directories for which download was complete. 
Tasks which have failed, directories for them will anyways be deleted by 
FSDownload.

We however may need a config in NM for deletion task delay(the one I have added 
in current patch). Or we can simply have a hardcoded value of 2 minutes.
Regarding System exit, it will called after ExecutorService#shutdownNow(which 
will only interrupt running tasks and not wait for them) anyways.

> Killing a container that is localizing can orphan resources in the 
> DOWNLOADING state
> ------------------------------------------------------------------------------------
>
>                 Key: YARN-2902
>                 URL: https://issues.apache.org/jira/browse/YARN-2902
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 2.5.0
>            Reporter: Jason Lowe
>            Assignee: Varun Saxena
>         Attachments: YARN-2902.002.patch, YARN-2902.03.patch, 
> YARN-2902.04.patch, YARN-2902.05.patch, YARN-2902.06.patch, YARN-2902.patch
>
>
> If a container is in the process of localizing when it is stopped/killed then 
> resources are left in the DOWNLOADING state.  If no other container comes 
> along and requests these resources they linger around with no reference 
> counts but aren't cleaned up during normal cache cleanup scans since it will 
> never delete resources in the DOWNLOADING state even if their reference count 
> is zero.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to