[ 
https://issues.apache.org/jira/browse/YARN-9608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16860470#comment-16860470
 ] 

jialei weng edited comment on YARN-9608 at 6/11/19 1:58 AM:
------------------------------------------------------------

{color:#333333}This solution provides an idea to extend life-cycle of node 
local data to the whole application running time. A small question here, if the 
application is long running job, the node decommission time will also take 
longer? And rely on the time-out? [~abmodi] Please correct me if I 
misunderstand.{color}


was (Author: wjlei):
{color:#333333}This solution provides an idea to extend life-cycle of 
{color:#333333}node local data to the whole application running time. A small 
question here, if the application is long running job, the node decommission 
time will also take longer? And rely on the time-out? Please correct me if I 
misunderstand.{color}{color}

> DecommissioningNodesWatcher should get lists of running applications on node 
> from RMNode.
> -----------------------------------------------------------------------------------------
>
>                 Key: YARN-9608
>                 URL: https://issues.apache.org/jira/browse/YARN-9608
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Abhishek Modi
>            Assignee: Abhishek Modi
>            Priority: Major
>         Attachments: YARN-9608.001.patch
>
>
> At present, DecommissioningNodesWatcher tracks list of running applications 
> and triggers decommission of nodes when all the applications that ran on the 
> node completes. This Jira proposes to solve following problem:
>  # DecommissioningNodesWatcher skips tracking application containers on a 
> particular node before the node is in DECOMMISSIONING state. It only tracks 
> containers once the node is in DECOMMISSIONING state. This can lead to 
> shuffle data loss of apps whose containers ran on this node before it was 
> moved to decommissioning state.
>  # It is keeping track of running apps. We can leverage this directly from 
> RMNode.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to