[ 
https://issues.apache.org/jira/browse/YARN-9608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861943#comment-16861943
 ] 

Zhankun Tang edited comment on YARN-9608 at 6/12/19 10:24 AM:
--------------------------------------------------------------

Thanks for the patch, [~abmodi]!
 I took a quick look at it and have some minor questions:

1. Is the standalone "rmNode.getRunningApps()" call deliberate? Its return value is not used.
{code:java}
DecommissioningNodeContext context = decomNodes.get(rmNode.getNodeID());
rmNode.getRunningApps();
long now = mclock.getTime();{code}
2. Do we need to update the comment below?

{code:java}
    // All applications run on the node at or after decommissioningStartTime.
    private List<ApplicationId> appIds;
{code}



was (Author: tangzhankun):
Thanks for the patch! [~abmodi].
 I took a glance at it. Some minor question:

Is the "rmNode.getRunningApps()" deliberate?
{code:java}
DecommissioningNodeContext context = decomNodes.get(rmNode.getNodeID());
rmNode.getRunningApps();
long now = mclock.getTime();{code}

> DecommissioningNodesWatcher should get lists of running applications on node 
> from RMNode.
> -----------------------------------------------------------------------------------------
>
>                 Key: YARN-9608
>                 URL: https://issues.apache.org/jira/browse/YARN-9608
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Abhishek Modi
>            Assignee: Abhishek Modi
>            Priority: Major
>         Attachments: YARN-9608.001.patch
>
>
> At present, DecommissioningNodesWatcher tracks the list of running applications 
> and triggers decommissioning of a node once all the applications that ran on the 
> node have completed. This Jira proposes to solve the following problems:
>  # DecommissioningNodesWatcher does not track application containers on a 
> node before the node enters the DECOMMISSIONING state. It only tracks 
> containers once the node is in the DECOMMISSIONING state. This can lead to 
> shuffle data loss for apps whose containers ran on this node before it was 
> moved to the DECOMMISSIONING state.
>  # It keeps its own list of running apps, although this information is 
> already available directly from RMNode.
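
The two problems in the description can be illustrated with a small, self-contained sketch. This is not the actual YARN code or the patch: the classes Node, SelfTrackingWatcher and NodeBackedWatcher are hypothetical stand-ins, application ids are plain strings, and only the method name getRunningApps() is taken from the snippet quoted in the comment. It is meant only to show why a watcher that starts tracking at DECOMMISSIONING time can miss applications that ran earlier.

```java
import java.util.HashSet;
import java.util.Set;

public class DecommissionSketch {

    /** Hypothetical stand-in for RMNode: knows every app that has a container on it. */
    static class Node {
        private final Set<String> runningApps = new HashSet<>();
        private boolean decommissioning = false;

        void startContainer(String appId) { runningApps.add(appId); }
        void finishApp(String appId) { runningApps.remove(appId); }
        void startDecommissioning() { decommissioning = true; }
        boolean isDecommissioning() { return decommissioning; }
        Set<String> getRunningApps() { return new HashSet<>(runningApps); }
    }

    /** Assumed "before" behaviour: keeps its own list, filled only while DECOMMISSIONING. */
    static class SelfTrackingWatcher {
        private final Set<String> appIds = new HashSet<>();

        void onContainerSeen(Node node, String appId) {
            // Containers reported before the node is DECOMMISSIONING are ignored.
            if (node.isDecommissioning()) {
                appIds.add(appId);
            }
        }

        void onAppFinished(String appId) { appIds.remove(appId); }

        boolean readyToDecommission() { return appIds.isEmpty(); }
    }

    /** Assumed "after" behaviour: asks the node for its running apps instead. */
    static class NodeBackedWatcher {
        boolean readyToDecommission(Node node) {
            return node.getRunningApps().isEmpty();
        }
    }

    public static void main(String[] args) {
        Node node = new Node();
        SelfTrackingWatcher selfTracking = new SelfTrackingWatcher();
        NodeBackedWatcher nodeBacked = new NodeBackedWatcher();

        // app_1 starts a container before decommissioning begins.
        node.startContainer("app_1");
        selfTracking.onContainerSeen(node, "app_1");
        node.startDecommissioning();

        System.out.println("self-tracking ready: " + selfTracking.readyToDecommission());
        System.out.println("node-backed ready:   " + nodeBacked.readyToDecommission(node));
    }
}
```

In this sketch the self-tracking watcher reports the node as ready while app_1 is still running, which corresponds to the shuffle data loss scenario in point 1, whereas the node-backed watcher waits until the node itself reports no running apps.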



