[ 
https://issues.apache.org/jira/browse/YARN-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107096#comment-15107096
 ] 

Chang Li commented on YARN-4589:
--------------------------------

[~jlowe] please help review the latest patch.
Latest implementation add a new container external state localizing, and in 
each nodeheartbeat to rm, RMNode maintains and updates states of its container. 
When RMAppattempt timeout it queries from RMNode about its container state. The 
implementation also considers backward compatibility

> Diagnostics for localization timeouts is lacking
> ------------------------------------------------
>
>                 Key: YARN-4589
>                 URL: https://issues.apache.org/jira/browse/YARN-4589
>             Project: Hadoop YARN
>          Issue Type: Improvement
>            Reporter: Chang Li
>            Assignee: Chang Li
>         Attachments: YARN-4589.2.patch, YARN-4589.3.patch, YARN-4589.patch
>
>
> When a container takes too long to localize it manifests as a timeout, and 
> there's no indication that localization was the issue. We need diagnostics 
> for timeouts to indicate the container was still localizing when the timeout 
> occurred.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to