[ 
https://issues.apache.org/jira/browse/YARN-1686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohith updated YARN-1686:
-------------------------

    Attachment: YARN-1686.1.patch

> NodeManager.resyncWithRM() does not handle exception which cause NodeManger 
> to Hang.
> ------------------------------------------------------------------------------------
>
>                 Key: YARN-1686
>                 URL: https://issues.apache.org/jira/browse/YARN-1686
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.3.0
>            Reporter: Rohith
>            Assignee: Rohith
>             Fix For: 3.0.0
>
>         Attachments: YARN-1686.1.patch
>
>
> During start of NodeManager,if registration with resourcemanager throw 
> exception then nodemager shutdown happens. 
> Consider case where NM-1 is registered with RM. RM issued Resync to NM. If 
> any exception thrown in "resyncWithRM" (starts new thread which does not 
> handle exception) during RESYNC evet, then this thread is lost. NodeManger 
> enters hanged state. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to