[ 
https://issues.apache.org/jira/browse/YARN-10184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

kyungwan nam updated YARN-10184:
--------------------------------
    Attachment: YARN-10184.002.patch

> NPE happens in NMClient when reinitializeContainer
> --------------------------------------------------
>
>                 Key: YARN-10184
>                 URL: https://issues.apache.org/jira/browse/YARN-10184
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: kyungwan nam
>            Assignee: kyungwan nam
>            Priority: Major
>         Attachments: YARN-10184.001.patch, YARN-10184.002.patch
>
>
> NPE happens in NMClient when upgrading a yarn-service app whose AM has been 
> restarted.
> Here is the AM's log at the time of the NPE.
> {code}
> 2020-02-20 16:43:35,962 [Container  Event Dispatcher] ERROR yarn.YarnUncaughtExceptionHandler - Thread Thread[Container  Event Dispatcher,5,main] threw an Exception.
> java.lang.NullPointerException
>       at org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$1.run(NMClientAsyncImpl.java:172)
> 2020-02-20 16:43:36,398 [AMRM Callback Handler Thread] WARN  service.ServiceScheduler - Container container_e58_1581930783345_1954_01_000006 Completed. No component instance exists. exitStatus=-100. diagnostics=Container released by application
> {code}
> NMClient keeps track of each container from the time the container is started.
> However, when the AM is restarted, NMClient is re-initialized and its record of 
> the previously started containers is lost.
> From then on, an NPE happens whenever reinitializeContainer is requested for 
> one of those containers.
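
The failure mode described above can be illustrated with a minimal, self-contained sketch. It is not the NMClientAsyncImpl code and not the attached patch: the class NmClientRestartSketch, its Tracker and TrackedContainer types, and both reinitialize methods are hypothetical names chosen for illustration. The only assumption carried over from the report is that containers are tracked in memory, so a client created after an AM restart starts with an empty map.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; none of these names come from Hadoop.
class NmClientRestartSketch {

    static class TrackedContainer {
        String state = "RUNNING";
    }

    static class Tracker {
        // In-memory only: a new Tracker (as after an AM restart) knows no containers.
        private final Map<String, TrackedContainer> containers = new ConcurrentHashMap<>();

        void startContainer(String containerId) {
            containers.put(containerId, new TrackedContainer());
        }

        // Unguarded lookup: throws NullPointerException for an untracked container.
        String reinitializeUnguarded(String containerId) {
            TrackedContainer c = containers.get(containerId);
            c.state = "REINITIALIZING"; // c is null if the container was never registered here
            return "reinitializing " + containerId;
        }

        // Guarded lookup: reports the untracked container instead of throwing.
        String reinitializeGuarded(String containerId) {
            TrackedContainer c = containers.get(containerId);
            if (c == null) {
                return "unknown container: " + containerId;
            }
            c.state = "REINITIALIZING";
            return "reinitializing " + containerId;
        }
    }

    public static void main(String[] args) {
        String id = "container_e58_1581930783345_1954_01_000006";

        Tracker beforeRestart = new Tracker();
        beforeRestart.startContainer(id);
        System.out.println(beforeRestart.reinitializeGuarded(id));

        // Simulated AM restart: a fresh tracker with no knowledge of earlier containers.
        Tracker afterRestart = new Tracker();
        try {
            afterRestart.reinitializeUnguarded(id);
        } catch (NullPointerException e) {
            System.out.println("NPE on unguarded lookup after restart");
        }
        System.out.println(afterRestart.reinitializeGuarded(id));
    }
}
```

Whether the actual fix guards the lookup, re-registers recovered containers after the restart, or does something else is not stated in this message; the guarded variant only shows one way such a lookup could avoid killing the dispatcher thread.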



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org
