[ https://issues.apache.org/jira/browse/YARN-10184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
kyungwan nam updated YARN-10184: -------------------------------- Attachment: YARN-10184.002.patch > NPE happens in NMClient when reinitializeContainer > -------------------------------------------------- > > Key: YARN-10184 > URL: https://issues.apache.org/jira/browse/YARN-10184 > Project: Hadoop YARN > Issue Type: Bug > Reporter: kyungwan nam > Assignee: kyungwan nam > Priority: Major > Attachments: YARN-10184.001.patch, YARN-10184.002.patch > > > NPE happens in NMClient when upgrading a yarn-service app which AM has been > restarted. > Here is AM’s log at the time of the NPE. > {code} > 2020-02-20 16:43:35,962 [Container Event Dispatcher] ERROR > yarn.YarnUncaughtExceptionHandler - Thread Thread[Container Event > Dispatcher,5,main] threw an Exception. > java.lang.NullPointerException > at > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$1.run(NMClientAsyncImpl.java:172) > 2020-02-20 16:43:36,398 [AMRM Callback Handler Thread] WARN > service.ServiceScheduler - Container > container_e58_1581930783345_1954_01_000006 Completed. No component instance > exists. exitStatus=-100. diagnostics=Container released by application > {code} > NMClient keeps containers since the container has been started. > But, when restarting AM, NMClient is initialized and previous containers are > lost. > Since then, NPE will happen when reinitializeContainer is requested. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org