kyungwan nam created YARN-10184: ----------------------------------- Summary: NPE happens in NMClient when reinitializeContainer Key: YARN-10184 URL: https://issues.apache.org/jira/browse/YARN-10184 Project: Hadoop YARN Issue Type: Bug Reporter: kyungwan nam Assignee: kyungwan nam
NPE happens in NMClient when upgrading a yarn-service app which AM has been restarted. Here is AM’s log at the time of the NPE. {code} 2020-02-20 16:43:35,962 [Container Event Dispatcher] ERROR yarn.YarnUncaughtExceptionHandler - Thread Thread[Container Event Dispatcher,5,main] threw an Exception. java.lang.NullPointerException at org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$1.run(NMClientAsyncImpl.java:172) 2020-02-20 16:43:36,398 [AMRM Callback Handler Thread] WARN service.ServiceScheduler - Container container_e58_1581930783345_1954_01_000006 Completed. No component instance exists. exitStatus=-100. diagnostics=Container released by application {code} NMClient keeps containers since the container has been started. But, when restarting AM, NMClient is initialized and previous containers are lost. Since then, NPE will happen when reinitializeContainer is requested. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org