[ https://issues.apache.org/jira/browse/YARN-7890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jason Lowe updated YARN-7890:
-----------------------------
Target Version/s: 3.1.0, 2.10.0, 2.9.1, 3.0.1 (was: 3.1.0, 2.9.1, 3.0.1)
It is typically at the discretion of the committer. Many things are picked to
maintenance branches without precommit runs, as it simply would take too long
to run them all (e.g., picking something all the way back to 2.7, which as of
today involves committing to 7 branches). IMHO, if the pick is clean and the
risk of the change is low, it isn't worth running all those separate precommit
builds.

Since this is an NPE that breaks relaunch, I think this needs to be fixed in
3.0.1 and 2.9.1, which involves picking it to branch-3.0, branch-3.0.1,
branch-2, and branch-2.9.
> NPE during container relaunch
> -----------------------------
>
> Key: YARN-7890
> URL: https://issues.apache.org/jira/browse/YARN-7890
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 2.9.0, 3.0.0-alpha1
> Reporter: Billie Rinaldi
> Assignee: Jason Lowe
> Priority: Major
> Attachments: YARN-7890.001.patch, YARN-7890.002.patch
>
>
> While running a recent build of trunk, I saw the following:
> {noformat}
> 2018-02-02 21:02:40,026 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl: Container container_e02_1517604848419_0002_01_000004 transitioned from RELAUNCHING to RUNNING
> 2018-02-02 21:02:40,026 WARN org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch: Failed to relaunch container.
> java.lang.NullPointerException
>     at java.util.Collections$UnmodifiableCollection.<init>(Collections.java:1026)
>     at java.util.Collections$UnmodifiableList.<init>(Collections.java:1302)
>     at java.util.Collections.unmodifiableList(Collections.java:1287)
>     at org.apache.hadoop.yarn.server.nodemanager.executor.ContainerStartContext.getFilecacheDirs(ContainerStartContext.java:200)
>     at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.buildContainerRuntimeContext(LinuxContainerExecutor.java:651)
>     at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.launchContainer(LinuxContainerExecutor.java:546)
>     at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.launchContainer(ContainerLaunch.java:465)
>     at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:107)
>     at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:49)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
> {noformat}
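
For anyone reading the trace: the top frames show
Collections.unmodifiableList being handed a null list by
ContainerStartContext.getFilecacheDirs. Below is a minimal, self-contained
sketch of that failure mode and of one possible null guard. The class
FilecacheDirsSketch and its methods are hypothetical stand-ins, not the
NodeManager code, and the guard is only an illustration; it is not
necessarily what the attached patches do.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for a context object that may be built without
// its filecache directory list (assumed here to be what happens on relaunch).
class FilecacheDirsSketch {
    private final List<String> filecacheDirs; // may be null

    FilecacheDirsSketch(List<String> filecacheDirs) {
        this.filecacheDirs = filecacheDirs;
    }

    // Mirrors the failing pattern: wrapping a null list throws an NPE.
    List<String> getUnguarded() {
        return Collections.unmodifiableList(filecacheDirs);
    }

    // One possible guard: fall back to an empty list when nothing was set.
    List<String> getGuarded() {
        return filecacheDirs == null
            ? Collections.<String>emptyList()
            : Collections.unmodifiableList(filecacheDirs);
    }

    // Helper that reports whether running the action throws an NPE.
    static boolean throwsNpe(Runnable action) {
        try {
            action.run();
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        FilecacheDirsSketch missing = new FilecacheDirsSketch(null);
        System.out.println("unguarded throws NPE: "
            + throwsNpe(missing::getUnguarded));
        System.out.println("guarded size: " + missing.getGuarded().size());

        FilecacheDirsSketch present =
            new FilecacheDirsSketch(Arrays.asList("/tmp/filecache"));
        System.out.println("guarded with dirs: " + present.getGuarded());
    }
}
```

Whether an empty list is an acceptable default for a relaunch, or whether
the relaunch path should instead populate the directories, is a design
question this sketch does not answer.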