[
https://issues.apache.org/jira/browse/YARN-6805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083110#comment-16083110
]
Jason Lowe commented on YARN-6805:
----------------------------------
Sample stack trace from a 2.8-based release:
{noformat}
2017-07-10 20:39:12,810 [LocalizerRunner for
container_e03_1496686551678_8189060_01_005998] WARN
privileged.PrivilegedOperationExecutor: IOException executing command:
java.io.InterruptedIOException: java.lang.InterruptedException
at org.apache.hadoop.util.Shell.runCommand(Shell.java:1007)
at org.apache.hadoop.util.Shell.run(Shell.java:898)
at
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1213)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:151)
at
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:263)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1155)
Caused by: java.lang.InterruptedException
at java.lang.Object.wait(Native Method)
at java.lang.Object.wait(Object.java:502)
at java.lang.UNIXProcess.waitFor(UNIXProcess.java:395)
at org.apache.hadoop.util.Shell.runCommand(Shell.java:997)
... 5 more
2017-07-10 20:39:12,811 [LocalizerRunner for
container_e03_1496686551678_8189060_01_005998] INFO
localizer.ResourceLocalizationService: Localizer failed
java.lang.NullPointerException
at
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:267)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1155)
{noformat}
> NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit
> code
> --------------------------------------------------------------------------------
>
> Key: YARN-6805
> URL: https://issues.apache.org/jira/browse/YARN-6805
> Project: Hadoop YARN
> Issue Type: Bug
> Components: nodemanager
> Affects Versions: 2.8.1
> Reporter: Jason Lowe
>
> The LinuxContainerExecutor contains a number of code snippets like this:
> {code}
> } catch (PrivilegedOperationException e) {
> int exitCode = e.getExitCode();
> {code}
> PrivilegedOperationException#getExitCode can return null if the operation was
> interrupted, so when the JVM does auto-unboxing on that last line it can NPE
> if there was no exit code.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]