[
https://issues.apache.org/jira/browse/MAPREDUCE-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15244449#comment-15244449
]
Eric Payne commented on MAPREDUCE-6649:
---------------------------------------
[~ebadger], I have checked this fix into trunk, branch-2 and branch-2.8. It
looks like it will need a separate patch if we want it to go into branch-2.7.
> getFailureInfo not returning any failure info
> ---------------------------------------------
>
> Key: MAPREDUCE-6649
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6649
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Eric Badger
> Assignee: Eric Badger
> Attachments: MAPREDUCE-6649.001.patch, MAPREDUCE-6649.002.patch
>
>
> The following command does not produce any failure info as to why the job
> failed.
> {noformat}
> $HADOOP_PREFIX/bin/hadoop jar
> $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar
> sleep -Dmapreduce.jobtracker.split.metainfo.maxsize=10
> -Dmapreduce.job.queuename=default -m 1 -r 1 -mt 1 -rt 1
> {noformat}
> {noformat}
> 2016-03-07 10:34:58,112 INFO [main] mapreduce.Job
> (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0004 failed with
> state FAILED due to:
> {noformat}
> To contrast, here is a command and associated command line output to show a
> failed job that gives the correct failiure info.
> {noformat}
> $HADOOP_PREFIX/bin/hadoop jar
> $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar
> sleep -Dyarn.app.mapreduce.am.command-opts=-goober
> -Dmapreduce.job.queuename=default -m 20 -r 0 -mt 30000
> {noformat}
> {noformat}
> 2016-03-07 10:30:13,103 INFO [main] mapreduce.Job
> (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0003 failed with
> state FAILED due to: Application application_1457364518683_0003 failed 3
> times due to AM Container for appattempt_1457364518683_0003_000003 exited
> with exitCode: 1
> Failing this attempt.Diagnostics: Exception from container-launch.
> Container id: container_1457364518683_0003_03_000001
> Exit code: 1
> Stack trace: ExitCodeException exitCode=1:
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:927)
> at org.apache.hadoop.util.Shell.run(Shell.java:838)
> at
> org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1117)
> at
> org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:227)
> at
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:319)
> at
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:88)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)