[ https://issues.apache.org/jira/browse/MAPREDUCE-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Eric Badger updated MAPREDUCE-6649: ----------------------------------- Attachment: MAPREDUCE-6649.001.patch Uploading patch that fixes the issue and also creates a unit test to maintain the fix. > getFailureInfo not returning any failure info > --------------------------------------------- > > Key: MAPREDUCE-6649 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-6649 > Project: Hadoop Map/Reduce > Issue Type: Bug > Reporter: Eric Badger > Assignee: Eric Badger > Attachments: MAPREDUCE-6649.001.patch > > > The following command does not produce any failure info as to why the job > failed. > {noformat} > $HADOOP_PREFIX/bin/hadoop jar > $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar > sleep -Dmapreduce.jobtracker.split.metainfo.maxsize=10 > -Dmapreduce.job.queuename=default -m 1 -r 1 -mt 1 -rt 1 > {noformat} > {noformat} > 2016-03-07 10:34:58,112 INFO [main] mapreduce.Job > (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0004 failed with > state FAILED due to: > {noformat} > To contrast, here is a command and associated command line output to show a > failed job that gives the correct failiure info. > {noformat} > $HADOOP_PREFIX/bin/hadoop jar > $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar > sleep -Dyarn.app.mapreduce.am.command-opts=-goober > -Dmapreduce.job.queuename=default -m 20 -r 0 -mt 30000 > {noformat} > {noformat} > 2016-03-07 10:30:13,103 INFO [main] mapreduce.Job > (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0003 failed with > state FAILED due to: Application application_1457364518683_0003 failed 3 > times due to AM Container for appattempt_1457364518683_0003_000003 exited > with exitCode: 1 > Failing this attempt.Diagnostics: Exception from container-launch. > Container id: container_1457364518683_0003_03_000001 > Exit code: 1 > Stack trace: ExitCodeException exitCode=1: > at org.apache.hadoop.util.Shell.runCommand(Shell.java:927) > at org.apache.hadoop.util.Shell.run(Shell.java:838) > at > org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1117) > at > org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:227) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:319) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:88) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)