Eric Badger created MAPREDUCE-6649:
--------------------------------------

             Summary: getFailureInfo not returning any failure info
                 Key: MAPREDUCE-6649
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6649
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Eric Badger
            Assignee: Eric Badger


The following command causes the job to fail, but the client output contains 
no failure info explaining why: the text after "due to:" is empty.

{noformat}
$HADOOP_PREFIX/bin/hadoop jar $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar sleep -Dmapreduce.jobtracker.split.metainfo.maxsize=10 -Dmapreduce.job.queuename=default -m 1 -r 1 -mt 1 -rt 1
{noformat}

{noformat}
2016-03-07 10:34:58,112 INFO  [main] mapreduce.Job (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0004 failed with state FAILED due to: 
{noformat}

For contrast, here is a command and its command-line output for a failed job 
that does report the correct failure info. 

{noformat}
$HADOOP_PREFIX/bin/hadoop jar $HADOOP_PREFIX/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-${HADOOP_VERSION}-tests.jar sleep -Dyarn.app.mapreduce.am.command-opts=-goober -Dmapreduce.job.queuename=default -m 20 -r 0 -mt 30000
{noformat}

{noformat}
2016-03-07 10:30:13,103 INFO  [main] mapreduce.Job (Job.java:monitorAndPrintJob(1431)) - Job job_1457364518683_0003 failed with state FAILED due to: Application application_1457364518683_0003 failed 3 times due to AM Container for appattempt_1457364518683_0003_000003 exited with  exitCode: 1
Failing this attempt.Diagnostics: Exception from container-launch.
Container id: container_1457364518683_0003_03_000001
Exit code: 1
Stack trace: ExitCodeException exitCode=1: 
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:927)
        at org.apache.hadoop.util.Shell.run(Shell.java:838)
        at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1117)
        at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:227)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:319)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:88)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)
{noformat}
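
For anyone triaging similar reports, the difference between the two outputs above can be checked mechanically. The following is only a rough sketch, not part of Hadoop: the function names extract_failure_info and has_empty_failure_info are hypothetical, and it assumes the client's final status line always has the shape "Job <id> failed with state <STATE> due to: <info>" as seen in the logs above.

```python
import re

# Matches the client's final status line as it appears in the logs above.
# Assumption: everything after "due to:" is the failure info the client
# received; \s+ is used so that mail-wrapped lines still match.
_FAILED_LINE = re.compile(
    r"Job\s+(?P<job_id>job_\d+_\d+)\s+failed\s+with\s+state\s+"
    r"(?P<state>\w+)\s+due\s+to:\s*(?P<info>.*)",
    re.DOTALL,
)


def extract_failure_info(log_text):
    """Return (job_id, failure_info), or None if no failure line is found."""
    match = _FAILED_LINE.search(log_text)
    if match is None:
        return None
    # Collapse runs of whitespace so wrapped output compares cleanly.
    info = " ".join(match.group("info").split())
    return match.group("job_id"), info


def has_empty_failure_info(log_text):
    """True when a failure line exists but nothing follows 'due to:'."""
    parsed = extract_failure_info(log_text)
    return parsed is not None and parsed[1] == ""
```

Run against the first log excerpt, has_empty_failure_info should report the symptom described in this issue; against the second excerpt it should not, because the diagnostics follow "due to:".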



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
