[ https://issues.apache.org/jira/browse/MAPREDUCE-3688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13588931#comment-13588931 ]
Jason Lowe commented on MAPREDUCE-3688:
---------------------------------------
The original intent of this JIRA is specific to MapReduce: the AM encounters a
configuration error that causes it to abort before it even tries to register
with the RM. If the AM never registers with the RM, there's no hope of the RM's
web UI showing a useful diagnostic, since the AM is the only thing that knows
what went wrong. Because the container simply exited, all the RM can do is
point to the AM logs (which is already fixed by things like YARN-236).

It'd be nice to handle all the YARN-side failures such as localization and
launching, but that belongs in a separate YARN JIRA. Most things that happen
once the MRAppMaster gets going could be handled better by the AM itself. The
description here and MAPREDUCE-4937 are perfect examples: the AM could do a much
better job of informing the user by registering with the RM, then unregistering
with a failure status and a useful diagnostic, and also generating job history
so the error shows up in the "usual places" for users.
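A rough sketch of that register-then-unregister idea, to make the control flow concrete. The RmClient interface and the RecordingRmClient stub are hypothetical stand-ins so the example runs without a cluster; they are not the actual YARN or MRAppMaster APIs, and job history generation is left out.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for the AM's channel to the RM; not a real Hadoop interface. */
interface RmClient {
    void register();
    void unregister(boolean succeeded, String diagnostics);
}

/** Records calls in order so the sketch can be run and inspected locally. */
class RecordingRmClient implements RmClient {
    final List<String> calls = new ArrayList<>();

    @Override
    public void register() {
        calls.add("register");
    }

    @Override
    public void unregister(boolean succeeded, String diagnostics) {
        calls.add("unregister succeeded=" + succeeded + " diagnostics=" + diagnostics);
    }
}

class AmFailureReportSketch {
    /**
     * Runs the AM's initialization. If it throws, the AM still registers and
     * then immediately unregisters with a failure status, passing the exception
     * text along as the diagnostic instead of just exiting.
     */
    static boolean runAm(RmClient rm, Runnable init) {
        try {
            init.run(); // e.g. parsing configuration; may throw
        } catch (RuntimeException e) {
            rm.register();
            rm.unregister(false, e.toString());
            return false;
        }
        rm.register();
        // Normal job execution would happen here.
        rm.unregister(true, "");
        return true;
    }

    public static void main(String[] args) {
        RecordingRmClient rm = new RecordingRmClient();
        // Simulate the bad configuration value from the issue description.
        runAm(rm, () -> Long.parseLong("9223372036854775807l"));
        rm.calls.forEach(System.out::println);
    }
}
```

The point of the design is only that the failure path goes through the same RM-facing calls as a normal run, so the diagnostic ends up wherever the RM already surfaces application status.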
> Need better Error message if AM is killed/throws exception
> ----------------------------------------------------------
>
> Key: MAPREDUCE-3688
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3688
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mr-am, mrv2
> Affects Versions: 0.23.1
> Reporter: David Capwell
> Assignee: Ravi Prakash
> Fix For: 0.23.2
>
>
> We need better error messages in the UI if the AM gets killed or throws an
> Exception.
> If the following error gets thrown:
> java.lang.NumberFormatException: For input string: "9223372036854775807l" // last char is an L
> then the UI should show this exception. Instead I get the following:
> Application application_1326504761991_0018 failed 1 times due to AM Container
> for appattempt_1326504761991_0018_000001
> exited with exitCode: 1 due to: Exception from container-launch:
> org.apache.hadoop.util.Shell$ExitCodeException
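For reference, the NumberFormatException quoted in the description can be reproduced in isolation. This is a minimal sketch that assumes the value was parsed with Long.parseLong; the report does not say which call or which configuration key was involved, and describeParse is a name made up for this example.

```java
class ParseLongSketch {
    /** Tries to parse the text as a decimal long and describes the outcome. */
    static String describeParse(String raw) {
        try {
            return "ok: " + Long.parseLong(raw);
        } catch (NumberFormatException e) {
            // Long.parseLong does not accept Java's literal suffix 'l'/'L'.
            return "error: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(describeParse("9223372036854775807"));
        System.out.println(describeParse("9223372036854775807l"));
    }
}
```

The first value parses because it is exactly Long.MAX_VALUE; the second fails only because of the trailing "l", which is valid in Java source code but not in text passed to Long.parseLong.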