[
https://issues.apache.org/jira/browse/TEZ-900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301946#comment-14301946
]
chang li commented on TEZ-900:
------------------------------
I have done some further investigation, noticed that the TezSessionManager on
Pig does explicitly call TezAppMasterStatus on TezClient object, which will
print out the diagnostic. Hive currently still have some problems to print out
useful diagnostic on job's failure before launch (such as submit to the wrong
queue), but Hive should be able to solve this easily, because in the "execute"
function from TezTask, monitorExecution will be called, and according to the
@HIVE-5041 diagnostic should be able to printed out. However, in the case of
jobs submitted to the wrong queue, exception will be thrown before
monitorExecution in TezTask's execute function. In the Hive branch-0.13,
exception will be thrown during updateSession, and the waitTillReady within
updateSession. waitTillReady of TezClient will throw an SessionNotRunning
exception, which contains the diagnostic. But currently, Hive TezTask somehow
doesn't print out the exception error to the console, instead record the error
in the log. So the right thing for some Hive people to do is to properly handle
the exception and display the error.
So I propose that every non session job should call TezAppMasterStatus after
TezClient.start, and every session job should call waitTillReady after
TezClient.start. Actually within waitTillReady it will also call the
TezAppMasterStatus. TezAppMasterStatus is an existing function that helps print
out useful diagnostics.
> Confusing message for incorrect queue for some tez examples
> -----------------------------------------------------------
>
> Key: TEZ-900
> URL: https://issues.apache.org/jira/browse/TEZ-900
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.3.0
> Reporter: Jonathan Eagles
> Assignee: chang li
>
> The error message of at least orderedwordcount with non-existent queue is
> very difficult as compared to mrrsleep which is very intuitive.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)