[
https://issues.apache.org/jira/browse/TEZ-4232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
László Bodor updated TEZ-4232:
------------------------------
Description:
While using analyzers, I found that TaskAttemptFinishedEvent contains a [very
long message in
diagnostics|https://github.com/apache/tez/blob/master/tez-dag/src/main/java/org/apache/tez/dag/app/dag/impl/TaskAttemptImpl.java#L1194-L1195],
the whole exception trace in particular. I think for reporting purposes, a
shorter message should be available, for instance, in case of shuffle issues,
the original message of the exception:
{code}
java.io.IOException: Map_1: Shuffle failed with too many fetch failures and
insufficient progress!failureCounts=82, pendingInputs=274,
fetcherHealthy=false, reducerProgressedEnough=true, reducerStalled=true
{code}
was:I found that
> TaskAttemptFinishedEvent should contain a short diagnostic message
> ------------------------------------------------------------------
>
> Key: TEZ-4232
> URL: https://issues.apache.org/jira/browse/TEZ-4232
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: László Bodor
> Assignee: László Bodor
> Priority: Major
>
> While using analyzers, I found that TaskAttemptFinishedEvent contains a [very
> long message in
> diagnostics|https://github.com/apache/tez/blob/master/tez-dag/src/main/java/org/apache/tez/dag/app/dag/impl/TaskAttemptImpl.java#L1194-L1195],
> the whole exception trace in particular. I think for reporting purposes, a
> shorter message should be available, for instance, in case of shuffle issues,
> the original message of the exception:
> {code}
> java.io.IOException: Map_1: Shuffle failed with too many fetch failures and
> insufficient progress!failureCounts=82, pendingInputs=274,
> fetcherHealthy=false, reducerProgressedEnough=true, reducerStalled=true
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)