[
https://issues.apache.org/jira/browse/TEZ-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16642016#comment-16642016
]
Kuhu Shukla edited comment on TEZ-3990 at 10/8/18 3:32 PM:
-----------------------------------------------------------
Addressed comments by [~jeagles]. Agreed on the issues mentioned with delay
calculation and testability. [~jeagles], should I go ahead and create JIRAs for
these issues?
P.S. The unordered case doesn't seem to have the concept of penalties fyi..
which is odd..
was (Author: kshukla):
Addressed comments by [~jeagles]. Agreed on the issues mentioned with delay
calculation and testability. [~jeagles], should I go ahead and create JIRAs for
these issues?
> The number of shuffle penalties for a host/inputAttemptIdentifier should be
> capped
> ----------------------------------------------------------------------------------
>
> Key: TEZ-3990
> URL: https://issues.apache.org/jira/browse/TEZ-3990
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.9.1, 0.10.0
> Reporter: Kuhu Shukla
> Assignee: Kuhu Shukla
> Priority: Major
> Attachments: TEZ-3990.001.patch, TEZ-3990.002.patch,
> TEZ-3990.003.patch, TEZ-3990.004.patch
>
>
> In a scenario where the same mapId fetches fail, the penalty code allows
> adding the same Host/InputAttemptIdentifier over and over with revised
> penalty time that grows exponentially. It should at some point drop the
> retrying and report failure to the AM asap to allow the job to rectify the
> upstream output.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)