[ https://issues.apache.org/jira/browse/TEZ-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17406479#comment-17406479 ]
Rajesh Balamohan commented on TEZ-4139: --------------------------------------- >> can these changes go into the same patch? Sure, link both the tickets after the patch and it should be fine. > Tez should consider node information for computing failure fraction > ------------------------------------------------------------------- > > Key: TEZ-4139 > URL: https://issues.apache.org/jira/browse/TEZ-4139 > Project: Apache Tez > Issue Type: Improvement > Reporter: Rajesh Balamohan > Assignee: László Bodor > Priority: Major > Attachments: TEZ-4139.01.WIP.patch, TEZ-4139.02.WIP.patch > > > When lots of downstream attempts fail to pull the information from source > task, source task is marked as failed and it is retried. Currently failure > fraction is handled by looking at unique task attempts from downstream. > However, it should consider taking into account node information for > computing "failureFraction". > https://github.com/apache/tez/blob/master/tez-dag/src/main/java/org/apache/tez/dag/app/dag/impl/TaskAttemptImpl.java#L1845-L1849 -- This message was sent by Atlassian Jira (v8.3.4#803005)