[
https://issues.apache.org/jira/browse/TEZ-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14972625#comment-14972625
]
Bikas Saha commented on TEZ-808:
--------------------------------
Attaching patch that does 1) above. Will add framework IO progress calls in a
follow up jira.
Patch 1) adds an API to notifyProgress 2) notification counts are sent to the
AM 3) AM checks for unchanging notification counts for a certain threshold and
fails attempts that cross it. Unit and e2e tests added. [~jeagles] [~jlowe]
Please review. Thanks!
> Handle task attempts that are not making progress
> -------------------------------------------------
>
> Key: TEZ-808
> URL: https://issues.apache.org/jira/browse/TEZ-808
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Bikas Saha
> Assignee: Bikas Saha
> Attachments: TEZ-808.1.patch
>
>
> If a task attempt is not making progress then it may cause the job to hang.
> We may want to kill and restart the attempt. With speculation support and
> free resources we may want to run another version in parallel.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)