[
https://issues.apache.org/jira/browse/TEZ-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14957365#comment-14957365
]
Rohini Palaniswamy commented on TEZ-808:
----------------------------------------
Filed PIG-4700 to call setProgress() on ProcessorContext. I somehow missed that
method when I was checking the Tez code when this issue came up.
I agree with [~jlowe]. All the mapreduce frameworks have always been diligent
about reporting progress and fixing places if things get killed because of
progress reporting is not done in some case. So it should not be difficult to
implement it for Tez. For eg: in Pig it is a one line fix as it has an
interface and different implementations for mapreduce and tez for reporting
progress with the tez method currently doing nothing. I believe hive also
should have something like that.
And having the default as no timeout will help with the migration without not
breaking anyone.
> Handle task attempts that are not making progress
> -------------------------------------------------
>
> Key: TEZ-808
> URL: https://issues.apache.org/jira/browse/TEZ-808
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Bikas Saha
>
> If a task attempt is not making progress then it may cause the job to hang.
> We may want to kill and restart the attempt. With speculation support and
> free resources we may want to run another version in parallel.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)