[
https://issues.apache.org/jira/browse/TEZ-2581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979729#comment-14979729
]
Bikas Saha commented on TEZ-2581:
---------------------------------
Also renaming it to RecoverCompletedDAGTransition would help. This is a
slippery slope though. Today we don't send task info to clients, but once we
start doing that we would need to recover tasks too.
Does the client transition to using ATS for completed DAGs? If yes, then
perhaps we don't need this, since the client will read the completed DAG's
status from ATS instead of from the AM.
In this same context, what happens to ATS data? The AM may have crashed after
the DAG completed but before its ATS data was written (e.g. slow ATS). So we
may recover the completed DAG while ATS still reports the DAG as running,
since we never sent it a DAG-completed ATS event.
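
To make the ATS concern concrete, here is a minimal sketch of one possible
reconciliation step on AM restart: if the recovery log says the DAG completed
but the finish event was never acknowledged, re-send it. Everything in it
(CompletedDagRecoverySketch, DagState, HistorySink, reconcile, the finishAcked
flag, the event string) is hypothetical and made up for illustration; none of
it is the Tez or ATS API, and it is not what the patch does.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch; none of these names are Tez or ATS APIs. */
public class CompletedDagRecoverySketch {

    /** DAG state as read back from an assumed recovery log. */
    enum DagState { RUNNING, SUCCEEDED, FAILED, KILLED }

    /** Minimal in-memory stand-in for a history sink such as ATS. */
    static class HistorySink {
        final List<String> events = new ArrayList<>();

        void publish(String event) {
            events.add(event);
        }
    }

    /**
     * Run by a restarted AM for a DAG found in the recovery log.
     * finishAcked is an assumed flag recording whether the sink confirmed
     * the DAG-finished event before the crash.
     * Returns true if a DAG-finished event was re-sent.
     */
    static boolean reconcile(String dagId, DagState recovered,
                             boolean finishAcked, HistorySink sink) {
        boolean completed = recovered != DagState.RUNNING;
        if (completed && !finishAcked) {
            // The sink may still show the DAG as running, so publish the
            // terminal state again.
            sink.publish("DAG_FINISHED:" + dagId + ":" + recovered);
            return true;
        }
        return false;
    }
}
```

This assumes the history sink tolerates a duplicate finish event, since the
original write may have landed just before the crash without being recorded
as acknowledged.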
> Umbrella for Tez Recovery Redesign
> ----------------------------------
>
> Key: TEZ-2581
> URL: https://issues.apache.org/jira/browse/TEZ-2581
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Jeff Zhang
> Assignee: Jeff Zhang
> Attachments: TEZ-2581-WIP-1.patch, TEZ-2581-WIP-2.patch,
> TEZ-2581-WIP-3.patch, TEZ-2581-WIP-4.patch, TEZ-2581-WIP-5.patch,
> TEZ-2581-WIP-6.patch, TezRecoveryRedesignProposal.pdf,
> TezRecoveryRedesignV1.1.pdf
>
>