[ https://issues.apache.org/jira/browse/TEZ-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tsuyoshi Ozawa updated TEZ-3010:
--------------------------------
Attachment: TEZ-3010.001.patch
Attaching a first patch that adds retry semantics to TezChild. This patch is
built on top of the patch for TEZ-3009.
> Container task acquisition has no retries for errors
> ----------------------------------------------------
>
> Key: TEZ-3010
> URL: https://issues.apache.org/jira/browse/TEZ-3010
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.7.0
> Reporter: Jason Lowe
> Attachments: TEZ-3010.001.patch
>
>
> There are no retries for errors that occur during task acquisition. If any
> error occurs, the container simply shuts down, resulting in task attempt
> failures if a task attempt happened to be assigned to the container by the
> AM. The container should try harder to obtain the task before giving up.
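
To illustrate the kind of behavior the description asks for, here is a minimal sketch of a bounded retry loop with exponential backoff around a fallible call. It is not the contents of TEZ-3010.001.patch and does not use any Tez API: the class name TaskAcquisitionRetrySketch, the method callWithRetries, and its parameters (maxAttempts, initialBackoffMs) are all hypothetical names chosen for this example.

```java
import java.util.concurrent.Callable;

/** Hypothetical helper; not part of Tez. */
public final class TaskAcquisitionRetrySketch {

  private TaskAcquisitionRetrySketch() {}

  /**
   * Calls op until it succeeds or maxAttempts attempts have failed.
   * Sleeps between failed attempts, doubling the delay each time.
   * Rethrows the last failure once the attempts are exhausted.
   */
  public static <T> T callWithRetries(Callable<T> op, int maxAttempts, long initialBackoffMs)
      throws Exception {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    long backoffMs = initialBackoffMs;
    Exception last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return op.call();
      } catch (InterruptedException e) {
        // Do not retry on interruption; restore the flag and give up.
        Thread.currentThread().interrupt();
        throw e;
      } catch (Exception e) {
        last = e;
        if (attempt < maxAttempts) {
          Thread.sleep(backoffMs);
          backoffMs *= 2;
        }
      }
    }
    throw last;
  }

  public static void main(String[] args) throws Exception {
    final int[] calls = {0};
    // Stand-in for a task request that fails twice before succeeding.
    String task = callWithRetries(() -> {
      if (++calls[0] < 3) {
        throw new java.io.IOException("transient failure");
      }
      return "task-attempt";
    }, 5, 10L);
    System.out.println(task + " acquired after " + calls[0] + " calls");
  }
}
```

With a loop like this, a container would shut down only after every attempt has failed, rather than on the first error. Which errors are actually safe to retry, and how many attempts are reasonable, are decisions for the real patch.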
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)