[
https://issues.apache.org/jira/browse/TEZ-3296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15334623#comment-15334623
]
Bikas Saha edited comment on TEZ-3296 at 6/16/16 8:29 PM:
----------------------------------------------------------
Ah. Looks like a result of using priority as a key for unique requests vs using
it a just priority. Sigh! /cc [~vinodkv] [~wangda]
was (Author: bikassaha):
Ah. Looks like a result of using priority as a key for unique requests vs using
it a just priority.
> Tez job can hang if two vertices at the same root distance have different
> task requirements
> -------------------------------------------------------------------------------------------
>
> Key: TEZ-3296
> URL: https://issues.apache.org/jira/browse/TEZ-3296
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.7.1
> Reporter: Jason Lowe
> Assignee: Jason Lowe
> Priority: Critical
> Fix For: 0.7.2, 0.9.0, 0.8.4
>
> Attachments: TEZ-3296.001.patch
>
>
> When two vertices have the same distance from the root Tez will schedule
> containers with the same priority. However those vertices could have
> different task requirements and therefore different capabilities. As
> documented in YARN-314, YARN currently doesn't support requests for multiple
> sizes at the same priority. In practice this leads to one vertex allocation
> requests clobbering the other, and that can result in a situation where the
> Tez AM is waiting on containers it will never receive from the RM.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)