[ 
https://issues.apache.org/jira/browse/FLINK-19324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xintong Song reassigned FLINK-19324:
------------------------------------

    Assignee: Xintong Song

> Map requested/allocated containers with priority on YARN
> --------------------------------------------------------
>
>                 Key: FLINK-19324
>                 URL: https://issues.apache.org/jira/browse/FLINK-19324
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>            Reporter: Xintong Song
>            Assignee: Xintong Song
>            Priority: Major
>
> In the design doc of FLINK-14106, there was a 
> [discussion|https://docs.google.com/document/d/1f8imSus3QwKEUPAldzR8CSMjZ-2a9O17-rn4oeKGtqw/edit?disco=AAAAGPX_tmg]
>  on how we map allocated containers with the requested ones on YARN. We 
> rejected the design option that uses container priorities for mapping 
> containers of different resources, because we do not want to priorities 
> different container requests (which is the original purpose for this field). 
> As a result, we have to interpret how the requested container request would 
> be normalized by Yarn, and map the allocated/requested containers 
> accordingly, which is complicated and fragile. See also FLINK-19151.
> Recently in our POC for fine grained resource management, we surprisingly 
> discovered that Yarn actually doesn't work with container requests same 
> priority and different resources. I do not find this described as an official 
> protocol in any Yarn's documents. The issue has been raised in early Yarn 
> versions (YARN-314) and has not been fixed util Hadoop 2.9 when 
> {{allocationRequestId}} is introduced. In Hadoop 2.8, Yarn scheduler is still 
> internally using priority as the key of a container request (see 
> [AppSchedulingInfo#updateResourceRequests 
> |https://github.com/apache/hadoop/blob/eb818cdc64336ade273a960ba3b9b5a5d0c4d4ec/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/AppSchedulingInfo.java#L341]),
>  thus requests same priority and different resources would overwrite each 
> other.
> The new discovery suggests that, if we want to support containers with 
> different resources on Hadoop 2.8 and earlier versions, we have to give them 
> different priorities anyway. Thus, I would suggest to get rid of the 
> container normalization simulation and go back to the previously rejected 
> priority based design option.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to