[ 
https://issues.apache.org/jira/browse/YARN-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Subru Krishnan updated YARN-4879:
---------------------------------
    Attachment: SimpleAllocateProtocolProposal-v2.pdf

Thanks [~kasha], [~leftnoteasy], [~vinodkv] and [[email protected]] for taking 
a look at our proposal. PFA updated doc (v2) that incorporates your feedback.

A few additional clarifications (have addressed rest of comments directly in 
the updated doc):

bq. While making these changes, would it possible to address YARN-314 too? 
bq. I'm okay if we can get two in a shot, but I'd caution against risking this 
effort by blowing up the size.

We will address YARN-314 as long as applications specify the Request-ID as they 
can request for multiple containers at same priority through independent 
requests.

bq. Why are we putting priority semantics onto the ID? We should just follow 
the existing priority ordering.

We will continue to follow the existing priority ordering. But as explained 
above, with the proposed enhancement user can potentially make multiple 
requests at same priority (YARN-314). In such a scenario, we will simply 
allocate containers in FIFO order.

bq. BTW, for the federation related issue, does the client-library need to 
always generate these IDs? How does that interact with application generated 
IDs?

In Federation also, we expect applications to generate the IDs. For e.g.: we 
will work with the REEF team (and the long running service AM proposed as part 
of YARN-4692) to start specifying IDs for their allocation requests.

bq. I would also like to see if the allocated containers could support a role 
ID field too...nothing much, but enough that on an AM restart their role can be 
determined. That one, I'd keep separate from the request ID; they serve 
slightly different purposes. (I could have 5 requests outstanding for 
containers of role 4; I'd want to track those requests)

I agree that having an explicit role ID is useful but feel its outside the 
scope of this JIRA which IIUC is what you are also observing. I think adding a 
role ID should be part of YARN-4692/YARN-4902.

> Enhance Allocate Protocol to Identify Requests Explicitly
> ---------------------------------------------------------
>
>                 Key: YARN-4879
>                 URL: https://issues.apache.org/jira/browse/YARN-4879
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: applications, resourcemanager
>            Reporter: Subru Krishnan
>            Assignee: Subru Krishnan
>         Attachments: SimpleAllocateProtocolProposal-v1.pdf, 
> SimpleAllocateProtocolProposal-v2.pdf
>
>
> For legacy reasons, the current allocate protocol expects expanded requests 
> which represent the cumulative request for any change in resource 
> constraints. This is not only very difficult to comprehend but makes it 
> impossible for the scheduler to associate container allocations to the 
> original requests. This problem is amplified by the fact that the expansion 
> is managed by the AMRMClient which makes it cumbersome for non-Java clients 
> as they all have to replicate the non-trivial logic. In this JIRA, we are 
> proposing enhancement to the Allocate Protocol to allow AMs to identify 
> requests explicitly.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to