[ 
https://issues.apache.org/jira/browse/YARN-624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13732188#comment-13732188
 ] 

Steve Loughran commented on YARN-624:
-------------------------------------

[~bikassaha] has pointed out that gang scheduling can be implemented in an AM 
today: it can just hang on to assigned nodes until a minimum has been 
allocated, at which point it can bring up its service.

This would move all scheduling decisions into the AM, which is free to 
implement it own policy.

If we start this way, then once a few apps have done this we can look at 
commonality and decide what features -if any- need to go into YARN.

The most likely - again, credit to Bikas - is for YARN to recognise that an AM 
has been given some containers but not, after a time period, deployed anything 
to them. YARN could then cancel the lease unless the AM specifically indicates 
it wants to retain it.
                
> Support gang scheduling in the AM RM protocol
> ---------------------------------------------
>
>                 Key: YARN-624
>                 URL: https://issues.apache.org/jira/browse/YARN-624
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: api, scheduler
>    Affects Versions: 2.0.4-alpha
>            Reporter: Sandy Ryza
>            Assignee: Sandy Ryza
>
> Per discussion on YARN-392 and elsewhere, gang scheduling, in which a 
> scheduler runs a set of tasks when they can all be run at the same time, 
> would be a useful feature for YARN schedulers to support.
> Currently, AMs can approximate this by holding on to containers until they 
> get all the ones they need.  However, this lends itself to deadlocks when 
> different AMs are waiting on the same containers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to