Thanks Subru, Carlo, Dharmesh and Chris!

Overall a great feature and a great contribution. I've been involved in various 
design discussions and did a bunch of reviews. The APIs are great, the 
implementation is too barring few last rush items like persistence. We can do 
these in the follow-up.

+1 binding.

Seeing this as a vote for trunk, let's merge this to trunk first, get a couple 
of clean builds and test-runs before we move it to branch-2.

This will be a great alpha feature for many users! Thanks again!

+Vinod

On Sep 30, 2014, at 11:40 AM, Chris Douglas <[email protected]> wrote:

> +1
> 
> Excellent work, Carlo and Subru. -C
> 
> On Fri, Sep 26, 2014 at 11:50 AM, Carlo Curino <[email protected]> wrote:
>> (Apologies if it is delivered twice.)
>> 
>> YARN Devs,
>> 
>> We propose to merge YARN-1051 development branch into trunk.
>> 
>> Key Idea:
>> This work adds support for Reservations to YARN RM. The key idea is to allow 
>> users to request dedicated access to resources (a reservation), ahead of 
>> time.
>> For example I can ask for "10 containers for 1 hour sometime between 4pm and 
>> 9pm today".  The RM keeps track of the accepted reservation by means of
>> a Plan (think it as an agenda on how the  cluster resources will be used), 
>> and performs admission control to guarantee that if a reservation is 
>> accepted enough
>> resources are set aside to satisfy it.  We enforce the reservation promises 
>> by dynamically creating/resizing/removing queues at the right time. This 
>> allows us
>> to leverage the existing schedulers for the actual container assignment and 
>> tracking. The key benefit is to expose to the scheduler flexibility of 
>> allocation, while
>> guaranteeing users predictable resource allocation.
>> 
>> Status
>> 
>> *         The work has been "broken down" into 14 subtasks (+3 patches 
>> already committed to trunk for move/kill of apps). All the issues have been 
>> resolved.
>> 
>> *         Jenkins +1 the patch (with the exception of one test failure which 
>> we did not introduce, which is tracked here: 
>> https://issues.apache.org/jira/browse/MAPREDUCE-6094)
>> 
>> *         Simple integration with MapReduce: 
>> https://issues.apache.org/jira/browse/MAPREDUCE-6103
>> 
>> *         The broken-down patches have been reviewed and +1ed by Vinod Kumar 
>> Vavilapali, Jian He, Wangda Tan, Karthik Kambatla, and Chris Douglas. Thanks 
>> to all of you for the thorough reviews!
>> 
>> *         The current version has been rather thoroughly tested by running 
>> it on our 250 machines research cluster for months (first prototype was 
>> operational about a year ago) by:
>> 
>> o   Running hundreds of thousands of job generate by a modified version of 
>> gridmix that exercise the reservations mechanism side-by-side normal queues.
>> 
>> o   To support our integration with the resource estimation framework 
>> Perforator (http://research.microsoft.com/pubs/178971/perforator.pdf). 
>> Kaushik and Dharmesh have been pounding the reservation system for their 
>> research for 3-4 months now, and helped us spot few bugs and iron them out.
>> 
>> o   Code has been inspected/extended by 4-5 other researchers which are 
>> exploring integration with other systems and extensions of our algorithms 
>> for "reservation placement".
>> 
>> *         We have few ideas for follow-up extensions/improvements are 
>> tracked by the umbrella JIRA https://issues.apache.org/jira/browse/YARN-2572
>> 
>> Documents and Deliverables
>> 
>> *         This work was accepted for publication to SoCC 2014 (pre-camera 
>> ready version of the paper here):   
>> https://issues.apache.org/jira/secure/attachment/12671498/socc14-paper15.pdf
>> 
>> *         Shorter design doc: 
>> https://issues.apache.org/jira/secure/attachment/12628330/YARN-1051-design.pdf
>> 
>> *         Overall patch: 
>> https://issues.apache.org/jira/secure/attachment/12671361/YARN-1051.1.patch
>> 
>> *         Per Karthik request we are preparing a small how-to document and 
>> example code/configuration tracked by 
>> https://issues.apache.org/jira/browse/YARN-2609
>> 
>> 
>> Credits
>> Myself and Subru did lots of the coding (hence the flow of patches from us), 
>> but this is a group effort that could have not been possible without the 
>> ideas and hard work of many other
>> folks in our research group (Microsoft-CISL). Major kudos to:  Chris 
>> Douglas, Sriram Rao, Raghu Ramakrishnan, and our intern Djellel Difallah. 
>> Also big thanks to the many folks in community  (Arun, Vinod, Alejandro, 
>> Bikas, Karthik, Sandy, Hitesh, Jakob, Mohammad, Mayank, Jason, Bobby, and 
>> many more) that helped us shape our ideas and code with very insightful 
>> feedback and comments.
>> 
>> We expect the vote to run for the usual 7 days and will expire at 12pm PDT 
>> on Oct 3. Please feel free to reach out to us if you have any 
>> questions/doubts.
>> 
>> Cheers,
>> Carlo & Subru
>> 

Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail

Reply via email to