Very good points Josh and Lei. I totally agree with you on the importance
of the app-packages.

Given that YARN-4692 is all about long-running apps/services and primarily
focused around Docker, apps are at its core. I guess the Apple App Store
and its devices can be only as great as its apps. Having said that, I am
pretty sure sample apps have to be provided by the platform. The samples
and guidelines will then help the app-owner community to build more and
more. YARN-4692 design doc already talks about the concept of image stores
in section (3.3.3) Packaging.

Naturally, the existing Slider apps need a home, and I am sure will be of
prime importance when we take the discussion to the Hadoop community, once
we have a consensus here.

-Gour

On 3/16/16, 9:45 PM, "Lei Guo" <lei...@huawei.com> wrote:

>Some quick thoughts regarding the app-packages topic, I was the owner of
>a cluster management product development a few years ago, the product can
>help to build a HPC cluster within 15-30 minutes from bare metal. The
>major challenge for me was to define the boundary between product feature
>and best practice. Each customer is different, from driver required for
>HW to the IP address planning, there were so many miscellaneous
>requirements on product enhancements. If we put everything in product, it
>may triple our delivery effort and each small feature may can only help a
>couple of customers. We had to try our best to generalize the
>requirements as product feature and put more best practice guide or
>sample in community/support channel.
>
>Back to the topic on app-packages, I see it's in similar position as
>cookbook/recipe in Chef. It's good enough for some basic customer cases,
>but must be customized/enhanced to support advanced customer cases. So
>basic app-packages could be as a sample in Yarn package, but advanced
>case or customization should only be published in certain community way
>as it does need the customer's effort to make it fit certain environment.
>
>Just my two cents,
>Lei
>
>-----Original Message-----
>From: Josh Elser [mailto:els...@apache.org]
>Sent: Wednesday, March 16, 2016 10:58 PM
>To: dev@slider.incubator.apache.org
>Subject: Re: [DISCUSS] Next Steps for Slider & First-Class Services in
>YARN (YARN-4692)
>
>Thanks for the proposal, Gour! Interesting thought.
>
>I think it makes sense. As YARN is maturing, long-lived services
>becoming a primitive is a natural progression. Slider is likely at the
>forefront of building such a primitive on YARN (from a lot of great
>planning/design from Steve).
>
>I think this would definitely be an interesting conversation to be had
>with YARN (if the other podling members are of the same mindset). I
>think how this plays out would require a bit of planning/coordination
>from the Hadoop PMC side.
>
>Now, there is the other half of Slider: the app-packages. My gut
>reaction is that YARN would have no interest in owning/maintaining
>these. This is a bit concerning to me because Slider on its own really
>isn't that exciting. It's the app-packages that make it so enticing --
>build a zip, install it to your cluster, and suddenly users can start
>dynamically creating clusters (HBase, Accumulo, Storm, etc). I would be
>strongly opposed to any plan to merge Slider into YARN/Hadoop without a
>clear path forward on where the app-packages would live. This is
>extremely important to me.
>
>I'd love to see where this conversation can go.
>
>- Josh
>
>Gour Saha wrote:
>> Slider community,
>>
>>
>> The YARN team is discussing in
>>YARN-4692<https://issues.apache.org/jira/browse/YARN-4692>  on how to
>>add "first class services" directly to YARN. Some of the names in the
>>discussion document should be familiar: that's because Slider is
>>essentially the original long-lived application in YARN.
>>
>>
>> With YARN-4692<https://issues.apache.org/jira/browse/YARN-4692>, it is
>>apparent that the Apache Hadoop YARN community is working towards
>>providing direct support for long-lived services. I think we need to
>>look at that proposal and think "where and how does Slider relate to
>>this".
>>
>>
>> Apache Slider (incubating) has been in the business of creating and
>>managing long-running services in YARN for a couple of years. Today it
>>is being used in production YARN clusters across several companies (big
>>and small). Several production-grade applications (data and non-data)
>>are available as sample packages. A good number of them have been
>>contributed by interested parties like Lucidworks contributing a Solr
>>Slider Application Package and DataTorrent contributing a Kafka Slider
>>Application Package.
>>
>>
>> Slider has been pretty good at taking existing applications and turning
>>them into long-lived services in YARN. YARN offers the core scheduling,
>>execution and failure reporting functions; slider takes that and adds:
>>advanced container placement (history; anti-affine, escalation
>>policies), configuration, dynamic binding, monitoring, failure handling,
>>and an API for clients. It's also driven a lot of the
>>YARN-896<https://issues.apache.org/jira/browse/YARN-896>  "long-lived
>>services" development: long-lived failure resilience, the YARN registry,
>>container-preservation over YARN restarts. Big chunks of that code
>>actually came from the Slider team. This was always a goal of the work
>>even in its Hoya predecessor: show that YARN can be used to host
>>applications like HBase, and identify where it can be be improved.
>>
>>
>> What does it mean for Slider if YARN starts doing this directly?
>>
>>
>> Slider provides a lot of the basic functionalities for long-running
>>services proposed in  YARN-4692. It is a universal YARN app-master and
>>lets application-owners focus on their application functionalities,
>>while it handles the internals of orchestrating services on YARN.
>>
>>
>> Which means: we have an opportunity here to contribute the core of
>>slider into YARN itself, and, with it in YARN, use it as the basis for
>>the full TODO-list of YARN-4692.
>>
>>
>> The YARN team gets the stable codebase that's evolved over the past few
>>years: something to deploy applications in a YARN cluster. What does
>>Slider get? We'd get to be the foundation for long lived YARN services
>>with the new work on top.
>>
>>
>> Would this work? What's wrong with the idea? How do we do it if we want
>>to go with it?
>>
>>
>> I would like to call upon the community to weigh in their thoughts and
>>opinions on this topic.
>>
>> -Gour
>>
>>
>

Reply via email to