Lightweight cluster management may be ideal in terms of ease of use, but if it doesn’t fit within the overall cluster/resource management infrastructure where Kylin is deployed, you would end up creating small islands of partitioned resources…
My personal preference would be (1) Mesos/Marathon, which has a larger installed base, or (2) Kubernetes, based on the momentum behind it, and should be optional for small-scale deployments...

On 11/3/15, 5:39 AM, "Adunuthula, Seshu" <[email protected]> wrote:

>What about Kubernetes? More heavyweight, but eBay overall is putting all
>its eggs into that basket :-)
>http://googlecloudplatform.blogspot.com/2015/07/Kubernetes-V1-Released.html
>
>
>On 11/3/15, 5:26 AM, "Ted Dunning" <[email protected]> wrote:
>
>>You need to also decide whether Helix has momentum. The low traffic on the
>>mailing list may be because the project is mature, but may also mean a small
>>user base.
>>
>>That is a very hard call to make.
>>
>>
>>On Tue, Nov 3, 2015 at 2:00 AM, Li Yang <[email protected]> wrote:
>>
>>> One important goal is that cluster management should be as simple as
>>> possible and introduce minimal dependencies. This will allow Kylin to be
>>> self-contained, easy to deploy, and able to run on different computing
>>> platforms. Especially, we don't want to be locked in to any platform.
>>>
>>> Helix is good as it's just a lightweight 700 KB lib and the only dependency
>>> is ZooKeeper (which everyone depends on nowadays).
>>>
>>> YARN is great but implies Hadoop, meaning Kylin may have trouble running on
>>> Spark alone.
>>>
>>> Marathon and Mesos are super great but seem like overkill. All Kylin needs at
>>> the moment is leader selection and task dispatching.
>>>
>>>
>>> On Tue, Nov 3, 2015 at 3:03 PM, Ted Dunning <[email protected]> wrote:
>>>
>>> > True, but it is one of the few options for running outside of a cluster.
>>> >
>>> >
>>> >
>>> > On Mon, Nov 2, 2015 at 10:43 PM, 周千昊 <[email protected]> wrote:
>>> >
>>> > > Marathon needs Mesos as an extra dependency. For those who use HDP or Cloudera
>>> > > as a sandbox for POC or some production env, Mesos does not seem to be a
>>> > > standard component. I think it might be a concern for the users.
>>> > >
>>> > > Luke Han <[email protected]> wrote on Tue, Nov 3, 2015 at 1:31 PM:
>>> > >
>>> > > > Very interesting one. Marathon can run on Mesos, which could also manage
>>> > > > containers...
>>> > > > Worth a try...
>>> > > >
>>> > > > https://github.com/mesosphere/marathon
>>> > > >
>>> > > >
>>> > > > Best Regards!
>>> > > > ---------------------
>>> > > >
>>> > > > Luke Han
>>> > > >
>>> > > > On Tue, Nov 3, 2015 at 1:12 PM, Ted Dunning <[email protected]> wrote:
>>> > > >
>>> > > > > Try Marathon then.
>>> > > > >
>>> > > > >
>>> > > > >
>>> > > > > On Mon, Nov 2, 2015 at 5:06 PM, Luke Han <[email protected]> wrote:
>>> > > > >
>>> > > > > > Running on YARN requires running Kylin on "one" Hadoop cluster.
>>> > > > > >
>>> > > > > > Based on today's practices, Kylin's services always sit beside clusters,
>>> > > > > > not inside, so that just one Kylin deployment can interact with more
>>> > > > > > than 2 clusters.
>>> > > > > >
>>> > > > > >
>>> > > > > > Best Regards!
>>> > > > > > ---------------------
>>> > > > > >
>>> > > > > > Luke Han
>>> > > > > >
>>> > > > > > On Tue, Nov 3, 2015 at 12:12 AM, Henry Saputra <[email protected]> wrote:
>>> > > > > >
>>> > > > > > > Why don't we just leverage Hadoop YARN?
>>> > > > > > >
>>> > > > > > > We could use Apache Twill [1] to wrap it as a YARN application.
>>> > > > > > >
>>> > > > > > > Helix is great, but one of its limitations is that it cannot provision a new
>>> > > > > > > node when one is not already available.
>>> > > > > > >
>>> > > > > > > [1] http://twill.incubator.apache.org/
>>> > > > > > >
>>> > > > > > > On Sun, Nov 1, 2015 at 7:01 PM, 周千昊 <[email protected]> wrote:
>>> > > > > > > > Hi, all
>>> > > > > > > > Kylin currently manages the cluster itself. So far it works fine,
>>> > > > > > > > since there are only 2 roles in the cluster: job engine and query engine.
>>> > > > > > > > In order to reduce the complexity of deployment and support the coming
>>> > > > > > > > streaming feature, I suggest Kylin have better cluster management.
>>> > > > > > > > There are several candidates I think can be leveraged. Apache Helix
>>> > > > > > > > seems suitable for Kylin in my opinion.
>>> > > > > > > > I would love to start a discussion about Kylin's cluster manager:
>>> > > > > > > > how we can do it in a better way, and which component we should choose.
