Devs,

I've got an initial prototype going on this project. During this process I noticed that much of what I needed to do ended up duplicating code that is in the twill-yarn project. I'm curious if the community would be open to refactoring that moved some of this code such that it could be shared by different clustering control implementations. For instance, much of the code in YarnTwillPreparer, YarnContainerMain, YarnServiceMain, etc is not really yarn specific.

Thoughts?

Thanks,
Martin

On 02/01/2017 04:13 PM, Martin Serrano wrote:
Hey Devs,

I've been evangelizing Twill at here at my company. We have a for several years had a basic clustering solution for running our system and its components across multiple hosts. As we've worked to migrate the platform to Hadoop and YARN, we have started to move our cluster control to be Yarn-based. And my goal is to make it Twill-based. However as we do this work we anticipate the need to support customers that do not have and do not want to operate Hadoop infrastructure.

I'm working on an architecture which would use Twill for command and control for all of our services. For customers that did not want the full enterprise capabilities that come with Hadoop we could continue to offer our basic clustering support (albeit with reduced capabilities) by plugging our clustering solution into Twill. One of the aspects of Twill that interested me from the start was that the control API was abstracted from YARN.

A couple questions:

1) Do any plugins for other clustering backends already exist? Open source? Commercial?

2) If we were to do this, would the Twill community be interested in a donation of this code to the Twill project?

I recall that Henry and I had a conversation at the last Apache Big Data that there had been talk of plugins for other clustering backends but I don't know if anything ever came of that.

Cheers,
Martin Serrano


Reply via email to