Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-14 Thread Vinod Kone
ersion of this, since deactivated agents >>> > would never have their resources offered to frameworks. This provides a >>> > blunt mechanism to prevent scheduling on nodes which have upcoming >>> > maintenance, although it sounds like you see some benefit to a more >&g

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-14 Thread Greg Mann
> With the new proposal, it's going to be as difficult as before to have > SLA-aware maintenances because it will need cooperation from the frameworks > anyway and we know this is rarely a priority for them. We will also lose > the ability to signal future maintenance in orde

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-14 Thread Greg Mann
s that we can finally set >> nodes down for maintenance with a configurable kill grace period and a >> proper task status (with maintenance primitives, it was TASK_LOST I think) >> without any specific cooperation from the frameworks. >> > >> > I think that this co

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-07 Thread Heer, Christoph
d in > priority, and then sort agents by maintenance date in decremasing order. This > would be a big improvement to prevent cluster reboots to trigger too many > task restarts. This will not be possible with the new primitives. The same > idea apply for frameworks too. > > M

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-07 Thread Maxime Brugidou
gt; >>> For example I had this idea to improve the allocator (or write a custom >>> one) that would offer resources from agents with no maintenance planned in >>> priority, and then sort agents by maintenance date in decremasing order. >>> This would be a

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-06 Thread Benjamin Mahler
e idea apply for frameworks too. >> >> Maxime >> >> Le jeu. 30 mai 2019 à 22:16, Joseph Wu a écrit : >> >>> As far as I can tell, the document is public. >>> >>> On Thu, May 30, 2019 at 12:22 AM Marc Roos >>> wrote: >>> >>&g

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-06 Thread Greg Mann
Marc Roos >> wrote: >> >>> >>> Is the doc not public? >>> >>> >>> -----Original Message- >>> From: Joseph Wu [mailto:jos...@mesosphere.io] >>> Sent: donderdag 30 mei 2019 2:07 >>> To: dev; user >>> Subject: Des

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-06-06 Thread Maxime Brugidou
ell, the document is public. > > On Thu, May 30, 2019 at 12:22 AM Marc Roos > wrote: > >> >> Is the doc not public? >> >> >> -Original Message- >> From: Joseph Wu [mailto:jos...@mesosphere.io] >> Sent: donderdag 30 mei 2019 2:

Re: Design doc: Agent draining and deprecation of maintenance primitives

2019-05-30 Thread Joseph Wu
Design doc: Agent draining and deprecation of maintenance > primitives > > Hi all, > > A few years back, we added some constructs called maintenance primitives > to Mesos. This feature was meant to allow operators and frameworks to > cooperate in draining tasks off nodes schedu

Design doc: Agent draining and deprecation of maintenance primitives

2019-05-29 Thread Joseph Wu
Hi all, A few years back, we added some constructs called maintenance primitives to Mesos. This feature was meant to allow operators and frameworks to cooperate in draining tasks off nodes scheduled for maintenance. As far as we've observed since, this feature never achieved enough adoption to b