Re: Detecting Flapping Tasks in Aurora

2018-02-02 Thread Meghdoot bhattacharya
Nice. > On Feb 2, 2018, at 7:12 AM, Mauricio Garavaglia > wrote: > > > >> On Thu, Feb 1, 2018 at 8:15 PM, De, Bipra wrote: >> Hello Everyone, >> >> >> >> I am working on an alert system that will call Aurora APIs to detect jobs >> that have flapping tasks. It runs every hour. >> >> >

Re: kill task for unknown task id

2018-02-02 Thread Mohit Jaggi
Thanks Meghdoot. Yes reconciliation seems like a possible case. On Thu, Feb 1, 2018 at 9:39 PM, Meghdoot bhattacharya wrote: > Is it during implicit reconciliation when mesos master lists a set of > tasks that aurora does not recognize and hence kills them. Could be because > of a race between a

Re: Staggered deployments

2018-02-02 Thread Renan DelValle
Yup, I can take the lead on this, I'll work on a design doc first to get the story straight before I dive into coding. On Thu, Feb 1, 2018 at 10:28 PM, Meghdoot bhattacharya wrote: > Thx David. Renan can you take a lead on this? > > Yeah we do have separate orchestration for multi aurora cluster

Re: Detecting Flapping Tasks in Aurora

2018-02-02 Thread Mauricio Garavaglia
On Thu, Feb 1, 2018 at 8:15 PM, De, Bipra wrote: > Hello Everyone, > > > > I am working on an alert system that will call Aurora APIs to detect jobs > that have flapping tasks. It runs every hour. > > > > Any suggestions on how to detect such jobs that have tasks flapping, > provided those tasks