Maxim Khutornenko created AURORA-894:
----------------------------------------

             Summary: Server updater should watch healthy instances
                 Key: AURORA-894
                 URL: https://issues.apache.org/jira/browse/AURORA-894
             Project: Aurora
          Issue Type: Task
          Components: Scheduler
            Reporter: Maxim Khutornenko


Instead of starting the {{minWaitInInstanceRunningMs}} (aka {{watch_secs}}) 
countdown when an instance reaches RUNNING state, the updater should rely on 
the first successful health check instead. This will potentially speed up 
updates as the {{minWaitInInstanceRunningMs}} will no longer have to be chosen 
based on the worst observed instance startup/warmup delay but rather as a 
desired health check duration according to the following formula:
{noformat}
minWaitInInstanceRunningMs = interval_secs x num_desired_healthchecks x 1000
{noformat}
where:
  {{interval_secs}} - 
https://github.com/apache/incubator-aurora/blob/master/docs/configuration-reference.md#healthcheckconfig-objects
  {{num_desired_healthchecks}} - the desired number of OK health checks to 
observe before declaring an instance updated successfully
  
The above would allow every instance to start watching interval depending on 
the individual instance performance and potentially exit updater earlier. This 
feature requires AURORA-279.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to