Kevin Sweeney created AURORA-222:
------------------------------------

             Summary: Updater should run health checks in parallel
                 Key: AURORA-222
                 URL: https://issues.apache.org/jira/browse/AURORA-222
             Project: Aurora
          Issue Type: Story
          Components: Client
            Reporter: Kevin Sweeney


Right now when the Updater is operating on a batch of instances and those 
instances are configured for health checking it runs their health checks 
sequentially one-instance-at-a-time [1]. This could be trivially parallelized, 
resulting in a speedup of up to the batch size.

[1] 
https://github.com/apache/incubator-aurora/blob/master/src/main/python/apache/aurora/client/api/instance_watcher.py#L106
{code}
      for instance_id in instance_ids:
        if instance_id not in finished_instances():
          running_task = tasks_by_instance.get(instance_id)
          if running_task is not None:
            task_healthy, retriable = health_check.health(running_task)
            if task_healthy:
              set_instance_healthy(instance_id, now)
            else:
              maybe_set_instance_unhealthy(instance_id, retriable)
          else:
            # Set retriable=True since an instance should be retried if it has 
not been healthy.
            maybe_set_instance_unhealthy(instance_id, retriable=True)
{code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to