Kevin Sweeney created AURORA-222:
------------------------------------
Summary: Updater should run health checks in parallel
Key: AURORA-222
URL: https://issues.apache.org/jira/browse/AURORA-222
Project: Aurora
Issue Type: Story
Components: Client
Reporter: Kevin Sweeney
Right now, when the Updater is operating on a batch of instances that are
configured for health checking, it runs their health checks sequentially,
one instance at a time [1]. The checks do not depend on each other, so this
could be trivially parallelized, resulting in a speedup of up to the batch size.
[1] https://github.com/apache/incubator-aurora/blob/master/src/main/python/apache/aurora/client/api/instance_watcher.py#L106
{code}
for instance_id in instance_ids:
  if instance_id not in finished_instances():
    running_task = tasks_by_instance.get(instance_id)
    if running_task is not None:
      task_healthy, retriable = health_check.health(running_task)
      if task_healthy:
        set_instance_healthy(instance_id, now)
      else:
        maybe_set_instance_unhealthy(instance_id, retriable)
    else:
      # Set retriable=True since an instance should be retried if it has
      # not been healthy.
      maybe_set_instance_unhealthy(instance_id, retriable=True)
{code}
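One possible shape for the parallel version, as a rough sketch only and not a proposed patch: it assumes concurrent.futures is available (standard library on Python 3, the "futures" backport on Python 2) and that health_check.health is safe to call from several threads at once. The names check_batch, health_fn and max_workers are hypothetical and do not exist in instance_watcher.py. The caller is assumed to pass only instance ids that are not yet in finished_instances().

```python
from concurrent.futures import ThreadPoolExecutor


def check_batch(instance_ids, tasks_by_instance, health_fn, max_workers=None):
    """Run health_fn for every instance that has a running task, in parallel.

    Hypothetical helper. Returns {instance_id: (task_healthy, retriable)}.
    Instances with no running task are reported as (False, True), which
    mirrors the retriable=True branch of the sequential loop.
    """
    results = {}
    to_check = {}
    for instance_id in instance_ids:
        running_task = tasks_by_instance.get(instance_id)
        if running_task is None:
            results[instance_id] = (False, True)
        else:
            to_check[instance_id] = running_task

    if to_check:
        # One worker per instance by default, so the whole batch is checked
        # concurrently; pass max_workers to cap the thread count.
        workers = max_workers or len(to_check)
        with ThreadPoolExecutor(max_workers=workers) as pool:
            futures = {
                instance_id: pool.submit(health_fn, task)
                for instance_id, task in to_check.items()
            }
            # result() blocks until that check finishes and re-raises any
            # exception thrown inside health_fn.
            for instance_id, future in futures.items():
                results[instance_id] = future.result()
    return results
```

The existing loop body would then shrink to iterating over the returned dict and calling set_instance_healthy or maybe_set_instance_unhealthy from the main thread, so the watcher's bookkeeping stays single-threaded and only the health checks themselves run concurrently.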