John Barker created AIRFLOW-1885:
------------------------------------

             Summary: Exception when polling ready workers and a gunicorn 
worker becomes a zombie
                 Key: AIRFLOW-1885
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-1885
             Project: Apache Airflow
          Issue Type: Bug
    Affects Versions: Airflow 1.8
            Reporter: John Barker


If one of the gunicorn workers happens to become a zombie between `children()` 
and `cmdline()` calls to psutil in `get_num_ready_workers_running`:

{code}
Traceback (most recent call last):
  File "/usr/local/bin/airflow", line 28, in <module>
    args.func(args)
  File "/usr/local/lib/python3.5/dist-packages/airflow/bin/cli.py", line 803, 
in webserver
    restart_workers(gunicorn_master_proc, num_workers)
  File "/usr/local/lib/python3.5/dist-packages/airflow/bin/cli.py", line 687, 
in restart_workers
    num_ready_workers_running = 
get_num_ready_workers_running(gunicorn_master_proc)
  File "/usr/local/lib/python3.5/dist-packages/airflow/bin/cli.py", line 663, 
in get_num_ready_workers_running
    proc for proc in workers
  File "/usr/local/lib/python3.5/dist-packages/airflow/bin/cli.py", line 664, 
in <listcomp>
    if settings.GUNICORN_WORKER_READY_PREFIX in proc.cmdline()[0]
IndexError: list index out of range
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to