Roland de Boo created AIRFLOW-5312:
--------------------------------------

             Summary: KubernetesPodOperator hangs when kubernetes API times out
                 Key: AIRFLOW-5312
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-5312
             Project: Apache Airflow
          Issue Type: Bug
          Components: contrib
    Affects Versions: 1.10.4
            Reporter: Roland de Boo


The KubernetesPodOperator launches a Pod in Kubernetes and then continues to 
watch its state and stream its logs. This is done via the Kubernetes API. Not 
all calls to the Kubernetes API define a timeout, so when the API does not 
respond the operator waits indefinitely and the task hangs.

This is the same issue as https://issues.apache.org/jira/browse/AIRFLOW-5282 
but now in the KubernetesPodOperator.

The solution is to add a timeout to all API calls and to handle the resulting 
errors (catching them and retrying with tenacity where appropriate).
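
A minimal sketch of that pattern, using only the standard library so that it 
runs on its own. Everything named here (ApiTimeout, call_with_retry, 
FlakyPodReader) is hypothetical and is not the operator's actual code. In the 
operator itself, the timeout would presumably be passed to the Kubernetes 
Python client (its API methods accept a _request_timeout argument), and the 
retry loop could be replaced by tenacity.

```python
import time


class ApiTimeout(Exception):
    """Hypothetical stand-in for the timeout error a real client raises."""


def call_with_retry(api_call, *, timeout, attempts=3, backoff=0.0):
    """Call api_call(timeout=...) up to `attempts` times.

    Retries only on ApiTimeout and re-raises the last error once the
    attempts are exhausted, so the caller fails instead of hanging.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            # Every call carries an explicit timeout; none may block forever.
            return api_call(timeout=timeout)
        except ApiTimeout as err:
            last_error = err
            if attempt < attempts:
                # Linear backoff between attempts (0 disables the pause).
                time.sleep(backoff * attempt)
    raise last_error


class FlakyPodReader:
    """Hypothetical API call that times out a fixed number of times."""

    def __init__(self, failures):
        self.failures = failures
        self.calls = 0

    def __call__(self, timeout):
        self.calls += 1
        if self.calls <= self.failures:
            raise ApiTimeout(f"no response within {timeout}s")
        return "Running"


if __name__ == "__main__":
    reader = FlakyPodReader(failures=2)
    print(call_with_retry(reader, timeout=5, attempts=3))
```

With two simulated timeouts and three attempts, the third call succeeds. With 
more failures than attempts, ApiTimeout reaches the caller, which can then 
mark the task as failed.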

--
This message was sent by Atlassian Jira
(v8.3.2#803003)
