[
https://issues.apache.org/jira/browse/AIRFLOW-5447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16928788#comment-16928788
]
Henry Cohen commented on AIRFLOW-5447:
--------------------------------------
```[2019-09-12 17:56:03,034] \{{kubernetes_executor.py:698}} INFO -
TaskInstance: <TaskInstance: tutorial.print_date 2019-09-10 00:00:00+00:00
[queued]> found in queued state but was not launched, rescheduling
[2019-09-12 17:56:03,043] \{{scheduler_job.py:1376}} INFO - Resetting orphaned
tasks for active dag runs
[2019-09-12 17:56:03,085] \{{base_job.py:308}} INFO - Reset the following 30
TaskInstances:
<TaskInstance: latest_only.latest_only 2019-09-10 04:00:00+00:00 [None]>
<TaskInstance: example_branch_operator.run_this_first 2019-09-11
00:00:00+00:00 [None]>
<TaskInstance: tutorial.print_date 2019-09-11 00:00:00+00:00 [None]>
<TaskInstance: example_skip_dag.skip_operator_2 2019-09-11 00:00:00+00:00
[None]>
<TaskInstance: example_skip_dag.skip_operator_1 2019-09-11 00:00:00+00:00
[None]>
<TaskInstance: example_skip_dag.always_true_2 2019-09-11 00:00:00+00:00 [None]>
<TaskInstance: example_skip_dag.always_true_1 2019-09-11 00:00:00+00:00 [None]>
<TaskInstance: example_short_circuit_operator.condition_is_True 2019-09-11
00:00:00+00:00 [None]>
<TaskInstance: example_short_circuit_operator.condition_is_False 2019-09-11
00:00:00+00:00 [None]>
<TaskInstance: example_subdag_operator.start 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_branch_dop_operator_v3.condition 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: example_http_operator.http_sensor_check 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: example_passing_params_via_test_command.run_this 2019-09-11
00:00:00+00:00 [None]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 00:00:00+00:00
[None]>
<TaskInstance: example_trigger_controller_dag.test_trigger_dagrun 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: example_skip_dag.skip_operator_2 2019-09-10 00:00:00+00:00
[None]>
<TaskInstance: example_skip_dag.skip_operator_1 2019-09-10 00:00:00+00:00
[None]>
<TaskInstance: example_skip_dag.always_true_2 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_skip_dag.always_true_1 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_short_circuit_operator.condition_is_True 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: example_short_circuit_operator.condition_is_False 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: example_bash_operator.runme_2 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_bash_operator.runme_1 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_bash_operator.runme_0 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_bash_operator.also_run_this 2019-09-10 00:00:00+00:00
[None]>
<TaskInstance: example_xcom.push_by_returning 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_xcom.push 2019-09-10 00:00:00+00:00 [None]>
<TaskInstance: example_branch_operator.run_this_first 2019-09-10
00:00:00+00:00 [None]>
<TaskInstance: latest_only.latest_only 2019-09-10 00:00:00+00:00 [None]>
[2019-09-12 17:56:03,092] \{{dag_processing.py:545}} INFO - Launched
DagFileProcessorManager with pid: 35
[2019-09-12 17:56:03,093] \{{scheduler_job.py:1390}} DEBUG - Starting Loop...
[2019-09-12 17:56:03,093] \{{scheduler_job.py:1401}} DEBUG - Harvesting DAG
parsing results
[2019-09-12 17:56:03,093] \{{scheduler_job.py:1403}} DEBUG - Harvested 0
SimpleDAGs
[2019-09-12 17:56:03,093] \{{scheduler_job.py:1438}} DEBUG - Heartbeating the
executor
[2019-09-12 17:56:03,093] \{{base_executor.py:124}} DEBUG - 0 running task
instances
[2019-09-12 17:56:03,094] \{{base_executor.py:125}} DEBUG - 0 in queue
[2019-09-12 17:56:03,094] \{{base_executor.py:126}} DEBUG - 96 open slots
[2019-09-12 17:56:03,094] \{{base_executor.py:135}} DEBUG - Calling the <class
'airflow.contrib.executors.kubernetes_executor.KubernetesExecutor'> sync method
[2019-09-12 17:56:03,100] \{{scheduler_job.py:1459}} DEBUG - Ran scheduling
loop in 0.01 seconds
[2019-09-12 17:56:03,101] \{{scheduler_job.py:1462}} DEBUG - Sleeping for 1.00
seconds
[2019-09-12 17:56:03,107] \{{settings.py:54}} INFO - Configured default
timezone <Timezone [America/New_York]>
[2019-09-12 17:56:03,109] \{{settings.py:327}} DEBUG - Failed to import
airflow_local_settings.
Traceback (most recent call last):
File "/usr/local/lib/python3.7/site-packages/airflow/settings.py", line 315,
in import_local_settings
import airflow_local_settings
ModuleNotFoundError: No module named 'airflow_local_settings'
[2019-09-12 17:56:03,111] \{{logging_config.py:47}} INFO - Successfully
imported user-defined logging config from log_config.LOGGING_CONFIG
[2019-09-12 17:56:03,120] \{{settings.py:170}} DEBUG - Setting up DB connection
pool (PID 35)
[2019-09-12 17:56:03,121] \{{settings.py:213}} INFO - settings.configure_orm():
Using pool settings. pool_size=5, max_overflow=10, pool_recycle=1800, pid=35
[2019-09-12 17:56:03,289] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 45)
[2019-09-12 17:56:03,356] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 41)
[2019-09-12 17:56:04,101] \{{scheduler_job.py:1474}} DEBUG - Sleeping for 0.99
seconds to prevent excessive logging
[2019-09-12 17:56:04,126] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor0-Process, stopped)>
[2019-09-12 17:56:04,127] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor1-Process, stopped)>
[2019-09-12 17:56:04,162] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 55)
[2019-09-12 17:56:04,223] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 58)
[2019-09-12 17:56:05,095] \{{scheduler_job.py:1390}} DEBUG - Starting Loop...
[2019-09-12 17:56:05,095] \{{scheduler_job.py:1401}} DEBUG - Harvesting DAG
parsing results
[2019-09-12 17:56:05,097] \{{dag_processing.py:637}} DEBUG - Received message
of type DagParsingStat
[2019-09-12 17:56:05,098] \{{dag_processing.py:637}} DEBUG - Received message
of type SimpleDag
[2019-09-12 17:56:05,098] \{{dag_processing.py:637}} DEBUG - Received message
of type SimpleDag
[2019-09-12 17:56:05,099] \{{dag_processing.py:637}} DEBUG - Received message
of type SimpleDag
[2019-09-12 17:56:05,099] \{{dag_processing.py:637}} DEBUG - Received message
of type SimpleDag
[2019-09-12 17:56:05,100] \{{dag_processing.py:637}} DEBUG - Received message
of type DagParsingStat
[2019-09-12 17:56:05,101] \{{dag_processing.py:637}} DEBUG - Received message
of type DagParsingStat
[2019-09-12 17:56:05,101] \{{scheduler_job.py:1403}} DEBUG - Harvested 4
SimpleDAGs
[2019-09-12 17:56:05,128] \{{scheduler_job.py:921}} INFO - 5 tasks up for
execution:
<TaskInstance: example_subdag_operator.start 2019-09-10 00:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 04:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 00:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 04:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 00:00:00+00:00
[scheduled]>
[2019-09-12 17:56:05,138] \{{scheduler_job.py:953}} INFO - Figuring out tasks
to run in Pool(name=default_pool) with 128 open slots and 5 task instances
ready to be queued
[2019-09-12 17:56:05,139] \{{scheduler_job.py:981}} INFO - DAG
example_subdag_operator has 0/48 running and queued tasks
[2019-09-12 17:56:05,139] \{{scheduler_job.py:981}} INFO - DAG
latest_only_with_trigger has 0/48 running and queued tasks
[2019-09-12 17:56:05,139] \{{scheduler_job.py:981}} INFO - DAG
latest_only_with_trigger has 1/48 running and queued tasks
[2019-09-12 17:56:05,139] \{{scheduler_job.py:981}} INFO - DAG
latest_only_with_trigger has 2/48 running and queued tasks
[2019-09-12 17:56:05,139] \{{scheduler_job.py:981}} INFO - DAG
latest_only_with_trigger has 3/48 running and queued tasks
[2019-09-12 17:56:05,139] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor2-Process, stopped)>
[2019-09-12 17:56:05,140] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor3-Process, stopped)>
[2019-09-12 17:56:05,157] \{{scheduler_job.py:1031}} INFO - Setting the
following tasks to queued state:
<TaskInstance: example_subdag_operator.start 2019-09-10 00:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 00:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 04:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 00:00:00+00:00
[scheduled]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 04:00:00+00:00
[scheduled]>
[2019-09-12 17:56:05,182] \{{scheduler_job.py:1107}} INFO - Setting the
following 5 tasks to queued state:
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 00:00:00+00:00
[queued]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 00:00:00+00:00
[queued]>
<TaskInstance: example_subdag_operator.start 2019-09-10 00:00:00+00:00
[queued]>
<TaskInstance: latest_only_with_trigger.latest_only 2019-09-10 04:00:00+00:00
[queued]>
<TaskInstance: latest_only_with_trigger.task2 2019-09-10 04:00:00+00:00
[queued]>
[2019-09-12 17:56:05,183] \{{scheduler_job.py:1143}} INFO - Sending
('latest_only_with_trigger', 'latest_only', datetime.datetime(2019, 9, 10, 0,
0, tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) to executor with
priority 4 and queue default
[2019-09-12 17:56:05,183] \{{base_executor.py:59}} INFO - Adding to queue:
['airflow', 'run', 'latest_only_with_trigger', 'latest_only',
'2019-09-10T00:00:00+00:00', '--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_latest_only_with_trigger.py']
[2019-09-12 17:56:05,183] \{{scheduler_job.py:1143}} INFO - Sending
('latest_only_with_trigger', 'task2', datetime.datetime(2019, 9, 10, 0, 0,
tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) to executor with priority
3 and queue default
[2019-09-12 17:56:05,183] \{{base_executor.py:59}} INFO - Adding to queue:
['airflow', 'run', 'latest_only_with_trigger', 'task2',
'2019-09-10T00:00:00+00:00', '--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_latest_only_with_trigger.py']
[2019-09-12 17:56:05,184] \{{scheduler_job.py:1143}} INFO - Sending
('example_subdag_operator', 'start', datetime.datetime(2019, 9, 10, 0, 0,
tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) to executor with priority
5 and queue default
[2019-09-12 17:56:05,184] \{{base_executor.py:59}} INFO - Adding to queue:
['airflow', 'run', 'example_subdag_operator', 'start',
'2019-09-10T00:00:00+00:00', '--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_subdag_operator.py']
[2019-09-12 17:56:05,184] \{{scheduler_job.py:1143}} INFO - Sending
('latest_only_with_trigger', 'latest_only', datetime.datetime(2019, 9, 10, 4,
0, tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) to executor with
priority 4 and queue default
[2019-09-12 17:56:05,184] \{{base_executor.py:59}} INFO - Adding to queue:
['airflow', 'run', 'latest_only_with_trigger', 'latest_only',
'2019-09-10T04:00:00+00:00', '--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_latest_only_with_trigger.py']
[2019-09-12 17:56:05,185] \{{scheduler_job.py:1143}} INFO - Sending
('latest_only_with_trigger', 'task2', datetime.datetime(2019, 9, 10, 4, 0,
tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) to executor with priority
3 and queue default
[2019-09-12 17:56:05,185] \{{base_executor.py:59}} INFO - Adding to queue:
['airflow', 'run', 'latest_only_with_trigger', 'task2',
'2019-09-10T04:00:00+00:00', '--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_latest_only_with_trigger.py']
[2019-09-12 17:56:05,185] \{{scheduler_job.py:1438}} DEBUG - Heartbeating the
executor
[2019-09-12 17:56:05,185] \{{base_executor.py:124}} DEBUG - 0 running task
instances
[2019-09-12 17:56:05,186] \{{base_executor.py:125}} DEBUG - 5 in queue
[2019-09-12 17:56:05,186] \{{base_executor.py:126}} DEBUG - 96 open slots
[2019-09-12 17:56:05,186] \{{kubernetes_executor.py:764}} INFO - Add task
('example_subdag_operator', 'start', datetime.datetime(2019, 9, 10, 0, 0,
tzinfo=<TimezoneInfo [UTC, GMT, +00:00:00, STD]>), 1) with command ['airflow',
'run', 'example_subdag_operator', 'start', '2019-09-10T00:00:00+00:00',
'--local', '--pool', 'default_pool', '-sd',
'/usr/local/lib/python3.7/site-packages/airflow/example_dags/example_subdag_operator.py']
with executor_config {}
[2019-09-12 17:56:05,193] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 72)
[2019-09-12 17:56:05,315] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 69)
[2019-09-12 17:56:06,153] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor4-Process, stopped)>
[2019-09-12 17:56:06,154] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor5-Process, stopped)>
[2019-09-12 17:56:06,290] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 83)
[2019-09-12 17:56:06,322] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 86)
[2019-09-12 17:56:07,171] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor6-Process, stopped)>
[2019-09-12 17:56:07,171] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor7-Process, stopped)>
[2019-09-12 17:56:07,248] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 97)
[2019-09-12 17:56:07,415] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 100)
[2019-09-12 17:56:08,188] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor8-Process, stopped)>
[2019-09-12 17:56:08,189] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor9-Process, stopped)>
[2019-09-12 17:56:08,247] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 111)
[2019-09-12 17:56:08,379] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 114)
[2019-09-12 17:56:09,205] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor10-Process, stopped)>
[2019-09-12 17:56:09,206] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor11-Process, stopped)>
[2019-09-12 17:56:09,323] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 125)
[2019-09-12 17:56:09,444] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 127)
[2019-09-12 17:56:10,223] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor12-Process, stopped)>
[2019-09-12 17:56:10,223] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor13-Process, stopped)>
[2019-09-12 17:56:10,299] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 139)
[2019-09-12 17:56:10,325] \{{settings.py:238}} DEBUG - Disposing DB connection
pool (PID 142)
[2019-09-12 17:56:11,237] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor14-Process, stopped)>
[2019-09-12 17:56:11,238] \{{scheduler_job.py:257}} DEBUG - Waiting for
<Process(DagFileProcessor15-Process, stopped)>```
> KubernetesExecutor hangs on task queueing
> -----------------------------------------
>
> Key: AIRFLOW-5447
> URL: https://issues.apache.org/jira/browse/AIRFLOW-5447
> Project: Apache Airflow
> Issue Type: Bug
> Components: executor-kubernetes
> Affects Versions: 1.10.4, 1.10.5
> Environment: Kubernetes version v1.14.3, Airflow version 1.10.4-1.10.5
> Reporter: Henry Cohen
> Assignee: Daniel Imberman
> Priority: Blocker
>
> Starting in 1.10.4, and continuing in 1.10.5, when using the
> KubernetesExecutor, with the webserver and scheduler running in the
> kubernetes cluster, tasks are scheduled, but when added to the task queue,
> the executor process hangs indefinitely. Based on log messages, it appears to
> be stuck at this line
> https://github.com/apache/airflow/blob/v1-10-stable/airflow/contrib/executors/kubernetes_executor.py#L761
--
This message was sent by Atlassian Jira
(v8.3.2#803003)