[
https://issues.apache.org/jira/browse/AIRFLOW-1147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15989176#comment-15989176
]
Mubin Khalid commented on AIRFLOW-1147:
---------------------------------------
another issue I'm facing is when I try to execute this {code}airflow backfill
testDag -s 2017-04-28T12:37 -e 2017-04-28T12:55{code} with interval of 3
minutes, it simple says that tasks been deadblocked, how can I achieve that?
Here is output and all the stuff.
{code}
airflow backfill testDag -s 2017-04-28T12:37 -e 2017-04-28T12:55
[2017-04-28 12:36:21,483] {__init__.py:57} INFO - Using executor LocalExecutor
[2017-04-28 12:36:21,901] {models.py:167} INFO - Filling up the DagBag from
/airflow/dags
[2017-04-28 12:36:22,038] {models.py:1939} WARNING - start_date for
<Task(BashOperator): raw_sales_elastic> isn't datetime.datetime
[2017-04-28 12:36:22,378] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:55:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:55:00 is in
the future (the current date is 2017-04-28T12:36:22.378118).
/anaconda3/lib/python3.5/site-packages/airflow/ti_deps/deps/base_ti_dep.py:94:
PendingDeprecationWarning: generator '_get_dep_statuses' raised StopIteration
for dep_status in self._get_dep_statuses(ti, session, dep_context):
[2017-04-28 12:36:22,383] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:40:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:40:00 is in
the future (the current date is 2017-04-28T12:36:22.383537).
[2017-04-28 12:36:22,388] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:43:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:43:00 is in
the future (the current date is 2017-04-28T12:36:22.388029).
[2017-04-28 12:36:22,392] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:49:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:49:00 is in
the future (the current date is 2017-04-28T12:36:22.392258).
[2017-04-28 12:36:22,396] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:46:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:46:00 is in
the future (the current date is 2017-04-28T12:36:22.396745).
[2017-04-28 12:36:22,401] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:52:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:52:00 is in
the future (the current date is 2017-04-28T12:36:22.401057).
[2017-04-28 12:36:22,405] {models.py:1120} INFO - Dependencies not met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:37:00 [scheduled]>,
dependency 'Execution Date' FAILED: Execution date 2017-04-28T12:37:00 is in
the future (the current date is 2017-04-28T12:36:22.405613).
[2017-04-28 12:36:27,119] {jobs.py:1935} WARNING - Deadlock discovered for
tasks_to_run=dict_values([<TaskInstance: testDag.slack_notify 2017-04-28
12:55:00 [scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:40:00
[scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:43:00
[scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:49:00
[scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:46:00
[scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:52:00
[scheduled]>, <TaskInstance: testDag.slack_notify 2017-04-28 12:37:00
[scheduled]>])
[2017-04-28 12:36:27,124] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:37:00: backfill_2017-04-28T12:37:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,142] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:40:00: backfill_2017-04-28T12:40:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,157] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:43:00: backfill_2017-04-28T12:43:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,172] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:46:00: backfill_2017-04-28T12:46:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,187] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:49:00: backfill_2017-04-28T12:49:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,201] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:52:00: backfill_2017-04-28T12:52:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,216] {models.py:4024} INFO - Updating state for <DagRun
testDag @ 2017-04-28 12:55:00: backfill_2017-04-28T12:55:00, externally
triggered: False> considering 1 task(s)
[2017-04-28 12:36:27,227] {jobs.py:1978} INFO - [backfill progress] | finished
run 0 of 7 | tasks waiting: 0 | succeeded: 0 | kicked_off: 0 | failed: 0 |
skipped: 0 | deadlocked: 7 | not ready: 7
[2017-04-28 12:36:27,248] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:37:00 [scheduled]>
[2017-04-28 12:36:27,249] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:37:00 [scheduled]>
[2017-04-28 12:36:27,249] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:46:00 [scheduled]>
[2017-04-28 12:36:27,249] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:46:00 [scheduled]>
[2017-04-28 12:36:27,250] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:40:00 [scheduled]>
[2017-04-28 12:36:27,250] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:40:00 [scheduled]>
[2017-04-28 12:36:27,251] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:55:00 [scheduled]>
[2017-04-28 12:36:27,251] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:55:00 [scheduled]>
[2017-04-28 12:36:27,251] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:43:00 [scheduled]>
[2017-04-28 12:36:27,252] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:43:00 [scheduled]>
[2017-04-28 12:36:27,252] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:52:00 [scheduled]>
[2017-04-28 12:36:27,252] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:52:00 [scheduled]>
[2017-04-28 12:36:27,252] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:49:00 [scheduled]>
[2017-04-28 12:36:27,253] {models.py:1126} INFO - Dependencies all met for
<TaskInstance: testDag.slack_notify 2017-04-28 12:49:00 [scheduled]>
Traceback (most recent call last):
File "/anaconda3/bin/airflow", line 28, in <module>
args.func(args)
File "/anaconda3/lib/python3.5/site-packages/airflow/bin/cli.py", line 167,
in backfill
pool=args.pool)
File "/anaconda3/lib/python3.5/site-packages/airflow/models.py", line 3330,
in run
job.run()
File "/anaconda3/lib/python3.5/site-packages/airflow/jobs.py", line 200, in
run
self._execute()
File "/anaconda3/lib/python3.5/site-packages/airflow/jobs.py", line 2021, in
_execute
raise AirflowException(err)
airflow.exceptions.AirflowException:
---------------------------------------------------
BackfillJob is deadlocked. These tasks have succeeded:
set()
These tasks have started:
{}
These tasks have failed:
set()
These tasks are skipped:
set()
These tasks are deadlocked:
{<TaskInstance: testDag.slack_notify 2017-04-28 12:37:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:46:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:40:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:55:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:43:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:52:00 [scheduled]>,
<TaskInstance: testDag.slack_notify 2017-04-28 12:49:00 [scheduled]>}
{code}
> airflow scheduler not working
> -----------------------------
>
> Key: AIRFLOW-1147
> URL: https://issues.apache.org/jira/browse/AIRFLOW-1147
> Project: Apache Airflow
> Issue Type: Bug
> Components: scheduler
> Affects Versions: Airflow 1.8
> Environment: CentOS running on 128 GB ram
> Reporter: Mubin Khalid
> Priority: Critical
> Labels: documentation, newbie
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> I've created some `DAG`s, and I tried to put it on scheduler. I want to run
> all the tasks in the DAG after exact 24 hours.
> I tried to do something like this.
> {code}
> DEFAULT_ARGS = {
> 'owner' : 'mubin',
> 'depends_on_past' : False,
> 'start_date' : datetime(2017, 4, 24, 14, 30),
> 'retries' : 5,
> 'retry_delay' : timedetla(1),
> }
> SCHEDULE_INTERVAL = timedelta(minutes=1440)
> # SCHEDULE_INTERVAL = timedelta(hours=24)
> # SCHEDULE_INTERVAL = timedelta(days=1)
> dag = DAG('StandardizeDataDag',
> default_args = DEFAULT_ARGS,
> schedule_interval = SCHEDULE_INTERVAL
> )
> {code}
> I tried to put different intervals, but not any working. However if I try to
> reset db {code} airflow resetdb -y {code} and then run {code} airflow
> initdb {code} , it works for once. then after that, scheduler isn't able to
> run it.
> PS. {code} airflow scheduler {code} executed from {code} root {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)