[ https://issues.apache.org/jira/browse/AIRFLOW-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15906796#comment-15906796 ]
ASF subversion and git services commented on AIRFLOW-910:
---------------------------------------------------------
Commit dcc8ede5c1a2f6819b151dd5ce839f0a0917313a in incubator-airflow's branch
refs/heads/v1-8-test from [~bolke]
[ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=dcc8ede ]
[AIRFLOW-910] Use parallel task execution for backfills
The refactor to use dag runs in backfills caused a regression in task
execution performance, as dag runs were executed sequentially. In addition,
backfills were non-deterministic because tasks were executed in random
order, which caused root tasks to be added to the not-ready list too soon.
This updates the backfill logic as follows:
* Parallelize execution of tasks
* Use a leaf-first execution model
* Replace state updates from the executor with task-based updates only
Closes #2107 from bolkedebruin/AIRFLOW-910
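As a rough illustration of why sequential dag runs slow a backfill down, here is a toy model, not Airflow's actual backfill code. The task names, the DEPS mapping, and both helper functions are hypothetical. It counts scheduling rounds, assuming every ready task in a round can run at the same time, and picks ready tasks in sorted order so the result is deterministic.

```python
# Toy model only: every dag run has the same tasks. DEPS maps each task to
# the set of upstream tasks it waits for (hypothetical names, must be acyclic).
DEPS = {"extract": set(), "transform": {"extract"}, "load": {"transform"}}


def ready_tasks(deps, done):
    """Return unfinished tasks whose upstream tasks are all done, sorted
    so that the selection order is deterministic."""
    return sorted(t for t, ups in deps.items() if t not in done and ups <= done)


def sequential_rounds(run_ids, deps):
    """Finish one dag run completely before starting the next one."""
    rounds = 0
    for _ in run_ids:
        done = set()
        while len(done) < len(deps):
            done.update(ready_tasks(deps, done))
            rounds += 1
    return rounds


def parallel_rounds(run_ids, deps):
    """In each round, run the ready tasks of every dag run together."""
    done = {run_id: set() for run_id in run_ids}
    rounds = 0
    while any(len(d) < len(deps) for d in done.values()):
        # Build the whole batch first, then mark it done, so a task finished
        # in this round cannot unblock another task within the same round.
        batch = [
            (run_id, task)
            for run_id in sorted(done)
            for task in ready_tasks(deps, done[run_id])
        ]
        for run_id, task in batch:
            done[run_id].add(task)
        rounds += 1
    return rounds


RUNS = ["2017-01-01", "2017-01-02", "2017-01-03"]
print(sequential_rounds(RUNS, DEPS), parallel_rounds(RUNS, DEPS))
```

In this model the number of rounds grows with the number of dag runs when they are processed one at a time, while it stays at the depth of the task chain when ready tasks from all runs are batched together.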
> Parallelize dag runs in backfills
> ---------------------------------
>
> Key: AIRFLOW-910
> URL: https://issues.apache.org/jira/browse/AIRFLOW-910
> Project: Apache Airflow
> Issue Type: Sub-task
> Components: backfill
> Affects Versions: 1.8.0rc4
> Reporter: Bolke de Bruin
> Assignee: Bolke de Bruin
> Priority: Blocker
> Fix For: 1.8.0
>
>
> Currently dag runs are executed sequentially while backfilling. This is a
> regression and slows down the processing of tasks.
> [~aoen]
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)