Re: [PROPOSAL] New operator for "watcher" scenario

Ash Berlin-Taylor Thu, 10 Feb 2022 11:05:10 -0800

The one caveat to this is you have to do it "last" (obviously) and thedirection matters.

For instance `watcher() << dag.tasks` doesn't work as it tries to setwatcher to a dependency of itself which falls foul of the cycledetector (the Acyclic property of DAG is enforced).


But given list >> task works fine that's not a problem. Just FYI I guess

-ash

On Thu, Feb 10 2022 at 20:00:58 +0100, Jarek Potiuk <[email protected]>wrote:

Ash! you are my hero :)
On Thu, Feb 10, 2022 at 7:58 PM Ash Berlin-Taylor <[email protected]<mailto:[email protected]>> wrote:
dag.tasks >> watcher()

No new syntax nor pre-commit needed :)
On Thu, Feb 10 2022 at 15:35:58 +0100, Jarek Potiuk<[email protected] <mailto:[email protected]>> wrote:
Hey everyone,
I have a small proposal about adding a new overloaded operator -initial proposal '>>=` (and a method), resulting from ourdiscussion on AIP-47 - system tests refactoring(<https://lists.apache.org/thread/htd4013yn483qfhwv11vc26jpf2yvjph>).
The ">>=" operator is not really 100% necessary to complete theAIP-47 (we have ways around it with somewhat complex-ishpre-commits) but it might simplify the way how we approach theexample dags turned into system tests and make them moremaintainable - but also it might simplify "real" DAGS for someusers.
Context:
In the AIP-47 we have a need for "status" like functionality insystem tests. The basic idea is to encompass all "test code" in asingle file - example dag. This way each system test (which arecurrently spread among DAG files, pytest tests and configurationwill "shrink" to the single "example_dag" file. This is a greatsimplification and it will be extremely helpful in system tests butit means that we need a "status" like functionality in the DAGsthat will fail in case any of the tasks failed during the execution.
In a number of DAGs we have to do "cleanup/tearDown" as the laststep no matter if any of the tasks failed (and this can be easilydone with "leaf" all_done rule) but then such a cleanup operationdetermines the status of the DAG ("failed" succeeded". So we needto by-pass the "cleanup" task and in case of any task failureexecute the "watcher" task (as we named it) as a leaf-node todetermine the status of the whole DAG.
Proposed Solution:
Maybe there is an easier way than what we came up with (happy tohear it), but the idea we have is to have a "watcher" leaf nodetask that has "one_failed" dependency on all the tasks in theexample DAG. This leads to all a bit more complex example dags toend up with:
[task1, task2, task3, ... ] >> watcher
This is a bit brittle because it is long and you have to rememberto add new tasks added in the example dags in the future (and it'svery easy to miss it as there is no "problem" if you miss a task).We can automate it in pre-commits for example dags, but we couldeasily solve it by adding a new operator, that would add"one_failed" dependency between all other tasks to the watcher task:
Pseudocode:

I proposed >>= operator but there also should be a "set_" method):

dag >>= watcher

the operator:

for task in dag.task:
    if not task == watcher:
        task >> watcher
The operator might also be needed in other scenarios - when someonewants to send notifications when any of the tasks failed(<https://stackoverflow.com/questions/50959743/airflow-trigger-rule-using-one-failed-cause-dag-failure>)or (I imagine) when you want to make sure to teardown your wholeinfrastructure you set-up in a complex DAG with cases similar tothe case Cloudflare presented last year in the summit<https://airflowsummit.org/sessions/2021/provision-as-a-service/> .Some of that can be done with cluster policies etc. But I thoughtthis might be a much nicer way.
WDYT? Are there any better/simpler ways of solving this problemthat we are not aware of ? Do you think it is "enough" ofjustification to add a new method/overloaded operator like that?Are there any strong "no", or maybe we could agree to it via lazyconsensus if there are some supportive voices and no viablealternative?
BTW. We are also open to change (or drop) the ">>=" proposedoperator if someone thinks this might be confusing.
J.

Re: [PROPOSAL] New operator for "watcher" scenario

Reply via email to