Thanks Vikram, XD, and Jake for the proposal. It covers all the angles I can think at the time being and I appreciate merging together the various patterns.
The AIP really covers good breadth and depth too. +1 from me on this, and I hope we can see this one in action. Happy to help with any efforts here. Thanks & Regards, Amogh Desai On Mon, Mar 23, 2026 at 4:45 AM Xiaodong Deng <[email protected]> wrote: > Thanks, Vikram, for helping bring all the efforts together. > And thanks, everyone, for your positive feedback. > > I have been discussing this proposal with Vikram offline, and I'm quite > confident it is going to resolve quite a few pending issues and > inconveniences in how people use Airflow, or at least help people avoid > unnecessary hacks. > > As far as I could see, it will be able to cover all the use cases I > brought up in my earlier draft AIP, "Add 'persist_xcom_through_retry' > Parameter to Airflow Operators" ( > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=399278333). > That's also why I look forward even more to seeing this Task State > Management feature in a very near future version of Airflow. > > Would love to receive more feedback from the community. Vikram, Jake, and > I are looking forward to working with everyone to bring this thrilling > feature to life. > > Regards, > XD > > On 2026/03/22 06:48:10 Rahul Vats wrote: > > Thanks, Vikram for bringing this up. > > > > A big +1 from me as well. The three patterns you mentioned are very > real, I > > have seen users stretch XCom in all sorts of ways to fill exactly these > > gaps. > > The clean separation from XCom with different scoping and lifecycle > makes a > > lot of sense. Will go through the AIP doc in detail. > > > > Thanks, > > Rahul > > > > > > On Sun, 22 Mar 2026 at 01:39, Jens Scheffler <[email protected]> > wrote: > > > > > Thanks Vikram, Jake, XD also from my side! > > > > > > A big +1 for moving this forward and I think this is really important. > > > Though from reading over it I do not see why it is marked as DRAFT, > > > because besides nt I think it is already very mature. All what I saw is > > > in general "right". So I hope this is a not really controversional > > > discuss and then we can get this in 3.3! > > > > > > (Some could say this concept is overdue... but is important to have!) > > > > > > Jens > > > > > > On 21.03.26 20:58, Jarek Potiuk wrote: > > > > Thanks Vikram, > > > > > > > > This is a crucial AIP for Airflow 3.3+. I skimmed through it and will > > > > provide more comments over the coming days, but it very much looks > like > > > > what I imagined for state management in Airflow. > > > > It has about the right abstraction layer, focusing on building > > > > infrastructure that serves the previously articulated - use cases and > > > > likely supports other use cases we are not yet aware of. I really > like > > > how > > > > it maps the "generic" interface into those cases. > > > > > > > > I have this old "rule of thumb": you need at least three use cases > to be > > > > able to design a truly reusable infrastructure API/component. .. > Here we > > > > have 3 use cases it will serve :) > > > > > > > > Jl > > > > > > > > > > > > On Sat, Mar 21, 2026 at 8:44 PM Vikram Koka via dev < > > > [email protected]> > > > > wrote: > > > > > > > >> Dear Airflowers, > > > >> > > > >> Over the last several months, there have been a lot of discussions > in > > > the > > > >> devlist around improvements needed for long running jobs outside of > > > Airflow > > > >> (raised by XD and others), and about improved event triggering > (raised > > > by > > > >> Jake and others). XD, Jake, and I have gotten together and > collaborated > > > on > > > >> a unified approach for Task State Management within Airflow which we > > > would > > > >> like to propose. > > > >> > > > >> Apache Airflow has been built around stateless, idempotent tasks, > and > > > this > > > >> has served the community incredibly well. But as production AI and > data > > > >> workloads have grown more sophisticated, a clear gap has emerged > that > > > the > > > >> community has been working around for a while. > > > >> > > > >> Three patterns keep coming up. An incremental operator needs to know > > > where > > > >> it left off last time, so it does not reprocess data it has already > > > >> handled. An operator running a Databricks or EMR job needs to > survive a > > > >> worker disruption without cancelling a job that was 90% complete and > > > >> starting over from scratch. A long-running async task processing > > > thousands > > > >> of files needs to checkpoint its progress so a retry picks up where > it > > > left > > > >> off, not from the beginning. > > > >> > > > >> All three patterns are forcing users into the same workarounds today > > > >> generally bending XCom beyond its intended purpose, or building > their > > > own > > > >> state persistence outside of Airflow entirely. > > > >> > > > >> We think we can do better. AIP-XX: Task State Management is a new > > > >> foundation AIP that addresses all three patterns through a single, > > > minimal, > > > >> pluggable framework. Built on top of the Execution API from AIP-72, > with > > > >> full async support consistent with AIP-98, Task State is > deliberately > > > and > > > >> cleanly separate from XCom, with different scoping, different > lifecycle > > > >> semantics, and different garbage collection mechanics. It also > provides > > > the > > > >> foundation for a simplified AIP-93 (Asset Watermarking) > > > >> < > > > >> > > > > https://cwiki.apache.org/confluence/display/AIRFLOW/%5BWIP%5D+AIP-93+Asset+Watermarks+and+State+Variables > > > >> and for long running remote operations using either the AIP-tbd > > > Persistent > > > >> Parameter for Airflow Operators > > > >> < > > > >> > > > > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=399278333 > > > >> or AIP-96 (Resumable Operators) > > > >> < > > > >> > > > > https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-96+Resumable+Operators > > > >> . > > > >> > > > >> Full draft is on Confluence as Draft AIP-xx: Task State Management > > > >> < > > > >> > > > > https://cwiki.apache.org/confluence/display/AIRFLOW/Draft%3A+AIP-xx%3A+Task+State+Management > > > >> We would love to hear your thoughts. Please comment on the AIP doc. > > > >> > > > >> Best regards, > > > >> Vikram, XD, and Jake > > > >> -- > > > >> > > > >> Vikram Koka > > > >> Chief Strategy Officer > > > >> Email: [email protected] > > > >> > > > >> > > > >> <https://www.astronomer.io/> > > > >> > > > > > > --------------------------------------------------------------------- > > > To unsubscribe, e-mail: [email protected] > > > For additional commands, e-mail: [email protected] > > > > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
