Re: Discuss/proposal: Update our AI coding policy to "forbid" agents opening PRs (not banning LLM generated-code)

Shahar Epstein Sun, 14 Jun 2026 08:53:26 -0700

Hey Kaxil,

First of all - if my previous mail hit a nerve, I do apologize - that
wasn't my intention at all, and I'll be more than happy to discuss it
offline if that would be helpful.
To clarify, I provided the background mostly for readers who may not have
been aware of the previous discussions, and as context for the current
conversation - not to suggest that "this was decided, end of story", nor to
dismiss concerns with hand-wavy arguments.


(long part between the dashes, feel free to skip if it's not interesting)
---
I would like to share my pain points which are behind my suggestion:
1. As a reviewer - I'm concerned, and I think others have expressed similar
concerns, that our community (at least on GitHub) is gradually moving
toward a more agent-driven culture rather than community-driven (with
agentic assistance). People (including me) let agents write their code,
hopefully while understanding, validating, and taking ownership of the
result. At the same time, maintainers are expected to review those changes.
The ease of generating contributions, in my opinion, can weaken the sense
of ownership - especially among first-time contributors. I have reviewed
PRs where the contributor simply disappeared (in the worst case), or at
best is unable to explain the reasoning behind one change or another.

How the checkbox and a human explanation might help to address the above*:

Warning: the following are not a thesis against AI usage - but to express
the my prespective as a reviewer:
a. The checkbox helps reviewers understand the context of the contribution.
In today's reality, PRs that are primarily human-written (or anywhere close
to that) are more likely to come from someone who either understands what
they are trying to achieve, or at least has tried to do so, which can make
the review process easier. Heavily AI-generated PRs can be a different
experience, much depending on the contributor. When they come from
committers or very active contributors, there is usually little to no issue
because as they take ownership of the changes. With first-time
contributors, however, that is not always the case and it varies between
"no issue" to "low-quality", which I believe is the reason for this thread
in the first place.

Having a short human-made explanation in the PR's description of "I changed
X to fix Y..." instead of bunch of AI generated text will be way more
helpful:
a. For reviewers to understand what's going on before they start going over
code.
b. For all contributors - a basis for (some?) human communication inside
the PR. PR discussions are ultimately conversations between contributors
and maintainers, and concise human explanations help facilitate those
conversations.

2. As a release manager - in each providers release I currently need to go
through each and every related-commit, make sure that it's classified
correctly (feature/bug/misc./etc.) and that the change is properly
reflected in the Changelog - which is the most important thing to maintain
as a release manager (came from Elad's strict school 🙂). Even with LLMs
enabled to address that, they often miss: sometimes it is accidentally
merged with a wrong title, sometimes the motivation behind the change is
not clear from the changes themselves, and sometimes the LLM just
hallucinates. Having this short explanation in the PR saves reading
time/tokens for the RM that needs to investigate these cases (in my last
release I encountered plenty of them).

---

> "know the motivation behind a PR and any gotchas"
> Agents are very good at producing PRs
with no real motivation behind them, so a human actually articulating the
why and the what-to-watch-for is worth more now than it ever was

When I review PRs I do my best to understand what they are about (w/ or w/o
the help of an agent), I'm sure that you and all maintainers do as well.
The problems that I see with these statements in current reality:
1. The amount of PRs, their contents (description+files), and the context
that we (and/or the agents) need to understand in order to review them is
relatively big comparing to current number of maintainers. The real pain is
in the AI slops that waste everyone's time and tokens.
2. How can we encourage "community-over-code" in this reality?

My current view is that retaining the checkbox and requiring a short
human-written explanation are practical ways to address some of the issues
above.
However, as long as there's no consensus - it won't happen, which is why
I'll be happy to hear your suggestions as well as everyone's (claiming that
both are non-issues is fine too).


Shahar



On Sun, Jun 14, 2026 at 2:52 PM Kaxil Naik <[email protected]> wrote:

> I am also stunned by your following statement, and I really hope this is a
> clear misunderstanding on your part or mine.
>
> I don't think that comparison to what we used to do in the past is
> > "apples-to-apples".
>
>
> If you check my email, the lesson from all those years was simple: know the
> motivation behind a PR and any gotchas. Sometimes the title alone made that
> obvious, so I merged the PR with no description. That lesson is more
> relevant in the age of AI, not less. Agents are very good at producing PRs
> with no real motivation behind them, so a human actually articulating the
> why and the what-to-watch-for is worth more now than it ever was.
>
> I never said: do X because we've always done it that way. In fact, I always
> ask if this is still worth doing since time changes. However, the learnings
> are the only thing you carry forward from years -- and thats precisely what
> I stated: "learnings" and a recommendation.
>
>
>
>
> On Sun, 14 Jun 2026 at 11:25, Kaxil Naik <[email protected]> wrote:
>
> > @Shahar Epstein I think you completely missed the point of my email. Let
> > me try and explain. tldr: "Are we using that field at all at present"!
> >
> > > adding the checkbox was discussed on the devlist and agreed upon by a
> > lazy cocensus
> > Yup, absolutely it was. And I did not say it was not. That's why I said
> > "is that those are guidelines and could help spot drive-by new
> contributors
> > ...... Hence, those are guidelines and are not rules ...." [1]  and "...
> > And we include the attribution line in AGENTS.md anyway, .."
> > My specific question, because I am genuinely curious was "Would you (and
> > generally everyone here) mind sharing what folks are using that field
> for?".
> > All the point you had in your tldr are pretty hand-waivy, could you help
> > me understand that, please?
> > - "Reduce reviewer burden " -- How? How does the attribution help with
> the
> > review?- Increase transparency. Again? What do we achieve by it? -
> Preserve
> > ownership -- Not really? What ownership? The PR author is responsible for
> > it. In fact we say that out loudly too: "By contributing code generated
> by
> > Gen-AI tools, you agree to comply with the project's licensing terms and
> > any applicable laws and regulations." "Remember that the final
> > responsibility for the code in your PR lies with you, regardless of
> whether
> > it was generated by a tool or written by you."
> >
> > > but I think that it's a serious matter for the project's future health
> > Stronlgy disagree, how is the attribution a serious matter for project's
> > future health? In my opinion we should care about ensuring the code
> > quality, ensuring the AI slop is away from the repo. Not the rubber
> > stamping of attribution.
> >
> > https://www.apache.org/legal/generative-tooling.html talks about
> > following and focuses more on copyrights as it should -- which I
> completely
> > agree:
> >
> >    - When disclosing these materials, contributors should also identify
> >    the licensing for these materials. The ASF maintains a 3rd Party
> Licensing
> >    Policy that provides guidance on which licenses are acceptable, along
> with
> >    instructions on the treatment of 3rd Party Works.
> >    - While in general, content generated by a non-human (e.g., machine or
> >    monkey) is not copyrightable, if content consists of some portions
> >    generated by AI and other portions authored by a human, the portions
> >    authored by a human may be copyrightable.
> >
> > The only para that has mentions of attribution is:
> >
> > >When providing contributions authored using generative AI tooling, a
> > recommended practice is for contributors to indicate the tooling used to
> > create the contribution. This should be included as a token in the source
> > control commit message, for example including the phrase “Generated-by:
> ”.
> > This allows for future release tooling to be considered that pulls this
> > content into a machine parsable Tooling-Provenance file.
> >
> > Read: no where it says this is mandatory. And the reason they cite is a
> > future release tooling might parse it. Not all the reasons very
> high-level
> > reasons you cited, which I completely disagree with.
> >
> >
> > The following is a compltely different thing :) and none of the
> > attribution stuff helps here, which is the gist of what I was really
> trying
> > to understand -- is that are we -- all of us as reviewers using that
> field
> > for anything.
> >
> > > AI tooling to cherry-pick and/or describe changes in the changelog, it
> > is even more important that PRs are well-summarized - to help them with
> the
> > release, as the number of PRs has nicely grown overtime - but there's
> > usually one release manager that handles them in each release.
> >
> > [1]:
> >
> https://github.com/apache/airflow/blob/main/contributing-docs/05_pull_requests.rst#gen-ai-assisted-contributions
> >
> > On Sun, 14 Jun 2026 at 11:17, Jarek Potiuk <[email protected]> wrote:
> >
> >> Yes. The https://www.apache.org/legal/generative-tooling.html is the
> >> decisive factor here. And it's not only a "cargo cult" of some sort,
> >> but a real legal expectation (not mandatory but strongly suggested)
> >> expectation. This is mostly because it allows us to avoid attempting
> >> to track provenance in case of any kind of legal disputes when the
> >> matter—for example, using Gen AI trained on GPL software—is settled.
> >> For now those disputes are not yet settled - though there are already
> >> some signs showing that "GenAI" is not going to be treated the same as
> >> "Copying" - so copyright and licences related to copyright have no
> >> impact here (again - not settled yet).
> >>
> >> And I agree with Shahar that introducing a bit of "friction" in the
> >> process - and especially doing everything to slow things down while
> >> removing things that should not take our attention is important.
> >>
> >> Maybe - we can start discussing on a "complete" change there - I.a PR
> >> with a description of the expected process and even maybe showing the
> >> expected "flow" of contribution we want to have - with/without AI and
> >> wiht assistance rather than with no human involvement. I think it
> >> would be great to describe it and discuss it - knowing what happens
> >> where and also connect it with "what happens next" - i.e. we will not
> >> improve our experience (and experience of our contributors) if we do
> >> not think about the contribution process end-2-end - we not only have
> >> to set expectations for our contributors, but we also have to be clear
> >> what they should expect from us - becasue this human - human relation
> >> works both ways.
> >>
> >> J.
> >>
> >> On Sun, Jun 14, 2026 at 7:16 AM Shahar Epstein <[email protected]>
> wrote:
> >> >
> >> > To put things into context, adding the checkbox was discussed
> >> > <https://lists.apache.org/thread/s5pchk082wpqro8vk400c7wv5jhsbvwg> on
> >> the
> >> > devlist and agreed upon by a lazy cocensus
> >> > <https://lists.apache.org/thread/9b19dcbcdb41ngw0jqgzcsrtrxl0v34c>.
> >> >
> >> > tl;dr - why we need it:
> >> > - Reduce reviewer burden (it was introduced at the time when we were
> >> > overwhelmed with AI slop)
> >> > - Increase transparency
> >> > - Preserve ownership
> >> > - Legal considerations
> >> > <https://www.apache.org/legal/generative-tooling.html> (was briefly
> >> > mentioned as part of the proposed template, but I think that it's a
> >> serious
> >> > matter for the project's future health)
> >> >
> >> > I don't think that comparison to what we used to do in the past is
> >> > "apples-to-apples".
> >> > We're in a very different state in terms of community size, number of
> >> > releases, and now we're at the beginning of an AI revolution.
> >> > So asking contributors to add a short description to ensure that
> they're
> >> > aware of and own the changes they proposed is a blessing (and not a
> big
> >> > deal, IMO - eventually it's a matter of copying-pasting-stylizing the
> >> > prompt that they had just given to the AI moments before creating the
> >> PR).
> >> > From another perspective, now that release managers use AI tooling to
> >> > cherry-pick and/or describe changes in the changelog, it is even more
> >> > important that PRs are well-summarized - to help them with the
> release,
> >> as
> >> > the number of PRs has nicely grown overtime - but there's usually one
> >> > release manager that handles them in each release.
> >> >
> >> >
> >> > Shahar
> >> >
> >> >
> >> > On Sun, Jun 14, 2026 at 4:00 AM Kaxil Naik <[email protected]>
> wrote:
> >> >
> >> > > Hi Przemsyslaw,
> >> > >
> >> > > Thanks for the nudge for not keeping the GenAI checkbox on
> >> > > https://github.com/apache/airflow/pull/68492
> >> > > .
> >> > >
> >> > > Would you (and generally everyone here) mind sharing what folks are
> >> using
> >> > > that field for?
> >> > >
> >> > > I deliberately try to not keep that field in my PR descriptions. And
> >> while
> >> > > reviewing the PRs as well I do not look at that field. As similar to
> >> what
> >> > > you said, it has 0 bearing on my code review. Either the PR is good
> or
> >> > > genuine attempt with mistakes or pure slop.
> >> > >
> >> > > My read (which can obviously be just my own interpretation) is that
> >> those
> >> > > are guidelines and could help spot drive-by new contributors who in
> an
> >> > > attempt to create stars for their GitHub profile, create genuine
> slop
> >> with
> >> > > 0 project understanding. For committers on the other hand, they are
> >> well
> >> > > aware and have knowledge of the working of the project. Hence, those
> >> are
> >> > > guidelines and are not rules. I can keep the checkbox on my PR but I
> >> don’t
> >> > > think it would serve any purpose. And we include the attribution
> line
> >> in
> >> > > AGENTS.md anyway, so any AI agent will add that line to the PR by
> >> current
> >> > > design. So that isn’t going to help if we plan or decide to use that
> >> to
> >> > > classify AI slop.
> >> > >
> >> > > I have closed probably 50+ PRs over last several months on the
> >> Airflow repo
> >> > > to close sloppy PRs but haven’t used that field to judge that but
> >> more the
> >> > > pattern of several PRs, unrelated changes, lack of response when
> >> asked with
> >> > > technical questions etc were the reason.
> >> > >
> >> > > Over Airflow’s history of last 10-11 years, the PR description
> >> template has
> >> > > undergone various incarnation.
> >> > > https://github.com/apache/airflow/pull/2810 Is an example of my
> >> simple PR
> >> > > -
> >> > > 9 years ago. And while it looks closed, it isn’t:) we just used
> >> different
> >> > > mechanism to merge changes to main. And this PR was merged. And
> >> lessons
> >> > > from all those years was to know the motivation behind the PR and
> any
> >> > > gotchas. We have had several PRs with no descriptions at all but I
> >> might
> >> > > have merged them as well as it was just too evident from the PR
> title.
> >> > >
> >> > > So my recommendation would not be to add/need anything in PR
> >> description
> >> > > which we aren’t going to use to determine something from it. If we’d
> >> like
> >> > > to do any test like the one suggested in thread email, i.e some
> action
> >> > > based on the failure of the test, I am fine with it.
> >> > >
> >> > > Regards,
> >> > > Kaxil
> >> > >
> >> > > On Sat, 13 Jun 2026 at 13:42, Przemysław Mirowski <
> [email protected]>
> >> > > wrote:
> >> > >
> >> > > > Hello everyone,
> >> > > >
> >> > > > For the start, this message can be a little out of context of this
> >> > > > discussion (sorry about that), but as it touches the AI usage on
> the
> >> > > > Airflow project, I felt that it may be worth it to add my 2c here.
> >> > > >
> >> > > > As for the context - I don't use AI to do any coding, prepare PRs,
> >> review
> >> > > > the code. I don't use it also in other areas in my life as a
> >> > > contradiction
> >> > > > to the fact that I did some science research on AI in couple of
> >> areas
> >> > > e.g.
> >> > > > medicine. I don't use it mainly from 2 reasons:
> >> > > >
> >> > > >   1.
> >> > > > I don't trust AI - any generative model can generate stuff which
> >> are not
> >> > > > true and most of the time, AI is pretty convinced that it is
> right,
> >> when
> >> > > it
> >> > > > is not
> >> > > >   2.
> >> > > > I believe that Software Engineering is a craft which I just like
> >> doing
> >> > > and
> >> > > > getting better in it. Using any AI, will not make my craftsmanship
> >> > > better.
> >> > > > In the contrary, it will made me dependent on the tool and will
> not
> >> make
> >> > > me
> >> > > > exercise my understanding on multiple levels of Software
> >> Engineering as
> >> > > the
> >> > > > design decisions will be proposed by the model and not the result
> >> of my
> >> > > > thoughts and understanding of the issue. Now I can write anything,
> >> with
> >> > > > using AI to generate code for 3 years straight, I don't believe
> >> that I
> >> > > > could write same quality of code or maybe I would not be able to
> >> write
> >> > > > anything after that long time
> >> > > >
> >> > > > Of course there are more reasons like climate-related stuff, but
> >> these
> >> > > two
> >> > > > are the most important for me.
> >> > > >
> >> > > > I am the Apache Airflow contributor for some time now and in
> >> majority of
> >> > > > cases, I'm involved in the Helm Chart area. As there are not many
> >> things
> >> > > > going on there, the PRs number is low. As there was not many
> >> interest in
> >> > > > the Helm Chart in the past, I started doing review to potentially
> >> make
> >> > > PRs
> >> > > > "ready for maintainer" review to, maybe, make Helm Chart alive
> >> again. Due
> >> > > > to all AI-stuff going on since some time now, I'm doing less
> review
> >> and
> >> > > it
> >> > > > takes longer time. Not because I'm less committed to the project
> or
> >> I
> >> > > don't
> >> > > > like AI or anything. Personally, who wrote the code (human or AI)
> it
> >> > > > doesn't really matter for me, until the quality is good and the
> >> code is
> >> > > > fitted correctly to the project and does not break e.g.
> consistency
> >> of
> >> > > it.
> >> > > > What I see in the Helm Chart-related PRs is that people do not
> >> review the
> >> > > > code which they commit and, in most cases, when the e.g. helm
> >> template
> >> > > > logic is not perfect, but good enough for me to press "Approve",
> >> the test
> >> > > > cases are just out of the place e.g. in terms of quality,
> >> consistency or
> >> > > > even duplication (same test case already exists somewhere in the
> >> test
> >> > > suite
> >> > > > and new one is proposed). For me this is really discouraging,
> >> because it
> >> > > > basically kills "Community over Code" which is the core of the
> >> Apache
> >> > > > Software Foundation which was part of my decision why I've got
> >> involved
> >> > > in
> >> > > > the project.
> >> > > >
> >> > > > But, keeping some more relevance to the thread itself, I would ask
> >> you,
> >> > > as
> >> > > > the Maintainers of the project, to slow down a bit. I feel like
> >> past 2-3
> >> > > > months was like a sprint to try to solve the issue and looking at
> >> some
> >> > > PRs,
> >> > > > comments and discussions on the devlist, I think that some things
> >> are
> >> > > > tested too quickly on too big scale, which impacts both
> Maintainers
> >> and
> >> > > > Contributors of the project. I believe that nobody knows how to
> >> resolve
> >> > > > current situation but taking some actions can be discouraging for
> >> current
> >> > > > or future contributors (first PR which came up to my mind -
> >> > > > https://github.com/apache/airflow/pull/61039). Just take one step
> >> at the
> >> > > > time. I think that moving just faster will not resolve this issue.
> >> > > >
> >> > > > +1 for the Shahar proposal regarding the new PR Template. I would
> >> add to
> >> > > > it the gate in the CI for validation of the description e.g. if
> >> > > everything
> >> > > > is visible as it should be (I noticed that a lot of PRs do not
> have
> >> "Was
> >> > > > generative AI tooling..." part in desc e.g.
> >> > > > https://github.com/apache/airflow/pull/68492).
> >> > > >
> >> > > > P.S. For anyone interested the starting point for this message was
> >> > > > https://github.com/apache/airflow/pull/68074 PR.
> >> > > >
> >> > > > Best regards,
> >> > > > Przemek
> >> > > > ________________________________
> >> > > > From: Jarek Potiuk <[email protected]>
> >> > > > Sent: 12 June 2026 16:47
> >> > > > To: [email protected] <[email protected]>
> >> > > > Subject: Re: Discuss/proposal: Update our AI coding policy to
> >> "forbid"
> >> > > > agents opening PRs (not banning LLM generated-code)
> >> > > >
> >> > > > Hello everyone,
> >> > > >
> >> > > > I’m happy to share that I’ve implemented and tested a new
> iteration
> >> of
> >> > > our
> >> > > > triage process based on your feedback!  I hope this will help us
> to
> >> > > > continue getting benefits from the triage (100s of drive-by PRs
> >> moved out
> >> > > > of the pile, plus useful guidance to some human new
> collaborators) +
> >> > > > opportunity to automate deterministic parts in CI and continuously
> >> refine
> >> > > > it will be a good start to make more improvements.
> >> > > >
> >> > > > I hope this stabilizes things so we can move forward next with the
> >> PR
> >> > > > template updates and review process (as next steps) and help clear
> >> the
> >> > > > maintainer review queue together.
> >> > > >
> >> > > > Here’s a look at what’s new:
> >> > > >
> >> > > > 1.  Focused Communication: I’ve replaced repetitive comments with
> a
> >> > > > single, updateable description in the PR. It keeps track of the
> >> latest
> >> > > > status and responsible party, letting authors know exactly when
> >> they are
> >> > > > "ready for review."
> >> > > > 2.  Helpful Notifications: Authors are now assigned when action is
> >> needed
> >> > > > and unassigned once ready, ensuring they get the right
> >> notifications at
> >> > > the
> >> > > > right time.
> >> > > > 3.  Smarter Mentions: A new Python script (in deterministic hooks)
> >> > > ensures
> >> > > > maintainer IDs are formatted correctly - with (`@id` in backticks)
> >> to
> >> > > > prevent any accidental pings.
> >> > > > 4.  Approachable Tone: Comments are now shorter and more direct,
> >> > > balancing
> >> > > > friendly guidance with our expectations.
> >> > > > 5.  Reliability: The workflow remains consistent while making
> >> > > > responsibility even clearer for everyone.
> >> > > >
> >> > > > I’m still gathering stats to see what we can automate in the CI
> >> soon.
> >> > > > Today’s triage (66 actions out of 500) shows that more PRs are
> >> passing
> >> > > our
> >> > > > criteria than being filtered out—which confirms that our main goal
> >> is
> >> > > > simply making the most of our human attention!
> >> > > >
> >> > > > Triage Summary:
> >> > > >
> >> > > >   *     mark-ready: 21
> >> > > >   *     workflow-approvals: 20
> >> > > >   *     reruns: 3
> >> > > >   *     violation folds (draft/comment): 7
> >> > > >   *     request-author-confirmation: 4
> >> > > >   *     pings: 4
> >> > > >   *     stale-draft closes: 5
> >> > > >
> >> > > > You can see the new notes in action here for example (screenshots
> >> are
> >> > > also
> >> > > > attached): https://github.com/apache/airflow/pull/67790
> >> > > >
> >> > > > I hope we can continue refining it together, and I think that
> >> thread was
> >> > > a
> >> > > > good opportunity to surface some of the issues.
> >> > > >
> >> > > > Best regards,
> >> > > > Jarek
> >> > > >
> >> > > >
> >> > > > On Thu, Jun 11, 2026 at 3:32 PM 김준영 <[email protected]
> <mailto:
> >> > > > [email protected]>> wrote:
> >> > > > That makes a lot of sense — thanks for taking the time to explain
> >> the
> >> > > > reasoning in detail. I have a much better understanding of the
> >> > > > project's philosophy now.
> >> > > >
> >> > > > Junyeong Kim
> >> > > >
> >> > > > 2026년 6월 11일 (목) 오후 10:27, Jarek Potiuk <[email protected]<mailto:
> >> > > > [email protected]>>님이 작성:
> >> > > > >
> >> > > > > That said, I think the key distinction is who controls the
> >> assignee
> >> > > > > slot. Rather than contributors (or agents) requesting
> assignment,
> >> what
> >> > > > > if maintainers were the ones to grant it — based on their own
> >> current
> >> > > > > capacity? Each maintainer could self-regulate how many issues
> >> they're
> >> > > > > actively triaging at a given time. Even if agents flood the
> queue
> >> with
> >> > > > > requests, nothing moves forward without a maintainer actively
> >> choosing
> >> > > > > to open a slot.
> >> > > > >
> >> > > > > This is precisely the point. We want to cut the noise, not add
> >> more. A
> >> > > > > maintainer mechanically assigning an assignee to the person
> >> requesting
> >> > > it
> >> > > > > is precisely what we do not want to do. Especially since we have
> >> no way
> >> > > > of
> >> > > > > knowing whether the person requesting it is a real person or a
> >> bot.
> >> > > It's
> >> > > > > really not a problem to have several people (or agents) working
> >> on the
> >> > > > same
> >> > > > > issue simultaneously. We even prefer people opening PRs without
> >> prior
> >> > > > > issues, and really de-duplication of that work is not a goal for
> >> us.
> >> > > > > Contributors working on the same thing in parallel will learn -
> >> even
> >> > > from
> >> > > > > others doing parallel implementation (if they are humans) or
> lose
> >> their
> >> > > > own
> >> > > > > tokens (if they are agents. We care about people learning, we do
> >> not
> >> > > care
> >> > > > > about others directing their tokens into whatever they feel they
> >> want.
> >> > > > >
> >> > > > > In short - at least as I see it (but I would love to hear
> >> > > > others)—handling
> >> > > > > assignments manually adds maintainers more (dull and completely
> >> > > > mechanical)
> >> > > > > work, while freeing people using agents without understanding
> the
> >> > > > workflow
> >> > > > > to save their tokens and spam us even more. It also gives them
> >> fewer
> >> > > > > opportunities to learn, so it's not worth it—only losses, no
> >> gains.
> >> > > > >
> >> > > > > On Thu, Jun 11, 2026 at 2:17 PM 김준영 <[email protected]
> >> <mailto:
> >> > > > [email protected]>> wrote:
> >> > > > >
> >> > > > > > Subject: Re: [DISCUSS] Agents opening PRs
> >> > > > > >
> >> > > > > > Hi Jarek,
> >> > > > > >
> >> > > > > > Thanks for the thoughtful response — the point about agents
> >> instantly
> >> > > > > > re-requesting assignment is a real concern.
> >> > > > > >
> >> > > > > > That said, I think the key distinction is who controls the
> >> assignee
> >> > > > > > slot. Rather than contributors (or agents) requesting
> >> assignment,
> >> > > what
> >> > > > > > if maintainers were the ones to grant it — based on their own
> >> current
> >> > > > > > capacity? Each maintainer could self-regulate how many issues
> >> they're
> >> > > > > > actively triaging at a given time. Even if agents flood the
> >> queue
> >> > > with
> >> > > > > > requests, nothing moves forward without a maintainer actively
> >> > > choosing
> >> > > > > > to open a slot.
> >> > > > > >
> >> > > > > > This shifts the bottleneck to maintainer bandwidth, which is
> >> already
> >> > > > > > the real constraint anyway. And it naturally filters signal
> from
> >> > > noise
> >> > > > > > — maintainers would prioritize issues worth acting on.
> >> > > > > >
> >> > > > > > Could that be a workable middle ground?
> >> > > > > >
> >> > > > > > Junyeong Kim
> >> > > > > >
> >> > > > > > 2026년 6월 11일 (목) 오후 9:07, Jarek Potiuk <[email protected]
> >> <mailto:
> >> > > > [email protected]>>님이 작성:
> >> > > > > > >
> >> > > > > > > Hi everyone,
> >> > > > > > >
> >> > > > > > > Just a quick update that’s quite relevant to this discussion
> >> and
> >> > > > Ash’s
> >> > > > > > > concerns about AGENTS.md. I had a great call yesterday with
> >> Jason
> >> > > > and our
> >> > > > > > > GSoC intern, Roy. We’ve decided to focus his internship on
> >> > > optimizing
> >> > > > > > > AGENTS.md by extracting key sections and defining evals for
> >> them,
> >> > > > > > inspired
> >> > > > > > > by the mini-eval framework in Magpie. This should help make
> >> our
> >> > > > agentic
> >> > > > > > > instructions much more deterministic. Since agents can
> >> struggle
> >> > > with
> >> > > > very
> >> > > > > > > long instructions, splitting these into smaller, focused
> >> "skills"
> >> > > > should
> >> > > > > > > really help them follow our guidelines more reliably.
> >> > > > > > >
> >> > > > > > > We’ll share a formal announcement on the devlist soon. I’d
> >> love for
> >> > > > us
> >> > > > > > all
> >> > > > > > > to jump in on the reviews—it’s a great chance for us to
> learn
> >> > > > together
> >> > > > > > > about agent limitations and how to better manage them.
> >> > > > > > >
> >> > > > > > > Junyeong, thanks for the suggestion on reintroducing
> >> assignments.
> >> > > > While I
> >> > > > > > > understand the intent, I'm a little worried it might
> >> backfire. In
> >> > > the
> >> > > > > > past,
> >> > > > > > > "assign and disappear" was a challenge, but my bigger
> concern
> >> now
> >> > > is
> >> > > > that
> >> > > > > > > agents can "request assignment" almost instantly after
> >> de-assigning
> >> > > > and
> >> > > > > > > practically for free (deterministically). Previously,
> >> requesting
> >> > > > > > > assignments created a lot of noise and required maintainers
> >> to act.
> >> > > > > > > However, even if we automate this - like some other
> >> projects—agents
> >> > > > could
> >> > > > > > > effectively block issues indefinitely, making it much harder
> >> for
> >> > > real
> >> > > > > > human
> >> > > > > > > contributors to find an opening.
> >> > > > > > >
> >> > > > > > > But - looking forward to hearing more thoughts.
> >> > > > > > >
> >> > > > > > > Best regards,
> >> > > > > > >
> >> > > > > > > Jarek
> >> > > > > > >
> >> > > > > > >
> >> > > > > > > On Thu, Jun 11, 2026 at 1:39 AM 김준영 <[email protected]
> >> > > <mailto:
> >> > > > [email protected]>> wrote:
> >> > > > > > >
> >> > > > > > > > Hi all,
> >> > > > > > > >
> >> > > > > > > > Thanks for the discussion — as a contributor, I've found
> it
> >> > > really
> >> > > > > > > > helpful to understand how maintainers are thinking about
> >> this.
> >> > > > > > > >
> >> > > > > > > > One thing I've noticed from the contributor side: without
> an
> >> > > > assignee
> >> > > > > > > > system, there's no clear signal at the issue level that
> >> someone
> >> > > is
> >> > > > > > > > already working on something. That lower friction might be
> >> part
> >> > > of
> >> > > > > > > > what's making it easier for agent-driven PRs to slip
> through
> >> > > > without
> >> > > > > > > > prior discussion.
> >> > > > > > > >
> >> > > > > > > > I'm not sure of the full history behind removing
> assignees,
> >> but I
> >> > > > > > > > wonder if the original "assign and abandon" problem could
> >> have
> >> > > been
> >> > > > > > > > addressed with an auto-unassign policy (e.g. 2 weeks of
> >> > > inactivity)
> >> > > > > > > > rather than removing the system entirely. Reintroducing
> >> assignees
> >> > > > with
> >> > > > > > > > that kind of timeout might act as an upstream complement
> to
> >> the
> >> > > > > > > > PR-level checks being discussed here.
> >> > > > > > > >
> >> > > > > > > > Could that be worth revisiting alongside Jarek's proposal?
> >> > > > > > > >
> >> > > > > > > > Junyeong Kim
> >> > > > > > > >
> >> > > > > > > > 2026년 6월 11일 (목) 오전 8:20, Jarek Potiuk <[email protected]
> >> <mailto:
> >> > > > [email protected]>>님이 작성:
> >> > > > > > > > >
> >> > > > > > > > > > I was watching the mail train and I think that sounds
> >> good.
> >> > > > Hope
> >> > > > > > the
> >> > > > > > > > > > check can be made early e.g. during build info and if
> >> > > possible
> >> > > > can
> >> > > > > > we
> >> > > > > > > > > > (once setting to DRAFT) kill all successor steps to
> >> save CI
> >> > > > > > capacity?
> >> > > > > > > > >
> >> > > > > > > > > Excellent idea - absolutely, we can build it into
> >> > > > "selective-checks"
> >> > > > > > to
> >> > > > > > > > > "fail" and make a clear statement during failure. I
> hadn't
> >> > > > thought of
> >> > > > > > > > that.
> >> > > > > > > > > There were some ideas about "pull_request_target", but
> >> yes, you
> >> > > > are
> >> > > > > > > > > completely right - all that checks are deterministic and
> >> can be
> >> > > > part
> >> > > > > > of
> >> > > > > > > > the
> >> > > > > > > > > "buid info" job that we use to determine what to do with
> >> the
> >> > > PR.
> >> > > > > > Should
> >> > > > > > > > be
> >> > > > > > > > > very simple.
> >> > > > > > > > >
> >> > > > > > > > >
> >> > > > > > > > > On Wed, Jun 10, 2026 at 8:43 PM Jens Scheffler <
> >> > > > [email protected]<mailto:[email protected]>>
> >> > > > > > > > wrote:
> >> > > > > > > > >
> >> > > > > > > > > > Hi,
> >> > > > > > > > > >
> >> > > > > > > > > > I was watching the mail train and I think that sounds
> >> good.
> >> > > > Hope
> >> > > > > > the
> >> > > > > > > > > > check can be made early e.g. during build info and if
> >> > > possible
> >> > > > can
> >> > > > > > we
> >> > > > > > > > > > (once setting to DRAFT) kill all successor steps to
> >> save CI
> >> > > > > > capacity?
> >> > > > > > > > > >
> >> > > > > > > > > > Otherwise I hope we can most constructive, not
> >> "Fighting fire
> >> > > > with
> >> > > > > > > > fire"
> >> > > > > > > > > > but rather aim to improve agent descriptions to
> optimize
> >> > > > other's
> >> > > > > > token
> >> > > > > > > > > > budgets in favor of our requirements. We can not turn
> >> back
> >> > > > time and
> >> > > > > > > > need
> >> > > > > > > > > > to assume the level of agent contributions will stay
> >> forever
> >> > > in
> >> > > > > > future.
> >> > > > > > > > > >
> >> > > > > > > > > > Jens
> >> > > > > > > > > >
> >> > > > > > > > > > On 10.06.26 08:55, Jarek Potiuk wrote:
> >> > > > > > > > > > > Hi everyone,
> >> > > > > > > > > > >
> >> > > > > > > > > > > I’ve spent some time reflecting on all the great
> >> points
> >> > > > raised
> >> > > > > > here.
> >> > > > > > > > Our
> >> > > > > > > > > > > shared goals are to ensure human ownership and
> >> review, keep
> >> > > > > > agents as
> >> > > > > > > > > > > helpful assistants rather than sole authors, and
> >> reduce the
> >> > > > > > cognitive
> >> > > > > > > > > > load
> >> > > > > > > > > > > from long AI-generated descriptions.
> >> > > > > > > > > > >
> >> > > > > > > > > > > I really like Shahar's proposal and would love to
> >> build on
> >> > > it
> >> > > > > > with a
> >> > > > > > > > few
> >> > > > > > > > > > > suggestions to make our expectations clear and
> >> supportive
> >> > > > for our
> >> > > > > > > > human
> >> > > > > > > > > > > contributors:
> >> > > > > > > > > > >
> >> > > > > > > > > > >    - Explicit Instructions: Let’s be very open in
> our
> >> > > > templates
> >> > > > > > and
> >> > > > > > > > > > > AGENTS.md. We can instruct agents to pause and ask
> the
> >> > > human
> >> > > > to
> >> > > > > > > > write the
> >> > > > > > > > > > > description, noting that this personal touch is
> >> essential
> >> > > for
> >> > > > > > the PR
> >> > > > > > > > to
> >> > > > > > > > > > > stay open.
> >> > > > > > > > > > >    - Human Review Checkbox: I suggest adding a
> >> checkbox: "-
> >> > > > [ ] I
> >> > > > > > > > have
> >> > > > > > > > > > > reviewed this code myself." We’ll instruct agents to
> >> leave
> >> > > > this
> >> > > > > > for
> >> > > > > > > > the
> >> > > > > > > > > > > human to check, ensuring that vital moment of
> >> reflection.
> >> > > > > > > > > > >    - Instead of copy-pasting (which I find awkward),
> >> we can
> >> > > > > > instruct
> >> > > > > > > > the
> >> > > > > > > > > > > agents to use `gh --web`, `--template` (to include
> the
> >> > > > > > template), and
> >> > > > > > > > > > > `--draft` (following Pierre's idea). This creates
> >> natural
> >> > > > > > > > > > > checkpoints—filling the description, checking the
> box,
> >> > > > clicking
> >> > > > > > > > submit,
> >> > > > > > > > > > and
> >> > > > > > > > > > > undrafting—that encourage human involvement.
> >> > > > > > > > > > >
> >> > > > > > > > > > > We should also state the consequences for
> >> non-compliance:
> >> > > To
> >> > > > > > keep our
> >> > > > > > > > > > queue
> >> > > > > > > > > > > healthy, we should use an automated process to close
> >> PRs
> >> > > that
> >> > > > > > miss
> >> > > > > > > > these
> >> > > > > > > > > > > steps, with a note explaining how to resubmit them
> >> with
> >> > > human
> >> > > > > > input.
> >> > > > > > > > > > >
> >> > > > > > > > > > > All those expectations and closing etc. should be
> >> equally
> >> > > > > > applied to
> >> > > > > > > > all
> >> > > > > > > > > > > PRs, including maintainer PRs. This will also allow
> >> those
> >> > > of
> >> > > > us
> >> > > > > > who
> >> > > > > > > > use
> >> > > > > > > > > > > agents to monitor the process and refine the
> >> instructions
> >> > > if
> >> > > > we
> >> > > > > > see
> >> > > > > > > > any
> >> > > > > > > > > > > loopholes that agents try to bypass or learn how to
> >> > > > circumvent.
> >> > > > > > This
> >> > > > > > > > will
> >> > > > > > > > > > > allow us to continuously improve the process.
> >> > > > > > > > > > >
> >> > > > > > > > > > > I believe this approach balances productivity with
> the
> >> > > > > > high-quality
> >> > > > > > > > human
> >> > > > > > > > > > > collaboration we all value.
> >> > > > > > > > > > >
> >> > > > > > > > > > > What do you think?
> >> > > > > > > > > > >
> >> > > > > > > > > > > Best regards,
> >> > > > > > > > > > >
> >> > > > > > > > > > > Jarek
> >> > > > > > > > > > >
> >> > > > > > > > > > >
> >> > > > > > > > > > > On Tue, Jun 9, 2026 at 5:00 PM Shahar Epstein <
> >> > > > [email protected]<mailto:[email protected]>
> >> > > > > > >
> >> > > > > > > > wrote:
> >> > > > > > > > > > >
> >> > > > > > > > > > >> Here's a more concrete suggestion:
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Updating the PR template in such a way that:
> >> > > > > > > > > > >> 1. Human summary is now a MUST - at least a
> >> oneliner* (or
> >> > > > more,
> >> > > > > > > > > > depending
> >> > > > > > > > > > >> on the scope - TBD) that describes the suggested
> >> changes
> >> > > > > > written by
> >> > > > > > > > the
> >> > > > > > > > > > >> PR's author themselves (without AI assistance).
> >> > > > > > > > > > >> 2. AI summary is optional. However, when included -
> >> it
> >> > > MUST
> >> > > > be
> >> > > > > > bound
> >> > > > > > > > > > within
> >> > > > > > > > > > >> a collapsible box, mainly to save cognitive load
> for
> >> > > > > > maintainers and
> >> > > > > > > > > > >> contributors, but also to encourage human
> >> interaction like
> >> > > > we
> >> > > > > > used
> >> > > > > > > > to do
> >> > > > > > > > > > >> before it all started.
> >> > > > > > > > > > >> 3. PR's author (human) should be the one declaring
> >> the AI
> >> > > > usage
> >> > > > > > > > > > checkbox -
> >> > > > > > > > > > >> added a short statement of ownership.
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Contributors will be instructed to use this
> template
> >> and
> >> > > > adhere
> >> > > > > > to
> >> > > > > > > > the
> >> > > > > > > > > > >> instructions when creating a PR.
> >> > > > > > > > > > >> Agents may push branches to forks, but they will be
> >> > > > instructed
> >> > > > > > to
> >> > > > > > > > avoid
> >> > > > > > > > > > >> creating PRs on their own to the upstream
> >> repository, and
> >> > > > > > instead
> >> > > > > > > > > > provide
> >> > > > > > > > > > >> the link for creating the PR using this template
> >> (they
> >> > > could
> >> > > > > > > > suggest an
> >> > > > > > > > > > AI
> >> > > > > > > > > > >> summary, but the contributor should copy and paste
> it
> >> > > > manually
> >> > > > > > to
> >> > > > > > > > the
> >> > > > > > > > > > >> collapsible box). Trying to work around that might
> >> result
> >> > > > in an
> >> > > > > > M&M
> >> > > > > > > > test
> >> > > > > > > > > > >> directly in the PR's description (TBD).
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Example is available here <
> >> > > > > > > > https://github.com/apache/airflow/pull/68055>
> >> > > > > > > > > > -
> >> > > > > > > > > > >> I've made HTML comments visible, they will be
> hidden
> >> in
> >> > > the
> >> > > > real
> >> > > > > > > > thing.
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Took inspiration for this idea from
> >> > > > https://tenbluelinks.org/ ,
> >> > > > > > > > that
> >> > > > > > > > > > hides
> >> > > > > > > > > > >> the AI overview on Google if you're not interested
> >> > > > > > > > (highly-recommended
> >> > > > > > > > > > >> btw).
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Can we live with that?
> >> > > > > > > > > > >>
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> Shahar
> >> > > > > > > > > > >>
> >> > > > > > > > > > >> On Tue, Jun 9, 2026 at 3:30 PM Ash Berlin-Taylor <
> >> > > > > > [email protected]<mailto:[email protected]>>
> >> > > > > > > > > > wrote:
> >> > > > > > > > > > >>
> >> > > > > > > > > > >>> I don’t care one way or another about using AI as
> a
> >> tool
> >> > > > in CI,
> >> > > > > > > > that is
> >> > > > > > > > > > >>> secondary to my goal which is to try and do
> >> something to
> >> > > > make
> >> > > > > > it
> >> > > > > > > > clear
> >> > > > > > > > > > >> what
> >> > > > > > > > > > >>> we expect from people wanting to contribute to
> >> Airflow,
> >> > > > namely:
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> 1. Human involvement.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> By submitting a PR you are saying “yes I want to
> be
> >> a
> >> > > > member
> >> > > > > > of the
> >> > > > > > > > > > >>> community”. Agents submitting without human
> >> interaction
> >> > > go
> >> > > > > > against
> >> > > > > > > > > > this.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> 2. Human ownership.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> It is _your responsibility_ as the PR author to
> >> follow up
> >> > > > on
> >> > > > > > it,
> >> > > > > > > > > > address
> >> > > > > > > > > > >>> comments, and request reviews.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> I frankly find the AI generated triage comments
> >> verbose,
> >> > > > and a
> >> > > > > > > > waste
> >> > > > > > > > > > of
> >> > > > > > > > > > >>> time and pure noise even without the `@` spam.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> If the user doesn’t care enough about their own PR
> >> to
> >> > > > follow
> >> > > > > > up on
> >> > > > > > > > it:
> >> > > > > > > > > > >>> close it after some time. We don’t need to baby
> sit
> >> them.
> >> > > > Nor
> >> > > > > > do I
> >> > > > > > > > need
> >> > > > > > > > > > >> yet
> >> > > > > > > > > > >>> more commit email messages to read through.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> So how does it sound: It sounds like hell to me
> and
> >> an
> >> > > even
> >> > > > > > bigger
> >> > > > > > > > > > waste
> >> > > > > > > > > > >>> of electricity in a climate crisis.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> I want to be involved in a community of humans
> >> working to
> >> > > > build
> >> > > > > > > > > > software.
> >> > > > > > > > > > >>> I do not want to see LLMs producing so much output
> >> that
> >> > > > other
> >> > > > > > > > people
> >> > > > > > > > > > need
> >> > > > > > > > > > >>> LLMs to summarise it, with no humans looking at
> >> things.
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>> -ash
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>>> On 9 Jun 2026, at 13:18, Jarek Potiuk <
> >> [email protected]
> >> > > > <mailto:[email protected]>>
> >> > > > > > wrote:
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>>> Why? Because AI “instructions” cannot be
> trusted.
> >> And I
> >> > > > am
> >> > > > > > after
> >> > > > > > > > a
> >> > > > > > > > > > >>> signal
> >> > > > > > > > > > >>>> that people are blindly using LLMs without enough
> >> human
> >> > > > > > > > introversion.
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> But is not that what you are doing? This proposal
> >> is
> >> > > about
> >> > > > > > adding
> >> > > > > > > > > > >> another
> >> > > > > > > > > > >>>> AI instruction (just hidden in HTML) - how is
> that
> >> going
> >> > > > to
> >> > > > > > help?
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>>> You already updated the instructions to not `@`
> >> the
> >> > > > reviewer
> >> > > > > > here
> >> > > > > > > > > > >>>> Indeed, LLMs are not deterministic by nature. But
> >> they
> >> > > are
> >> > > > > > > > improvable.
> >> > > > > > > > > > >>>> Through iterations of refinement and adding more
> >> > > > guardrails
> >> > > > > > we can
> >> > > > > > > > > > >>> improve
> >> > > > > > > > > > >>>> it—and this is exactly why I am running it
> >> manually to
> >> > > > make it
> >> > > > > > > > better.
> >> > > > > > > > > > >>> This
> >> > > > > > > > > > >>>> is the same as in regular breeze development in
> the
> >> > > past.
> >> > > > > > > > Initially,
> >> > > > > > > > > > >>> there
> >> > > > > > > > > > >>>> were many small issues - and I remember how you
> >> > > complained
> >> > > > > > about
> >> > > > > > > > them
> >> > > > > > > > > > >> and
> >> > > > > > > > > > >>>> how unnecessary they seemed—yet we now perfected
> >> it over
> >> > > > time.
> >> > > > > > > > Now, it
> >> > > > > > > > > > >>>> allows all contributors and maintainers to work
> >> much
> >> > > more
> >> > > > > > > > efficiently
> >> > > > > > > > > > >> and
> >> > > > > > > > > > >>>> lose less time. BTW. Thanks for notifying me; I
> >> must
> >> > > > > > strengthen
> >> > > > > > > > this
> >> > > > > > > > > > >> one
> >> > > > > > > > > > >>>> and see why, as there might be another
> improvement
> >> to
> >> > > > > > implement.
> >> > > > > > > > This
> >> > > > > > > > > > >> is
> >> > > > > > > > > > >>>> also why we are not "yet" doing CI analysis by
> AI -
> >> > > > because I
> >> > > > > > > > want to
> >> > > > > > > > > > >>>> iterate on it and fix it in the way to know which
> >> parts
> >> > > > are
> >> > > > > > > > > > >>> deterministic.
> >> > > > > > > > > > >>>>> I want to do anything and everything to reduce
> the
> >> > > drive
> >> > > > by
> >> > > > > > > > > > >> contribution
> >> > > > > > > > > > >>>> with no human activity. I’m happy to spend my
> time
> >> > > helping
> >> > > > > > > > humans, but
> >> > > > > > > > > > >> if
> >> > > > > > > > > > >>>> they are just going to feed that back to an LLM
> >> and burn
> >> > > > an
> >> > > > > > > > egregious
> >> > > > > > > > > > >>>> amount of carbon: no thank you.
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> And again I am not sure how the proposal to add
> >> that
> >> > > > > > instruction
> >> > > > > > > > would
> >> > > > > > > > > > >>>> address this particular issue? Are you just
> >> proposing to
> >> > > > add
> >> > > > > > > > another
> >> > > > > > > > > > >>>> instruction for the LLM (or am I wrong?). How
> does
> >> it
> >> > > > solve
> >> > > > > > the
> >> > > > > > > > > > >> problem?
> >> > > > > > > > > > >>>>  From what I understand we have two basic
> proposals
> >> > > here -
> >> > > > > > that
> >> > > > > > > > > > >> contradict
> >> > > > > > > > > > >>>> each other:
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> * Ash - do not use AI to fight with AI at all
> >> > > > > > > > > > >>>> * Amoght, Shahar - use AI in CI
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> But I think, the triage I am running now shows a
> >> third
> >> > > > way:
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> * we use AI to try out and generate triage action
> >> and
> >> > > > figure
> >> > > > > > out
> >> > > > > > > > which
> >> > > > > > > > > > >>>> parts are practically 100% deterministic and can
> >> help
> >> > > with
> >> > > > > > triage
> >> > > > > > > > > > (this
> >> > > > > > > > > > >>> is
> >> > > > > > > > > > >>>> the stats I am gathering now)
> >> > > > > > > > > > >>>> * qe use AI to convert the SKILLS we have into
> >> > > > deterministic
> >> > > > > > CI
> >> > > > > > > > code
> >> > > > > > > > > > >> that
> >> > > > > > > > > > >>>> does those triage steps (no AI used at all at
> >> runtime)
> >> > > > > > > > > > >>>> * we continue perfecting the manually-triggered
> AI
> >> > > SKILLS
> >> > > > to
> >> > > > > > get
> >> > > > > > > > more
> >> > > > > > > > > > >> AI
> >> > > > > > > > > > >>>> heuristics that we can turn into deterministic CI
> >> code
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> This seems to fulfill seemingly contradictory
> >> > > expectations
> >> > > > > > that
> >> > > > > > > > > > >> different
> >> > > > > > > > > > >>>> people have in a nice way. I am about to produce
> >> stats
> >> > > > from
> >> > > > > > the
> >> > > > > > > > last
> >> > > > > > > > > > >> run
> >> > > > > > > > > > >>>> and was just about to propose this approach.
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> How does it sound Ash, Amogh, Shahar and others ?
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> J.
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>>
> >> > > > > > > > > > >>>> On Tue, Jun 9, 2026 at 12:55 PM Ash
> Berlin-Taylor <
> >> > > > > > [email protected]<mailto:[email protected]>
> >> > > > > > > > >
> >> > > > > > > > > > >>> wrote:
> >> > > > > > > > > > >>>>> Why? Because AI “instructions” cannot be
> trusted.
> >> And I
> >> > > > am
> >> > > > > > after
> >> > > > > > > > a
> >> > > > > > > > > > >>> signal
> >> > > > > > > > > > >>>>> that people are blindly using LLMs without
> enough
> >> human
> >> > > > > > > > introversion.
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> Want a prime example?
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> The pr triage skill.
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> You already updated the instructions to not `@`
> >> the
> >> > > > reviewer
> >> > > > > > here
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>
> >> > > > > > > > > >
> >> > > > > > > >
> >> > > > > >
> >> > > >
> >> > >
> >>
> https://github.com/apache/airflow-steward/blob/76cfa5e1d2e682b88df5205e9cda396df51a66b6/skills/pr-management-triage/comment-templates.md#reviewer-mention-policy
> >> > > > > > > > > > >>>>>> When a comment's only addressee is the PR
> author
> >> (the
> >> > > > > > > > > > >>>>> request-author-confirmation, reviewer-ping
> >> > > > author-primary,
> >> > > > > > and
> >> > > > > > > > > > >>> review-nudge
> >> > > > > > > > > > >>>>> author-primary templates), the body references
> the
> >> > > > reviewer
> >> > > > > > > > without
> >> > > > > > > > > > >>>>> @-mentioning them
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> And yet the LLM did it again:
> >> > > > > > > > > > >>>>>
> >> > > > > > > >
> >> > > >
> https://github.com/apache/airflow/pull/66633#discussion_r3344849352
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>>> @korex-f — A reviewer (@ashb) has requested
> >> changes on
> >> > > > this
> >> > > > > > PR,
> >> > > > > > > > so
> >> > > > > > > > > > >> I've
> >> > > > > > > > > > >>>>> removed the ready for maintainer review label —
> >> the
> >> > > next
> >> > > > > > step is
> >> > > > > > > > on
> >> > > > > > > > > > >> your
> >> > > > > > > > > > >>>>> side. Could you address the review comments
> (push
> >> a
> >> > > fix,
> >> > > > or
> >> > > > > > reply
> >> > > > > > > > > > >>> in-thread
> >> > > > > > > > > > >>>>> explaining why the feedback doesn't apply)? Once
> >> > > > addressed,
> >> > > > > > > > > > re-request
> >> > > > > > > > > > >>>>> review from @ashb or re-mark the PR ready and it
> >> > > returns
> >> > > > to
> >> > > > > > the
> >> > > > > > > > > > >>> maintainer
> >> > > > > > > > > > >>>>> queue. Thank you.
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> And frankly I’m tired of all this shit.
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> I want to do anything and everything to reduce
> the
> >> > > drive
> >> > > > by
> >> > > > > > > > > > >> contribution
> >> > > > > > > > > > >>>>> with no human activity. I’m happy to spend my
> time
> >> > > > helping
> >> > > > > > > > humans,
> >> > > > > > > > > > but
> >> > > > > > > > > > >>> if
> >> > > > > > > > > > >>>>> they are just going to feed that back to an LLM
> >> and
> >> > > burn
> >> > > > an
> >> > > > > > > > egregious
> >> > > > > > > > > > >>>>> amount of carbon: no thank you.
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>> -ash
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>>>> On 9 Jun 2026, at 10:38, Jarek Potiuk <
> >> > > [email protected]
> >> > > > <mailto:[email protected]>>
> >> > > > > > wrote:
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Hi Ash, Amogh, and Shahar,
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Ash, I'm curious to learn more about how the
> >> "brown
> >> > > m&m
> >> > > > > > test"
> >> > > > > > > > > > differs
> >> > > > > > > > > > >>>>> from
> >> > > > > > > > > > >>>>>> our current request for agents to identify
> >> themselves.
> >> > > > > > Could you
> >> > > > > > > > > > help
> >> > > > > > > > > > >>> me
> >> > > > > > > > > > >>>>>> understand the flow and the specific benefits
> >> you see?
> >> > > > It
> >> > > > > > feels
> >> > > > > > > > > > >> similar
> >> > > > > > > > > > >>>>> to
> >> > > > > > > > > > >>>>>> me, but I'd love to hear your perspective in
> >> case I'm
> >> > > > > > missing a
> >> > > > > > > > > > >> nuance.
> >> > > > > > > > > > >>>>>> Regarding the gh pr create --web approach, we
> >> included
> >> > > > those
> >> > > > > > > > > > >>> instructions
> >> > > > > > > > > > >>>>>> to ensure we meet ASF legal guidelines for
> Gen-AI
> >> > > > headers,
> >> > > > > > and
> >> > > > > > > > to
> >> > > > > > > > > > >>> support
> >> > > > > > > > > > >>>>>> contributors who might not have Copilot. That
> >> said, if
> >> > > > you
> >> > > > > > have
> >> > > > > > > > > > ideas
> >> > > > > > > > > > >>> on
> >> > > > > > > > > > >>>>>> how to trim the context or improve the
> >> templates, we
> >> > > > truly
> >> > > > > > > > > > appreciate
> >> > > > > > > > > > >>> PRs
> >> > > > > > > > > > >>>>>> that improve them—and many people already have.
> >> > > > AGENTS.md
> >> > > > > > is a
> >> > > > > > > > team
> >> > > > > > > > > > >>>>> effort,
> >> > > > > > > > > > >>>>>> and we’re always looking for ways to make it
> >> better.
> >> > > > Let's
> >> > > > > > keep
> >> > > > > > > > our
> >> > > > > > > > > > >>>>>> collaboration positive as we refine these
> >> processes
> >> > > > > > together.
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Amogh and Shahar, yep the idea of an validatio
> >> step in
> >> > > > the
> >> > > > > > CI
> >> > > > > > > > for
> >> > > > > > > > > > >>>>>> first-time contributions is something we should
> >> > > > implement
> >> > > > > > > > sooner or
> >> > > > > > > > > > >>>>> later.
> >> > > > > > > > > > >>>>>> I have actually been gathering stats on this
> for
> >> the
> >> > > > last
> >> > > > > > two
> >> > > > > > > > weeks.
> >> > > > > > > > > > >>> I’ve
> >> > > > > > > > > > >>>>>> been preparing to see how manually triggered
> >> triage
> >> > > > tasks
> >> > > > > > can
> >> > > > > > > > turn
> >> > > > > > > > > > >> into
> >> > > > > > > > > > >>>>>> automated ones—I'm gathering stats on when
> human
> >> > > > judgment is
> >> > > > > > > > needed.
> >> > > > > > > > > > >> I
> >> > > > > > > > > > >>>>>> shared some stats about this recently and will
> >> > > continue
> >> > > > > > > > gathering
> >> > > > > > > > > > >> them.
> >> > > > > > > > > > >>>>> The
> >> > > > > > > > > > >>>>>> next step is discussing here what and how we
> can
> >> > > > automate.
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Also, the current triage process already uses
> >> our Pull
> >> > > > > > Request
> >> > > > > > > > > > >> criteria
> >> > > > > > > > > > >>>>> to
> >> > > > > > > > > > >>>>>> pre-classify the PRs and only marks them with
> >> "ready
> >> > > for
> >> > > > > > > > maintainer
> >> > > > > > > > > > >>>>> review"
> >> > > > > > > > > > >>>>>> if those criteria are met. So, if there are any
> >> > > specific
> >> > > > > > > > criteria
> >> > > > > > > > > > >> you’d
> >> > > > > > > > > > >>>>>> like to see added to our "Pull request
> >> criteria," PRs
> >> > > > are
> >> > > > > > most
> >> > > > > > > > > > >> welcome
> >> > > > > > > > > > >>>>>> there as well.
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Best regards,
> >> > > > > > > > > > >>>>>>
> >> > > > > > > > > > >>>>>> Jarek
> >> > > > > > > > > > >>>>>
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>>
> >> > > > > > > >
> >> > > >
> >> ---------------------------------------------------------------------
> >> > > > > > > > > > >>> To unsubscribe, e-mail:
> >> > > [email protected]
> >> > > > <mailto:[email protected]>
> >> > > > > > > > > > >>> For additional commands, e-mail:
> >> > > > [email protected]<mailto:[email protected]>
> >> > > > > > > > > > >>>
> >> > > > > > > > > > >>>
> >> > > > > > > > > >
> >> > > > > > > > > >
> >> > > > > >
> >> ---------------------------------------------------------------------
> >> > > > > > > > > > To unsubscribe, e-mail:
> >> [email protected]
> >> > > > <mailto:[email protected]>
> >> > > > > > > > > > For additional commands, e-mail:
> >> [email protected]
> >> > > > <mailto:[email protected]>
> >> > > > > > > > > >
> >> > > > > > > > > >
> >> > > > > > > >
> >> > > > > > > >
> >> > > >
> >> ---------------------------------------------------------------------
> >> > > > > > > > To unsubscribe, e-mail:
> [email protected]
> >> > > <mailto:
> >> > > > [email protected]>
> >> > > > > > > > For additional commands, e-mail:
> >> [email protected]
> >> > > > <mailto:[email protected]>
> >> > > > > > > >
> >> > > > > > > >
> >> > > > > >
> >> > > > > >
> >> ---------------------------------------------------------------------
> >> > > > > > To unsubscribe, e-mail: [email protected]
> >> <mailto:
> >> > > > [email protected]>
> >> > > > > > For additional commands, e-mail: [email protected]
> >> <mailto:
> >> > > > [email protected]>
> >> > > > > >
> >> > > > > >
> >> > > >
> >> > > >
> >> ---------------------------------------------------------------------
> >> > > > To unsubscribe, e-mail: [email protected]
> <mailto:
> >> > > > [email protected]>
> >> > > > For additional commands, e-mail: [email protected]
> >> <mailto:
> >> > > > [email protected]>
> >> > > >
> >> > > >
> >> > >
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: [email protected]
> >> For additional commands, e-mail: [email protected]
> >>
> >>
>

Re: Discuss/proposal: Update our AI coding policy to "forbid" agents opening PRs (not banning LLM generated-code)

Reply via email to