Why? Because AI “instructions” cannot be trusted. And I am after a signal that people are blindly using LLMs without enough human introversion.
Want a prime example? The pr triage skill. You already updated the instructions to not `@` the reviewer here https://github.com/apache/airflow-steward/blob/76cfa5e1d2e682b88df5205e9cda396df51a66b6/skills/pr-management-triage/comment-templates.md#reviewer-mention-policy > When a comment's only addressee is the PR author (the > request-author-confirmation, reviewer-ping author-primary, and review-nudge > author-primary templates), the body references the reviewer without > @-mentioning them And yet the LLM did it again: https://github.com/apache/airflow/pull/66633#discussion_r3344849352 > @korex-f — A reviewer (@ashb) has requested changes on this PR, so I've > removed the ready for maintainer review label — the next step is on your > side. Could you address the review comments (push a fix, or reply in-thread > explaining why the feedback doesn't apply)? Once addressed, re-request review > from @ashb or re-mark the PR ready and it returns to the maintainer queue. > Thank you. And frankly I’m tired of all this shit. I want to do anything and everything to reduce the drive by contribution with no human activity. I’m happy to spend my time helping humans, but if they are just going to feed that back to an LLM and burn an egregious amount of carbon: no thank you. -ash > On 9 Jun 2026, at 10:38, Jarek Potiuk <[email protected]> wrote: > > Hi Ash, Amogh, and Shahar, > > Ash, I'm curious to learn more about how the "brown m&m test" differs from > our current request for agents to identify themselves. Could you help me > understand the flow and the specific benefits you see? It feels similar to > me, but I'd love to hear your perspective in case I'm missing a nuance. > > Regarding the gh pr create --web approach, we included those instructions > to ensure we meet ASF legal guidelines for Gen-AI headers, and to support > contributors who might not have Copilot. That said, if you have ideas on > how to trim the context or improve the templates, we truly appreciate PRs > that improve them—and many people already have. AGENTS.md is a team effort, > and we’re always looking for ways to make it better. Let's keep our > collaboration positive as we refine these processes together. > > Amogh and Shahar, yep the idea of an validatio step in the CI for > first-time contributions is something we should implement sooner or later. > I have actually been gathering stats on this for the last two weeks. I’ve > been preparing to see how manually triggered triage tasks can turn into > automated ones—I'm gathering stats on when human judgment is needed. I > shared some stats about this recently and will continue gathering them. The > next step is discussing here what and how we can automate. > > Also, the current triage process already uses our Pull Request criteria to > pre-classify the PRs and only marks them with "ready for maintainer review" > if those criteria are met. So, if there are any specific criteria you’d > like to see added to our "Pull request criteria," PRs are most welcome > there as well. > > Best regards, > > Jarek
