RE: Re: [DISCUSS] AI-Assisted Contributions and AI Tooling Support in Apache Flink

Mehrtens, Mazrim Tue, 16 Jun 2026 08:25:55 -0700

Hi Martijn and everyone,
Thanks for driving this discussion! It's great to see the Flink community 
getting ahead of this topic. +1 from me on the proposal.
I wanted to share some context from my work at AWS, where we've been investing 
in exactly this space: making Apache Flink more accessible to AI tooling, 
specifically for end users building Flink applications.

We recently open-sourced an Agent Skill [5] for Apache Flink as part of the 
Agent Toolkit for AWS [1]. This skill provides AI coding agents with deep, 
structured context about developing applications on Managed Service for Apache 
Flink. It covers many of the challenging aspects of Flink development that I 
support customers with day in and day out, including core building blocks like 
dependency management and basic configurations, to best practices like operator 
UIDs and resource management, to operations best practices like monitoring and 
testing approaches, as well as specific use cases like Flink 1.x to 2.x 
migrations, Iceberg sinks, CDC sources, and so on. It's designed to be 
installed manually or discovered automatically by MCP documentation search 
tools (via the AWS MCP server [2]), which also search AWS documentation and 
blog content relevant to user prompts.

This builds on our v1 release [3], which took a similar "steering files" 
approach to giving AI agents project-specific context — conceptually very close 
to what AGENTS.md does. That v1 will be deprecated in favor of the Agent 
Toolkit release, but the experience we gained from it directly informs our 
thinking here. We also published a blog post [4] demonstrating how this skill 
can be used in practice both for creating new Flink projects from scratch and 
for analyzing existing applications for things like Flink 1.x to 2.x 
migrations, performance analysis, and identifying anti-patterns.

We're open-sourcing all of this because we want to share this approach (Agent 
Skills + Documentation via MCP) and content (the actual Flink Skill [1]) with 
the broader Flink community. We believe this kind of structured, 
agent-consumable project context is valuable for the entire Flink community, 
both in and outside of AWS, and the Agent Skills [5] framework specifically 
provides a great reusable open-source standard for this type of agentic 
development challenge.

I think there's an opportunity to align on a shared approach here. The work 
we've done on the "consumer" side (helping users build Flink apps) and what 
this thread proposes on the "contributor" side (helping developers contribute 
to Flink) are two faces of the same coin. If the community is interested, we'd 
love to collaborate on ensuring these efforts are complementary.

Happy to discuss further or answer any questions about the approach we've taken.

Best regards,
Mazrim

[1] 
https://github.com/aws/agent-toolkit-for-aws/tree/main/skills/specialized-skills/analytics-skills/developing-applications-on-managed-service-for-apache-flink
[2] 
https://docs.aws.amazon.com/agent-toolkit/latest/userguide/getting-started-aws-mcp-server.html
[3] 
https://github.com/awslabs/managed-service-for-apache-flink-agent-steering-files
[4] 
https://aws.amazon.com/blogs/big-data/build-streaming-applications-on-amazon-managed-service-for-apache-flink-with-ai-assisted-guidance/
[5] https://agentskills.io/home

On 2026/03/13 01:50:12 Leonard Xu wrote:
> Hi Martijn,
>
> Thanks for kicking off this discussion. I've been thinking along similar 
> lines recently, so you have a +1 from me on this proposal.
>
> I also have a suggestion regarding activity on the users' mailing list. Could 
> we consider introducing an AI agent to help answer users' questions? I've 
> noticed that many inquiries on user@flink currently go unanswered, yet most 
> of them could be effectively addressed by an agent.
>
>
> Best,
> Leonard
>
> > 2026 3月 13 05:03，Martijn Visser <[email protected]<mailto:[email protected]>> 
> > 写道：
> >
> > Hi all,
> >
> > I'd like to start a discussion about how the Flink community should handle
> > AI-assisted contributions and how we can make the Flink codebase more
> > accessible to AI tooling.
> >
> > The ASF has published guidance on generative AI tooling [1], and several
> > Apache projects have already adopted project-specific guidelines on top of
> > that. I think Flink should too.
> >
> > The most comprehensive example I've seen is Apache Airflow. They've added
> > an AGENTS.md [2] with instructions for AI coding agents, including PR
> > templates with an AI disclosure checkbox, a self-review checklist, and the
> > Generated-by: commit message token that the ASF guidance recommends. Apache
> > Iceberg recently adopted AI contribution guidelines [3] focused on
> > contributor accountability: you must be able to debug, explain, and own the
> > changes. Other projects like Paimon [4], Mahout [5], and Ozone [6] have
> > adopted similar policies.
> >
> > I'd like to propose the following for Flink:
> >
> > 1. Adopt contribution guidelines for AI-assisted PRs. Contributors must
> > disclose when AI tooling was used (using Generated-by: <Tool Name and
> > Version> in the commit message), and must be able to explain and take
> > ownership of all changes. AI-generated code is held to the same review
> > standards as human-written code.
> > 2. Add AGENTS.md files to the Flink repository. AGENTS.md [7] is a
> > convention for giving AI coding agents project-specific context. It can
> > contain information like build instructions, test commands, coding
> > conventions, commit message format. I think we should add one at the root
> > of apache/flink.
> > 3. Add module-level context for AI tooling. This is where I think we can
> > take a step forward. Each Flink module (e.g. flink-streaming-java,
> > flink-table-planner, flink-clients) would benefit from its own AGENTS.md
> > explaining the module's role, key abstractions, testing patterns, and
> > common pitfalls. This also serves as architectural documentation that helps
> > human contributors.
> >
> > I'm looking forward to hearing what others think about this.
> >
> > Best regards,
> >
> > Martijn
> >
> > [1] https://www.apache.org/legal/generative-tooling.html
> > [2] https://github.com/apache/airflow/blob/main/AGENTS.md
> > [3]
> > https://iceberg.apache.org/contribute/#guidelines-for-ai-assisted-contributions
> > [4]
> > https://github.com/apache/paimon/blob/master/.github/PULL_REQUEST_TEMPLATE.md?plain=1#L22
> > [5]
> > https://github.com/apache/mahout/blob/main/docs/community/pr-policy-and-review-guidelines.md
> > [6]
> > https://github.com/apache/ozone-site/blob/master/src/pages/release-notes/2.0.0.md?plain=1#L408
> > [7] https://agents.md/
>
>

RE: Re: [DISCUSS] AI-Assisted Contributions and AI Tooling Support in Apache Flink

Reply via email to