GitHub user sfirke edited a discussion: Expectations for first-time Superset pull request authors in the age of AI
## Background This is prompted by a few PRs I've seen recently after tagging an issue "good first issue" and by these two blog posts: - The Zig project's take on [Contributor Poker and AI](https://kristoff.it/blog/contributor-poker-and-ai/) - [Simon Willson's writeup of that post](https://simonwillison.net/2026/Apr/30/zig-anti-ai/) I am not proposing a ban on AI-generated code from new contributors. For one thing, the majority of top contributors to Superset seem to be using AI to write and review code and even to comment on GitHub issues. I write my code contributions with AI. But I think there is a phenomenon to discuss. ## Recent Superset experiences The blog posts note that we (Superset committers) can probably move the project faster if we write PRs ourselves rather than review and give feedback on PRs from first-time authors. The value proposition for us is that we're growing potential long-term contributors who will over time give more to the project than they require in review. That's why I tag issues "good first issue" instead of just fixing them myself, to get new people involved (at a time cost to myself as I work with the new author). It raises the question of what we expect from first-time PR authors. I have recently seen PRs that consist only of: - An AI-generated code output that has not been manually tested - AI-generated comments in the PR When a human generates the AI code they can still add value: manually testing the output via a local Superset deployment, working through edge cases, or adding human comments to an issue or PR. These authors did none of these. I have twice now asked for screenshots and the contributors admitted they don't have a local instance of Superset running. To me that's unacceptable, to send code that you can't or won't even deploy yourself (except for maybe editing the docs, which was one of the cases). If setting up a dev environment is too big a hurdle then you should find another project to work on. ## What to do about it I would love to have some statement for potential new contributors that sets a certain bar. It's not _about_ AI but it is only relevant in this era -- before AI you knew they had a certain amount invested already. For starters I would say the expectations include: - You have checked for existing open PRs that do the same thing - You have deployed the changes in your own Superset environment and tested the new behavior directly as a user. Include screenshots if relevant - You understand all of the code you are submitting - You get `pre-commit` checks passing before a committer reviews - If you are using AI, also use that AI to conduct an adversarial review of your code and address anything it finds Apache Airflow asks PR authors if they used AI to generate the code. At this point I assume any new code is AI generated so I actually don't think we need that. Agree? Disagree? Other thoughts? GitHub link: https://github.com/apache/superset/discussions/39784 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
