Re: Merging AI-generated program code

Lukas-Fabian Moser Mon, 01 Jun 2026 00:50:52 -0700

Hi Dan,

thanks for bringing this up.

Now, I want to ask whether anyone sees any issue with merging AI-generated program code.

I think there are all sorts of issues, and I won't go into the broader issues regarding the copyright status of AI-generated code (which might contain copyrighted bits of training material), the environmental impact of AI data centers etc. Instead, I'll focus on the question of quality of code contributions, but these of course automatically pertain to how that quality gets assessed.

This is somewhat urgent in the sense that we have such an MR in draft, though of course we could delay it until there is certainty. I just wouldn't want the uncertainty to last so long that a capable contributor gets frustrated and leaves.

To my knowledge (and I have been paying only minimal attention), the FSF views AI-assisted contributions to GNU projects as potentially problematic but has not established a policy.

As a reviewer, I strongly desire two things:

1. openness about the origin of the code I'm reviewing
2. accountability of the human submitter (not reviewers)
   for the code that is merged

For the MR that is in draft now, there were tells in the patch, but I had to ask the submitter twice before he confirmed that it was "AI-assisted."

In my opinion, LilyPond should only contain code that a human understands. It is essential that the creator of an MR understands their code, and it is desirable that a code reviewer understands the code as well (the latter, of course, depending very much on the time and generosity of people willing to do reviews). This is not just a question about AI contributions: it means that LilyPond also shouldn't contain human-written code of the "I added that line and then the problem somehow went away, knock on wood" type. It's of course hard to enforce this, but a thorough review where questions can be raised and must be dealt with makes it more probable.

To streamline this in the future, I propose configuring a template for default MR descriptions something like this:


    ##### Description

    <!-- Describe your motivation and your work briefly
    to orient reviewers.  If you have not described
    your commits well, go back and do that first. -->

    ##### Question

    What percentage of this work is AI-generated?  <!-- 0-100 -->

Do you think that would effectively address that specific concern?

Of course, since the number given will (according to your proposal) influence how the MR is dealt with, we depend on getting an honest answer to that question. I don't want to seem paranoid, but maybe it would be wise to add - somewhere in the CG - a statement along the lines of: Commits with non-disclosed AI-generated code get refused (or may get reverted later).

In the discussions of the current MR I noticed the term "AI-assisted". Maybe it would be a good idea to distinguish various kinds and degrees of AI assistance: IIUC, not every way of using an LLM during development leads to longer, coherent blocks of AI-generated code.

Therefore, I suggest adopting a new policy: AI-generated program code does not automatically move forward without a human reviewer's acknowledgment. It should be full acknowledgment, not, for example, "C++ LGTM; don't know about Scheme."

It would fall to the "patch meister" to help people follow this policy and to allow sensible exceptions, such as if a contributor with a good record vouches for the quality of his own AI-generated submission in an area where he has developed expertise.

I support this. This basically means that you either have to motivate reviewers to look at your code, or you have to build a reputation by smaller patches that both show and increase your familiarity with the codebase.


Lukas
