On 03/07/24 23:40, Matthew Brett wrote:
Hi,

We recently got a set of well-labeled PRs containing (reviewed)
AI-generated code:

https://github.com/numpy/numpy/pull/26827
https://github.com/numpy/numpy/pull/26828
https://github.com/numpy/numpy/pull/26829
https://github.com/numpy/numpy/pull/26830
https://github.com/numpy/numpy/pull/26831

Do we have a policy on AI-generated code?   It seems to me that
AI-code in general must be a license risk, as the AI may well generate
code that was derived from, for example, code with a GPL-license.

There is definitely the issue of copyright to keep in mind, but I see two other issues: the quality of the contributions and one moral issue.

IMHO the PR linked above are not high quality contributions: for example, the added examples are often redundant with each other. In my experience these are representative of automatically generate content: as there is little to no effort involved into writing it, the content is often repetitive and with very low information density. In the case of documentation, I find this very detrimental to the overall quality.

Contributions generated with AI have huge ecological and social costs. Encouraging AI generated contributions, especially where there is absolutely no need to involve AI to get to the solution, as in the examples above, makes the project co-responsible for these costs.

Cheers,
Dan

_______________________________________________
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com

Reply via email to