> From: Philip Kaludercic <[email protected]>
> Cc: [email protected],  [email protected],  [email protected],
>   [email protected],  [email protected]
> Date: Sat, 24 Jul 2021 09:31:38 +0000
> 
> >> Not necessarily, if it generates a pure, top-level function. Someone
> >> could type something like "Sort list of postcodes" and it generates a
> >> Radix Sort function. And if this is part of some code that was copied a
> >> lot, the model might tend to generate this verbatim even more likely.
> >
> > A sort function must state at least the data type before it can be
> > compiled.  And if you are talking about pseudo-code that is data-type
> > agnostic, then that's an algorithm, and is not copyrightable, AFAIK.
> 
> No, I was thinking about concrete code, that depending on the language
> might even just rely on the standard library, especially if the language
> has generics. Seeing how often SO code has been found in random
> repositories[0], I don't think it is improbable that the trained models
> might notice these patterns.

Sorry, I don't understand what you have in mind.  Can you show an
example of useful code that could be copied verbatim into a program
without at least some renaming, without breaking the program?

Reply via email to