Looking at this scenario, my conclusion is that the training program is effectively a compiler: the training data set is the source code it compiles, and the trained model is object code that it produces.

Thus, I agree that the trained model made from a private modified version of the training data set is unethical. It's a compiled program, released without sources.

I don't think it makes sense to try to prevent this problem by changing the license of either the training program or the inference program. That would be comparable to licensing an interpreter so that it can only be used to run free programs -- it wouldn't be wise, and (from what lawyers have told me) is not lawful use of copyright in the US.

--
Dr Richard Stallman
President, Free Software Foundation (gnu.org, fsf.org)
Internet Hall-of-Famer (internethalloffame.org)
Skype: No way! See stallman.org/skype.html.
