Simon McVittie <s...@debian.org> writes: > On Sat, 19 Apr 2025 at 13:56:17 -0400, M. Zhou made this GR proposal:
>> The "AI models released under open source license without original >> training data or program", a particular type of files as explained >> above, are not seen as DFSG-compliant. Hence, they can not be included >> in the "main" section of the Debian archive. > Do we have an idea of whether/how many models that match this definition > already exist in main? In the Policy process it's usual to require an > estimate of how many packages a particular Policy change will make > "insta-RC-buggy", and I think GRs that change our self-imposed rules for > what we consider to be Free should do similarly. gnubg, at least, comes with neural network weights that do not have source code under this definition. I have to admit that while I was maintaining the package I didn't give this a ton of a thought because it predates the whole LLM craze. I'm not even sure if the data on which it's trained (backgammon games, I think mostly against bots) is copyrightable. I suspect no source technically exists for those weights anywhere, since upstream's training work was fairly manual as I recall and is not something they tend to work iteratively on, but it's possible that one of the upstream developers has all of the data and scripts somewhere. I'm not sure how many other old-school machine learning applications like that we may have lurking around. I also have no strong opinion about what we should do with such packages, to be clear; this should not be taken as an objection to the proposed GR. -- Russ Allbery (r...@debian.org) <https://www.eyrie.org/~eagle/>