Simon McVittie <s...@debian.org> writes:
> On Sat, 19 Apr 2025 at 13:56:17 -0400, M. Zhou made this GR proposal:

>> The "AI models released under open source license without original
>> training data or program", a particular type of files as explained
>> above, are not seen as DFSG-compliant. Hence, they can not be included
>> in the "main" section of the Debian archive.

> Do we have an idea of whether/how many models that match this definition
> already exist in main? In the Policy process it's usual to require an
> estimate of how many packages a particular Policy change will make
> "insta-RC-buggy", and I think GRs that change our self-imposed rules for
> what we consider to be Free should do similarly.

gnubg, at least, comes with neural network weights that do not have source
code under this definition. I have to admit that while I was maintaining
the package I didn't give this a ton of a thought because it predates the
whole LLM craze. I'm not even sure if the data on which it's trained
(backgammon games, I think mostly against bots) is copyrightable.

I suspect no source technically exists for those weights anywhere, since
upstream's training work was fairly manual as I recall and is not
something they tend to work iteratively on, but it's possible that one of
the upstream developers has all of the data and scripts somewhere.

I'm not sure how many other old-school machine learning applications like
that we may have lurking around. I also have no strong opinion about what
we should do with such packages, to be clear; this should not be taken as
an objection to the proposed GR.

-- 
Russ Allbery (r...@debian.org)              <https://www.eyrie.org/~eagle/>

Reply via email to