Re: GPLed code on github (given the copilot controversy)

2021-07-14 Thread jahoti
On 7/12/21 9:16 PM, marc wrote: Hi, me again So I am going to respond to multiple comments in one go: I had a look at Julia Reda's post, and as far as I can make out, she only focuses on the fact that individual snippets are very short - but doesn't make any mention that inserting *lots*

Re: GPLed code on github (given the copilot controversy)

2021-07-14 Thread Daniele "Mte90" Scasciafratte
I think that the point is mainly how much means a copyright issue to train a machine learning model that can recreate the original with a specific percentage of similarity bu chunks (and define what means a chunk for code). As example if GPL license says that you can use the 30% of code lines

Re: GPLed code on github (given the copilot controversy)

2021-07-13 Thread Paul Boddie
On Monday, 12 July 2021 23:16:22 CEST marc wrote: > Hi, me again > > So I am going to respond to multiple comments in one go: > > I had a look at Julia Reda's post, and as far as I can > make out, she only focuses on the fact that individual > snippets are very short - but doesn't make any

Re: GPLed code on github (given the copilot controversy)

2021-07-13 Thread jahoti
Hi, you've certainly raised some interesting and important questions, as well as some deep philosophical comments. Unfortunately they're (obviously) not all things I can help with; however, some clarifications might be useful: * The corpus of software is not part of the copilot "software"

Re: GPLed code on github (given the copilot controversy)

2021-07-13 Thread Michael Pöhn
Maybe future versions of GPL could cover this with an extended copyleft clause. I think it would be justified that software like copilot (including their datasets) also get GPL'd if they build on top of GPL source-code. br. Michael On 10.07.21 10:58, marc wrote: Hi The way I understand

Re: GPLed code on github (given the copilot controversy)

2021-07-13 Thread marc
Hi, me again So I am going to respond to multiple comments in one go: I had a look at Julia Reda's post, and as far as I can make out, she only focuses on the fact that individual snippets are very short - but doesn't make any mention that inserting *lots* snippets algorithmically is *all* that

Re: GPLed code on github (given the copilot controversy)

2021-07-13 Thread Daniele "Mte90" Scasciafratte
Also I experience something similar of CoPilot for the Mozilla Italia DeepSpeech Italian model (https://github.com/MozillaItalia/DeepSpeech-Italian-Model). When I studied how to deal with various audio+text/text-only italian datasets I talked a bit with other people of the Machine Learning

Re: GPLed code on github (given the copilot controversy)

2021-07-12 Thread Alex
On Mon, 12 Jul 2021 11:12:51 +0200 Paul Schaub wrote: > Hey, > > while I can't answer your questions, here is an article by Julie Reda, > arguing that Copilot is not in fact infringing copyright and that the > copyleft movement would not benefit from stricter copyright rules: > >

Re: GPLed code on github (given the copilot controversy)

2021-07-12 Thread Paul Schaub
Hey, while I can't answer your questions, here is an article by Julie Reda, arguing that Copilot is not in fact infringing copyright and that the copyleft movement would not benefit from stricter copyright rules: https://juliareda.eu/2021/07/github-copilot-is-not-infringing-your-copyright/

GPLed code on github (given the copilot controversy)

2021-07-12 Thread marc
Hi The way I understand this (I welcome corrections), is that copilot is a piece of proprietary software which was built using a corpus of software hosted on github. And if I understand it properly, this corpus of software includes code nominally released under the GPL and similar licenses. Is