[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-18 Thread Kimmo Virtanen
Some interesting updates on current developments Open Assistant is open source ChatGPT clone with crowdsourced fine-tuning - https://open-assistant.io Redpajama is project for reproducing LLaMA and releasing the model under open source licence. Current status is that they have released the

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-03 Thread Samuel Klein
> > At this point I guess I would recommend adding five or so > g2.cores8.ram36.disk20 flavor VPSs to WMCS, with between one and three > RTX A6000 GPUs each, plus a 1TB SSD each, which should cost under > $60k. That should allow for very widely multilingual models somewhere > between GPT-3.5 and 4

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-02 Thread Lauren Worden
On Sun, Apr 2, 2023 at 5:40 PM Erik Moeller wrote: > > I can't comment on the hardware requirements, but I would note that in > addition to the llama.cpp repository > (https://github.com/ggerganov/llama.cpp), which currently focuses on > LLaMA/Alpaca, there are other efforts to reduce the

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-02 Thread Chris Keating
> > > > Are you infringing Stability AI's copyright by clicking this link? If > not, are you infringing Stability AI's copyright by then writing a > Python script that uses this file to generate images, if you only run > it locally on your GPU? > > Even if a court answers either question with

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-02 Thread Lauren Worden
On Sat, Apr 1, 2023 at 10:18 PM rupert THURNER wrote: > On Sat, Apr 1, 2023 at 11:36 PM Erik Moeller wrote: > > > > ... I am confident (based on, e.g., the recent > > results with Alpaca: https://crfm.stanford.edu/2023/03/13/alpaca.html) > > that the performance of smaller models will continue

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-01 Thread rupert THURNER
On Sat, Apr 1, 2023 at 11:36 PM Erik Moeller wrote: > Openly licensed models for machine translation like Facebook's M2M > (https://huggingface.co/facebook/m2m100_418M) or text generation like > Cerebras-GPT-13B (https://huggingface.co/cerebras/Cerebras-GPT-13B) > and GPT-NeoX-20B

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-01 Thread Erik Moeller
Lauren: > Erik, I see your point now and agree with you. But doesn't it seem > like obtaining a perfect license is at present the enemy of the urgent > good of bringing a concerted effort to bear on problems that are > clearly detrimental to project integrity? I don't think the licensing

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-01 Thread Chris Keating
On Fri, Mar 31, 2023 at 3:05 PM Erik Moeller wrote: > On Thu, Mar 30, 2023 at 12:25 PM Lauren Worden > wrote: > > > If you don't obtain this agreement, you cannot meaningfully enforce > > > the "license" because the downloader never agreed to it in the first > > > place. Moreover, you'll have

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-01 Thread Jan Ainali
Den lör 1 apr. 2023 kl 10:21 skrev Lauren Worden : > How would you characterize the harm of hosting BLOOM until a > comparable FOSS model is available? There are a few risks that could be harmful, although I don't think they are neither certain nor very direct. But, if we do give up our

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-04-01 Thread Lauren Worden
Erik, I see your point now and agree with you. But doesn't it seem like obtaining a perfect license is at present the enemy of the urgent good of bringing a concerted effort to bear on problems that are clearly detrimental to project integrity? I haven't been able to tell whether any of the

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-31 Thread Erik Moeller
On Fri, Mar 31, 2023 at 8:38 AM wrote: > Downloading computer programs and electronic databases (and downloading for > purposes outside > the listed exception) requires an express consent of the copyright holder, > i.e. a license. > In other words, you _cannot_ download a GPL program without

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-31 Thread petr . kadlec
Hi, On Thu, Mar 30, 2023 at 1:28 PM Erik Moeller wrote: > One core principle in open source licenses is that you are not > required to agree to the license in order to download or run copies. > The GPL makes this explicit: "You are not required to accept this > License in order to receive or

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-31 Thread Erik Moeller
On Thu, Mar 30, 2023 at 12:25 PM Lauren Worden wrote: > > If you don't obtain this agreement, you cannot meaningfully enforce > > the "license" because the downloader never agreed to it in the first > > place. Moreover, you'll have to make sure that _everyone else making > > copies of the file_

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-30 Thread Lauren Worden
On Thu, Mar 30, 2023 at 4:28 AM Erik Moeller wrote: > If you want to impose _additional restrictions_ on a person for stuff > they download from you, that actually requires proactive agreement > from the user to those restrictions at the time they download the > thing. > > If you don't obtain

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-30 Thread Jan Ainali
Den tors 30 mars 2023 kl 02:33 skrev Lauren Worden : > > Is the BLOOM RAIL license [ > https://huggingface.co/spaces/bigscience/license ] proprietary? > Yes. The common definition is that if it is not open source, it is proprietary. But you don't need to take my word for it. > So I expect the

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-30 Thread Erik Moeller
On Wed, Mar 29, 2023 at 1:49 PM Jan Ainali wrote: > On the contrary, I think it is important to, as early as possible, deter all > these attempts > to weaken the concept of "open" and that we as a movement need to take a hard > stance > against them. I agree with Jan on this. Licenses are the

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-30 Thread Kimmo Virtanen
Hi, >> My understanding is that is not proprietary, and the only reason it doesn't qualify for Open Source Initiative approval is because of these use restrictions: > > To generate or disseminate information or content, in any context (e.g. posts, articles, tweets, chatbots or other kinds of

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-29 Thread Lauren Worden
On Wed, Mar 29, 2023 at 1:50 PM Jan Ainali wrote: > > I think it is important to, as early as possible, deter all these attempts to > weaken the concept of "open" and that we as a movement need to take a hard > stance against them. > These proprietary licenses do not fit the spirit of sharing

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-29 Thread Jan Ainali
Hi Alek, nice to see you here! On the contrary, I think it is important to, as early as possible, deter all these attempts to weaken the concept of "open" and that we as a movement need to take a hard stance against them. These proprietary licenses do not fit the spirit of sharing all knowledge

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-29 Thread Alek Tarkowski
Hi, (I’m Alek from Open Future Foundation, I largely lurk here, so I want to say “Hi everyone!” first). Jan, you’re right that the RAIL license does not meet any FOSS definitions. But its authors, in their white paper, position this license not just as “responsible” but also “open”. And

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-28 Thread Jan Ainali
Den tis 28 mars 2023 kl 12:08 skrev Lauren Worden : > First, the Foundation should host a fork of BLOOM [ > https://huggingface.co/bigscience/bloom ], which if I remember correctly > was described by the Foundation's Machine Learning Director Chris Albon as > the only LLM at the scale of GPT-3

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-28 Thread Lauren Worden
Since proposals which don't fit in to existing discussions elsewhere are on topic here, I want to boldly recommend the following while the annual planning process is still ongoing, because it's far beyond the scope of what could be accomplished at a hackathon or on WMCS in a responsible fashion:

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-27 Thread Yael Weissburg
Hello again everyone, Thanks again to those who made it to the call last week - it felt like such a luxury to be able to drop deeply into this subject for an hour (plus) with all of you. For those who were unable to join, we captured extensive notes on Meta

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-24 Thread Paulo Santos Perneta
Yes, please, make this a regular event, at least for the time being. These discussions are incredibly useful, given the speed the developments are happening in this area, and the complexity of the challenges we are facing due to them. And thank's a lot for organizing the meeting yesterday! Paulo

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-23 Thread Samuel Klein
The Bau lab (that produced ROME) is great; see their update MEMIT https://memit.baulab.info scaling that approach. On Thu, Mar 23, 2023 at 3:43 PM Lauren Worden wrote: > On Thu, Mar 23, 2023 at 12:20 PM Samuel Klein wrote: > >> Thanks Yael and all for hosting this! A great conversation which

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-23 Thread Yael Weissburg
Thank you, all, for such a great conversation! I'd love to make this something we do regularly, and wonder if there would be appetite for rotating hosts? I softly nominate the Basque community to host the next one! One way or another, we'll find a way to make this more regular and will come back

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-23 Thread Lauren Worden
On Thu, Mar 23, 2023 at 12:20 PM Samuel Klein wrote: > Thanks Yael and all for hosting this! A great conversation which we > should revisit regularly. > Yes, I hope that this can be a (monthly?) regularly occurring event given the current state of very substantial advancements and improvements

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-23 Thread Samuel Klein
Thanks Yael and all for hosting this! A great conversation which we should revisit regularly. On Thu, Mar 9, 2023 at 4:40 PM Yael Weissburg wrote: > Hi Everyone, > > Last year, as part of our annual planning process, the Wikimedia > Foundation shared a list of external trends >

[Wikimedia-l] Re: 23 March: Invitation to Open Community Call on ChatGPT, generative AI, and Wikimedia

2023-03-21 Thread Elena Lappen
Hi everyone! Reminder that this conversation is coming up on Thursday! We will have a member of the Foundation's Legal team with us to discuss possible legal implications, many of which have been raised on this list over the past few days. You can still register for the Zoom room by emailing