Nice! I like how well you have escalated and handled this, Jarek. This is cool: https://thenewstack.io/ai-is-spamming-open-source-repos-with-fake-issues/
Thanks & Regards,
Amogh Desai

On Tue, Feb 4, 2025 at 8:07 PM Jarek Potiuk <ja...@potiuk.com> wrote:

> Here is the article in The New Stack:
> https://thenewstack.io/ai-is-spamming-open-source-repos-with-fake-issues/
>
> On Fri, Jan 31, 2025 at 12:39 PM Jarek Potiuk <ja...@potiuk.com> wrote:
>
> > Hey,
> >
> > I am at FOSDEM now but I have some progress and more information about
> > the whole stuff:
> >
> > Some facts first (without any judgment from my side):
> >
> > * my posts on social media reached many, many people (thanks to you
> >   posting them, and to people from other foundations) - the reach is
> >   amazing - more than 200,000 views, and Outlier.ai and Scale.ai noticed
> > * Scale.ai reached out to me. They apologized for what they have done
> >   (in private so far), and admitted it has been their fault entirely.
> >   They admitted their instructions were poorly written and led the
> >   people they hired to submit issues where they were never supposed to
> >   do it
> > * We had a call where we discussed why and how it happened, and they
> >   also explained that they changed those instructions, adding a bright,
> >   flashy "do not submit issues" warning and explicit instructions there
> > * they could not (would not) really explain what exactly the project
> >   and the generated issues were for - initially they mentioned training
> >   agents to behave like humans do and create issues that have a higher
> >   probability of being accepted, but they backed out later, saying that
> >   it's only about "assessing" issues
> > * they promised to come back to me with more details - and questions
> >   about what the extent of the project was and what they want to achieve
> > * I was contacted (thanks to an unnamed ASF member who put us in
> >   contact) by a journalist from The New Stack writing a story about it.
> >   I had a call with her and I also put them in contact with Scale.ai so
> >   that they can share their part of the story as well
> > * GitHub is reacting - faster and faster, it seems - and blocking those
> >   users based on our reports. They got some understanding of the
> >   situation, started to see the patterns, and now block those users
> >   when reported. Even today I got 3 confirmations about blocked accounts.
> >
> > My take (and this is speculation from my side, not fact):
> >
> > * they WERE training agent AI to be able to submit issues (even if now
> >   they deny it)
> > * this is a VERY bad idea that will make our lives harder when put into
> >   practice - the probabilistic / hallucinating nature of it will make it
> >   counter-productive and create a lot of overhead for us, maintainers
> > * we should work with local AI "good" partners to build a system to
> >   detect and prevent those kinds of issues and "help" maintainers to
> >   battle them - there will be more and more of those cases happening
> > * we should continue reporting those cases to GitHub - this eventually
> >   works as well
> >
> > This shows the power of the community! I will keep you posted.
> >
> > J.
> >
> > On Thu, Jan 30, 2025 at 5:26 PM Ryan Hatter
> > <ryan.hat...@astronomer.io.invalid> wrote:
> >
> >> Here's a boilerplate response that I'm going to start using moving
> >> forward:
> >>
> >> > Hello,
> >> >
> >> > This Issue appears to be AI-generated spam. If this is a mistake,
> >> > please let us know; otherwise, any further spam Issues may lead us
> >> > to report your account to GitHub.
> >>
> >> On Mon, Jan 27, 2025 at 10:16 AM Jarek Potiuk <ja...@potiuk.com> wrote:
> >>
> >> > FYI. My friend - who is an engineering manager at GitHub in the "AI"
> >> > space - saw my post and reached out to their internal team fighting
> >> > spam. I hope they will be able to do something about it as well.
> >> >
> >> > On Mon, Jan 27, 2025 at 8:21 AM Jarek Potiuk <ja...@potiuk.com> wrote:
> >> >
> >> > > > Given what I read about the company, they will most likely
> >> > > > ignore you. Some people think the company is a scam, and they
> >> > > > seem to engage in unethical business practices.
> >> > >
> >> > > Yeah, but it absolutely does not mean we should give up and do
> >> > > nothing.
> >> > >
> >> > > Hopefully that will give at least some people pause when they
> >> > > apply for Outlier.ai jobs. There are a few people there who seem
> >> > > to have a "real-ish" history of contribution, and they actually
> >> > > came back after our reaction and apologised, as they were tricked
> >> > > into doing it by the instructions from Outlier.
> >> > >
> >> > > So at least we can raise awareness of the whole situation. That's
> >> > > already something. Even if they shrug at all the reaction (which
> >> > > is amazing - I did not know we have so many people caring about
> >> > > maintainers).
> >> > >
> >> > > And hopefully GitHub (I have a few friends from GitHub reacting to
> >> > > it) will also take notice, make it harder for "new" people, and
> >> > > maybe even detect such spam on their own. Eventually we are - and
> >> > > this is just the beginning - in an arms race where we will have to
> >> > > employ AI to battle the AI created by "rogue" players - it's
> >> > > inevitable. It will happen, absolutely no doubt about it. So we
> >> > > have to strengthen our efforts to make people aware of those
> >> > > dangers and prepare to react to them.
> >> > >
> >> > > We (and a few other Apache projects) already work with DoSu - they
> >> > > are a great, "open-source friendly" AI company. For quite a few
> >> > > months their AI has been classifying our issues and putting the
> >> > > right labels on them (and they do a great job of that), so it's
> >> > > only a matter of their focus (and I spoke to Devin, the creator of
> >> > > DoSu) for them to be able to analyse those issues and maybe even
> >> > > automatically mark them as "AI spam". In fact, their algorithms
> >> > > already learn from us marking issues with the "AI spam" label, and
> >> > > if they enable this label and we have enough cases, their AI will
> >> > > likely help by setting that "AI spam" label automatically.
> >> > >
> >> > > So ... I think being active, sharing things like that, learning,
> >> > > and thinking about how to battle those cases is something we all
> >> > > should learn how to do. It will be needed. This is just a canary
> >> > > in the coal mine.
> >> > >
> >> > > J.
> >> > >
> >> > > On Mon, Jan 27, 2025 at 12:57 AM Justin Mclean
> >> > > <jus...@classsoftware.com> wrote:
> >> > >
> >> > >> Hi,
> >> > >>
> >> > >> Given what I read about the company, they will most likely ignore
> >> > >> you. Some people think the company is a scam, and they seem to
> >> > >> engage in unethical business practices.
> >> > >>
> >> > >> Kind Regards,
> >> > >> Justin
> >> > >>
> >> > >> ---------------------------------------------------------------------
> >> > >> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
> >> > >> For additional commands, e-mail: dev-h...@airflow.apache.org