[nexa] OpenAI disputes authors’ claims that every ChatGPT response is a derivative work

Fabio Alemagna Thu, 31 Aug 2023 06:22:29 -0700

This week, OpenAI finally responded to a pair of nearly identical
class-action lawsuits from book authors—including Sarah Silverman,
Paul Tremblay, Mona Awad, Chris Golden, and Richard Kadrey—who earlier
this summer alleged that ChatGPT was illegally trained on pirated
copies of their books.


In OpenAI's motion to dismiss (filed in both lawsuits), the company
asked a US district court in California to toss all but one claim
alleging direct copyright infringement, which OpenAI hopes to defeat
at "a later stage of the case."

The authors' other claims—alleging vicarious copyright infringement,
violation of the Digital Millennium Copyright Act (DMCA), unfair
competition, negligence, and unjust enrichment—need to be "trimmed"
from the lawsuits "so that these cases do not proceed to discovery and
beyond with legally infirm theories of liability," OpenAI argued.

OpenAI claimed that the authors "misconceive the scope of copyright,
failing to take into account the limitations and exceptions (including
fair use) that properly leave room for innovations like the large
language models now at the forefront of artificial intelligence."

According to OpenAI, even if the authors' books were a "tiny part" of
ChatGPT's massive data set, "the use of copyrighted materials by
innovators in transformative ways does not violate copyright." Unlike
plagiarists who seek to directly profit off distributing copyrighted
materials, OpenAI argued that its goal was "to teach its models to
derive the rules underlying human language" to do things like help
people "save time at work," "make daily life easier," or simply
entertain themselves by typing prompts into ChatGPT.

The purpose of copyright law, OpenAI argued, is "to promote the
Progress of Science and useful Arts" by protecting the way authors
express ideas, but "not the underlying idea itself, facts embodied
within the author’s articulated message, or other building blocks of
creative," which are arguably the elements of authors' works that
would be useful to ChatGPT's training model. Citing a notable
copyright case involving Google Books, OpenAI reminded the court that
"while an author may register a copyright in her book, the
'statistical information' pertaining to 'word frequencies, syntactic
patterns, and thematic markers' in that book are beyond the scope of
copyright protection."

"Under the resulting judicial precedent, it is not an infringement to
create 'wholesale cop[ies] of [a work] as a preliminary step' to
develop a new, non-infringing product, even if the new product
competes with the original," OpenAI wrote.

Continua qui: 
https://arstechnica.com/tech-policy/2023/08/openai-disputes-authors-claims-that-every-chatgpt-response-is-a-derivative-work/

Faccio notare che questa è anche la posizione di Creative Commons:
https://creativecommons.org/2023/02/17/fair-use-training-generative-ai/

Fabio
_______________________________________________
nexa mailing list
[email protected]
https://server-nexa.polito.it/cgi-bin/mailman/listinfo/nexa

[nexa] OpenAI disputes authors’ claims that every ChatGPT response is a derivative work

Reply via email to