Re: [Wikidata] [Wikimedia-l] Solve legal uncertainty of Wikidata

Sebastian Hellmann Thu, 17 May 2018 23:07:33 -0700

Hi Denny,


On 18.05.2018 02:54, Denny Vrandečić wrote:

Rob Speer wrote:
> The result of this, by the way, is that commercial entities sell modified
> versions of Wikidata with impunity. It undermines the terms of other
> resources such as DBPedia, which also contains facts extracted from
> Wikipedia and respects its Share-Alike terms. Why would anyone use DBPedia
> and have to agree to share alike, when they can get similar data from
> Wikidata which promises them it's CC-0?

The comparison to DBpedia is interesting: the terms for DBpedia state
"Attribution in this case means keep DBpedia URIs visible and activethrough at least one (preferably all) of @href, <link />, or "Link:".If live links are impossible (e.g., when printed on paper), a textualblurb-based attribution is acceptable."
http://wiki.dbpedia.org/terms-imprint
So according to these terms, when someone displays data from DBpedia,it is entirely sufficient to attribute DBpedia.
What that means is that DBpedia follows exactly the same theory asWikidata: it is OK to extract data from Wikipedia and republish it asyour own dataset under your own copyright without requiringattribution to the original source of the extraction.
(A bit more problematic might be the fact that DBpedia alsorepublishes whole paragraphs of Text under these terms, but that'sanother story)
My understanding is that all that Wikidata has extracted fromWikipedia is non-copyrightable in the first place and thusrepublishing it under a different license (or, as in the case ofDBpedia for simple triples, with a different attribution) is legallysound.

In the SmartDataWeb project https://www.smartdataweb.de/ we hiredlawyers to write a legal review about the extraction situation. Factscan be extracted and republished under CC-0 without problem as is thecase of infoboxes.. Copying a whole database is a different becausedatabase rights hold. If you only extract ~ two sentences it falls undercitation, which is also easy. If it is more than two sentence, thencopyright applies.

I can check whether it is ready and shareable. The legal review(Gutachten) is quite a big thing as it has some legal relevancy and canbe cited in court.

Hence we can switch to ODC-BY with facts as CC-0 and the text asshare-alike. However the attribution mentioned in the imprint is stillfine, since it is under database and not the content/facts.I am still uncertain about the attribution. If you remix and publish youneed to cite the direct sources. But if somebody takes from you, does heonly attribute to you or to everybody you used in a transitive way.

Anyhow, we are sharpening the whole model towards technology, notdata/content. So the databus will be a transparent layer and it is mucheasier to find the source like Wikipedia and Wikidata and docontributions there, which is actually one of the intentions ofshare-alike (getting work pushed back/upstream).


All the best,
Sebastian

If there is disagreement with that, I would be interested whichcontent exactly is considered to be under copyright and where licensehas not been followed on Wikidata.

For completion: the discussion is going on in parallel on the Wikidataproject chat and in Phabricator:


https://phabricator.wikimedia.org/T193728#4212728

https://www.wikidata.org/wiki/Wikidata:Project_chat#Wikipedia_and_other_Wikimedia_projects


I would appreciate if we could keep the discussion in a single place.

Gnom1 on Phabricator has offered to actually answer legal questions,but we need to come up with the questions that we want to ask. If itshould be, for example, as Rob Speer states on the bug, "has thecopyright of interwiki links been breached by having them be moved toWikidata?", I'd be quite happy with that question - if that's thedisagreement, let us ask Legal help and see if my understanding oryours is correct.

Does this sound like a reasonable question? Or which other questionwould you like to ask instead?

On Thu, May 17, 2018 at 4:15 PM Rob Speer <[email protected]<mailto:[email protected]>> wrote:


    > As always, copyright is predatory. As we can prove that
    copyright is the
    enemy of science and knowledge

    Well, this kind of gets to the heart of the issue, doesn't it.

    I support the Creative Commons license, including the share-alike
    term,
    which requires copyright in order to work, and I've contributed to
    multiple
    Wikimedia projects with the understanding that my work would be
    protected
    by CC-By-SA.

    Wikidata is engaged in a project-wide act of disobedience against
    CC-By-SA.
    I would say that GerardM has provided an excellent summary of the
    attitude
    toward Creative Commons that I've encountered on Wikidata: "it's
    holding us
    back", "it's the enemy", "you can't copyright knowledge", "you
    can't make
    us follow it", etc.

    The result of this, by the way, is that commercial entities sell
    modified
    versions of Wikidata with impunity. It undermines the terms of other
    resources such as DBPedia, which also contains facts extracted from
    Wikipedia and respects its Share-Alike terms. Why would anyone use
    DBPedia
    and have to agree to share alike, when they can get similar data from
    Wikidata which promises them it's CC-0?

    On Wed, 16 May 2018 at 21:43 Gerard Meijssen
    <[email protected] <mailto:[email protected]>>
    wrote:

    > Hoi,
    > Thank you for the overly broad misrepresentation. As always,
    copyright is
    > predatory. As we can prove that copyright is the enemy of
    science and
    > knowledge we should not be upset that *copyright *is abused we
    should
    > welcome it as it proves the point. Also when we use texts from
    everywhere
    > and rephrase it in Wikipedia articles "we" are not lily white
    either.
    >
    > In "them old days" generally we felt that when people would use
    Wikipedia,
    > it would only serve our purpose; share the sum of all knowledge.
    I still
    > feel really good about that. And, it has been shown that what we do;
    > maintain / curate / update that data that it is not easily given
    to do as
    > well as "we" do it.
    >
    > When we are to be more precise with our copyright, there are a
    few things
    > we could do to make copyright more transparent. When data is to
    be uploaded
    > (Commons / Wikipedia or Wikidata) we should use a user that is
    OWNED and
    > operated by the copyright holder. The operation may be by proxy
    and as a
    > consequence there is no longer a question about copyright as the
    copyright
    > holder can do as we wants. This makes any future noises just that,
    > annoying.
    >
    > As to copyright on Wikidata, when you consider copyright using
    data from
    > Wikipedia. The question is: "What Wikipedia" I have copied a lot
    of data
    > from several Wikipedias and believe me, from a quality point of
    view there
    > is much to be gained by using Wikidata as an instrument for good
    because it
    > is really strong in identifying friends and false friends. It is
    superior
    > as a tool for disambiguation.
    >
    > About the copyright on data, the overriding question with data
    is: do you
    > copy data wholesale in Wikidata. That is what a database
    copyright is
    > about. As I wrote on my blog [1], the best data to include is
    data that is
    > corroborated by the fact that it is present in multiple sources.
    This
    > negates the notion of a single source, it also underscores that
    much of the
    > data everywhere is replicated a lot. It also underscores, again,
    the notion
    > that data that is only present in single sources is what needs
    attention.
    > It needs tender loving care, it needs other sources to establish
    > credentials. That is in its own right what makes any claim of
    copyright
    > moot. It is in this process that it becomes a "creative" process
    negating
    > the copyright held on databases.
    >
    > I welcome the attention that is given to copyright in Wikidata.
    However our
    > attention to copyright is predatory in two ways. It is how can
    we get
    > around existing copyright and how can we protect our own.  As
    argued,
    > Wikidata shines when it is used for what it is intended to be;
    the place
    > that brings data, of Wikipedias first and elsewhere second,
    together to be
    > used as a repository of quality, open and linked data.
    > Thanks,
    >        GerardM
    >
    > [1]
    >
    >
    
https://ultimategerardm.blogspot.nl/2018/05/wikidata-copyright-and-linked-data.html
    >
    > On 11 May 2018 at 23:10, Rob Speer <[email protected]
    <mailto:[email protected]>> wrote:
    >
    > > Wow, thanks for the heads up. When I was getting upset about
    projects
    > that
    > > change the license on Wikimedia content and commercialize it,
    I had no
    > idea
    > > that Wikidata was providing them the cover to do so. The
    Creative Commons
    > > violation is coming from inside the house!
    > >
    > > On Tue, 8 May 2018 at 03:48 mathieu stumpf guntz <
    > > [email protected]
    <mailto:[email protected]>> wrote:
    > >
    > > > Hello everybody,
    > > >
    > > > There is a phabricator ticket on Solve legal uncertainty of
    Wikidata
    > > > <https://phabricator.wikimedia.org/T193728> that you might be
    > interested
    > > > to look at and participate in.
    > > >
    > > > As Denny suggested in the ticket to give it more visibility
    through the
    > > > discussion on the Wikidata chat
    > > > <
    > > > https://www.wikidata.org/wiki/Wikidata:Project_chat#
    > > Importing_datasets_under_incompatible_licenses>,
    > > >
    > > > I thought it was interesting to highlight it a bit more.
    > > >
    > > > Cheers
    > > >
    > > > _______________________________________________
    > > > Wikimedia-l mailing list, guidelines at:
    > > > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and
    > > > https://meta.wikimedia.org/wiki/Wikimedia-l
    > > > New messages to: [email protected]
    <mailto:[email protected]>
    > > > Unsubscribe:
    https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
    > > > <mailto:[email protected]
    <mailto:[email protected]>?subject=unsubscribe>
    > > _______________________________________________
    > > Wikimedia-l mailing list, guidelines at:
    https://meta.wikimedia.org/
    > > wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
    > > wiki/Wikimedia-l
    > > New messages to: [email protected]
    <mailto:[email protected]>
    > > Unsubscribe:
    https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
    > > <mailto:[email protected]
    <mailto:[email protected]>?subject=unsubscribe>
    > _______________________________________________
    > Wikimedia-l mailing list, guidelines at:
    > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and
    > https://meta.wikimedia.org/wiki/Wikimedia-l
    > New messages to: [email protected]
    <mailto:[email protected]>
    > Unsubscribe:
    https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
    > <mailto:[email protected]
    <mailto:[email protected]>?subject=unsubscribe>
    _______________________________________________
    Wikimedia-l mailing list, guidelines at:
    https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and
    https://meta.wikimedia.org/wiki/Wikimedia-l
    New messages to: [email protected]
    <mailto:[email protected]>
    Unsubscribe:
    https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
    <mailto:[email protected]
    <mailto:[email protected]>?subject=unsubscribe>



_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata


--
All the best,
Sebastian Hellmann

Director of Knowledge Integration and Linked Data Technologies (KILT)Competence Center

at the Institute for Applied Informatics (InfAI) at Leipzig University
Executive Director of the DBpedia Association

Projects: http://dbpedia.org, http://nlp2rdf.org,http://linguistics.okfn.org, https://www.w3.org/community/ld4lt<http://www.w3.org/community/ld4lt>

Homepage: http://aksw.org/SebastianHellmann
Research Group: http://aksw.org

_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata

Re: [Wikidata] [Wikimedia-l] Solve legal uncertainty of Wikidata

Reply via email to