Re: [Wikidata] Data model explanation and protection

2015-11-09 Thread Magnus Manske
Hi Ben,

looked at the first result from your query. Merge was done by a new user,
who seems to have an "interest" in biology:
https://www.wikidata.org/wiki/Special:Contributions/H%C3%AA_de_tekhn%C3%AA_makr%C3%AA

Second result, another user with no user page, same pattern:
https://www.wikidata.org/wiki/Special:Contributions/Nguyenld

Note that both do other things to items in the "realm", so it doesn't
appear to be my game. Both have done multiple merges.

Haven't looked at more results/ Will look into game mods anyway.

Cheers,
Magnus

On Mon, Nov 9, 2015 at 11:34 PM Benjamin Good 
wrote:

> Magnus,
>
> We are seeing more and more of these problematic merges.  See:
> http://tinyurl.com/ovutz5x for the current list of (today 61) problems.
> Are these coming from the wikidata game?
>
> All of the editors performing the merges seem to be new and the edit
> patterns seem to match the game.  I thought the edits were tagged with a
> statement about them coming from the game, but I don't see that?  If they
> are, could you just take genes and proteins out of the 'potential merge'
> queue ?  I'm guessing that their frequently very similar names are putting
> many of them into the list.
>
> We are starting to work on a bot to combat this, but would like to stop
> the main source of the damage if its possible to detect it.  This is making
> Wikipedia integration more challenging than it already is...
>
> thanks
> -Ben
>
>
> On Wed, Oct 28, 2015 at 3:41 PM, Magnus Manske <
> magnusman...@googlemail.com> wrote:
>
>> I fear my games may contribute to both problems (merging two items, and
>> adding a sitelink to the wrong item). Both are facilitated by identical
>> names/aliases, and sometimes it's hard to tell that a pair is meant to be
>> different, especially if you don't know about the intricate structures of
>> the respective knowledge domain.
>>
>> An item-specific, but somewhat heavy-handed approach would be to prevent
>> merging of any two items where at least one has P1889, no matter what it
>> specifically points to. At least, give a warning that an item is
>> "merge-protected", and require an additional override for the merge.
>>
>> If that is acceptable, it would be easy for me to filter all items with
>> P1889, from the merge game at least.
>>
>> On Wed, Oct 28, 2015 at 8:50 PM Peter F. Patel-Schneider <
>> pfpschnei...@gmail.com> wrote:
>>
>>> On 10/28/2015 12:08 PM, Tom Morris wrote:
>>> [...]
>>> > Going back to Ben's original problem, one tool that Freebase used to
>>> help
>>> > manage the problem of incompatible type merges was a set of curated
>>> sets of
>>> > incompatible types [5] which was used by the merge tools to warn users
>>> that
>>> > the merge they were proposing probably wasn't a good idea.  People
>>> could
>>> > ignore the warning in the Freebase implementation, but Wikidata could
>>> make it
>>> > a hard restriction or just a warning.
>>> >
>>> > Tom
>>>
>>> I think that this idea is a good one.  The incompatibility information
>>> could
>>> be added to classes in the form of "this class is disjoint from that
>>> other
>>> class".  Tools would then be able to look for this information and
>>> produce
>>> warnings or even have stronger reactions to proposed merging.
>>>
>>> I'm not sure that using P1889 "different from" is going to be adequate.
>>> What
>>> links would be needed?  Just between a gene and its protein?  That
>>> wouldn't
>>> catch merging a gene and a related protein.  Between all genes and all
>>> proteins?  It seems to me that this is better handled at the class level.
>>>
>>> peter
>>>
>>>
>>> ___
>>> Wikidata mailing list
>>> Wikidata@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>
>> ___
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Data model explanation and protection

2015-11-09 Thread Benjamin Good
Magnus,

We are seeing more and more of these problematic merges.  See:
http://tinyurl.com/ovutz5x for the current list of (today 61) problems.
Are these coming from the wikidata game?

All of the editors performing the merges seem to be new and the edit
patterns seem to match the game.  I thought the edits were tagged with a
statement about them coming from the game, but I don't see that?  If they
are, could you just take genes and proteins out of the 'potential merge'
queue ?  I'm guessing that their frequently very similar names are putting
many of them into the list.

We are starting to work on a bot to combat this, but would like to stop the
main source of the damage if its possible to detect it.  This is making
Wikipedia integration more challenging than it already is...

thanks
-Ben


On Wed, Oct 28, 2015 at 3:41 PM, Magnus Manske 
wrote:

> I fear my games may contribute to both problems (merging two items, and
> adding a sitelink to the wrong item). Both are facilitated by identical
> names/aliases, and sometimes it's hard to tell that a pair is meant to be
> different, especially if you don't know about the intricate structures of
> the respective knowledge domain.
>
> An item-specific, but somewhat heavy-handed approach would be to prevent
> merging of any two items where at least one has P1889, no matter what it
> specifically points to. At least, give a warning that an item is
> "merge-protected", and require an additional override for the merge.
>
> If that is acceptable, it would be easy for me to filter all items with
> P1889, from the merge game at least.
>
> On Wed, Oct 28, 2015 at 8:50 PM Peter F. Patel-Schneider <
> pfpschnei...@gmail.com> wrote:
>
>> On 10/28/2015 12:08 PM, Tom Morris wrote:
>> [...]
>> > Going back to Ben's original problem, one tool that Freebase used to
>> help
>> > manage the problem of incompatible type merges was a set of curated
>> sets of
>> > incompatible types [5] which was used by the merge tools to warn users
>> that
>> > the merge they were proposing probably wasn't a good idea.  People could
>> > ignore the warning in the Freebase implementation, but Wikidata could
>> make it
>> > a hard restriction or just a warning.
>> >
>> > Tom
>>
>> I think that this idea is a good one.  The incompatibility information
>> could
>> be added to classes in the form of "this class is disjoint from that other
>> class".  Tools would then be able to look for this information and produce
>> warnings or even have stronger reactions to proposed merging.
>>
>> I'm not sure that using P1889 "different from" is going to be adequate.
>> What
>> links would be needed?  Just between a gene and its protein?  That
>> wouldn't
>> catch merging a gene and a related protein.  Between all genes and all
>> proteins?  It seems to me that this is better handled at the class level.
>>
>> peter
>>
>>
>> ___
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
I think a lot of the freebase handles can be out of date and incorrect in
what I previously saw.

On Mon, Nov 9, 2015 at 2:56 PM, Tom Morris  wrote:

> Freebase has another 18,000 Twitter handles which are linked to IMDB, G+,
> etc which don't have English Wikipedia links (as well as 13K which are
> linked to English Wikipedia, although those should be in Wikidata too).
> http://tinyurl.com/omb6bxf
>
> I know some Wikipedias actively discourage links to social networking
> sites like Twitter [1].  What is Wikidata's position on Twitter handles?
> Is the existence of the property P2002 sufficient justification to fill it
> in? (Unlike enwiki's "Yes, we have Twitter template, but you shouldn't be
> using it" stance [2]).
>
> Tom
>
> [1]
> https://en.wikipedia.org/wiki/Wikipedia:External_links#Links_normally_to_be_avoided
> [2] https://en.wikipedia.org/wiki/Template:Twitter
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
Thanks to everyone who added their answers - this all did the trick! :)

On Mon, Nov 9, 2015 at 1:59 PM, Tom Morris  wrote:

> This StackExchange answer describes how to get site links:
>
>
> http://opendata.stackexchange.com/questions/6050/get-wikipedia-urls-sitelinks-in-wikidata-sparql-query#
>
> You could graft in the appropriate clause from that answer to get enwiki
> sitelinks.  It looks like there are a total of 37092 Twitter handles, of
> which 29945 have links to enwiki and 29930 have an English label as well
> (of course you could get the English names for the others by looking at the
> title of the Wikipedia article).
>
> Two things to be aware of are that this isn't a one-to-one correspondence
> and the quality of the links varies, so, for example, @ADFP_Peru has 7
> English wikipedia articles nominally "about" it, but none of them actually
> describe the Peruvian professional football association. That article is
> the rather generically named
> https://en.wikipedia.org/wiki/Professional_Football_Sports_Association
>
> There are only ~160 Twitter accounts with multiple articles though.  All
> 22 the links for the account @status should be ignored as their the result
> of a bots bad parse of things like the link to
> https://twitter.com/WAFCA/status/410033143148978176 in
> https://en.wikipedia.org/wiki/Tye_Sheridan#cite_note-22
>
> Tom
>
> prefix schema: 
> PREFIX wd: 
> PREFIX wdt: 
> PREFIX rdfs: 
>
> SELECT ?Twitter ?item ?item_label ?article WHERE {
>   {?item wdt:P2002 ?Twitter FILTER(?Twitter = "ADFP_Peru") }
>   {?item rdfs:label ?item_label filter (lang(?item_label) = "en") .}
>   {
>   ?article schema:about ?item .
>   FILTER (SUBSTR(str(?article), 1, 25) = "https://en.wikipedia.org/";)
>   }
> }
>
>
>
> On Mon, Nov 9, 2015 at 12:25 PM, Hampton Snowball <
> hamptonsnowb...@gmail.com> wrote:
>
>> Thank you for doing what you could! In terms of wikipedia links it does
>> look like the wikidata records do link to wikpedia, e.g. at the bottom of
>> this record:
>>
>> https://www.wikidata.org/wiki/Q25369
>>
>> Since there is multiple wikpedia entries, I guess I'd be going for the
>> "en" english one.
>>
>> Best,
>> HS
>>
>>
>> On Mon, Nov 9, 2015 at 12:18 PM, Remko de Keijzer <
>> re...@remkodekeijzer.nl> wrote:
>>
>>> No. The label is the label in Wikidata. Not a sitelink. You're not
>>> querying Wikipedia, but Wikidata.
>>> If you really need the sitelink, I hope someone else can help you
>>> further, since I'm a mere beginner at SPARQL and was happy I could get this
>>> result.
>>>
>>> --
>>> Mbch331
>>>
>>> When you have eliminated the impossible, whatever remains, however 
>>> improbable, must be the truth.
>>> (Sir Arthur Conan Doyle)
>>>
>>> Op 9-11-2015 om 18:13 schreef Hampton Snowball:
>>>
>>> Maybe I misunderstood.  I think the item label is actually what's used
>>> in the wikipedia article url, just convert spaces to underscores?
>>>
>>> Thanks!
>>>
>>> On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball <
>>> hamptonsnowb...@gmail.com> wrote:
>>>
 Thank you. Is there a way to export it though with the Wikipedia
 Article name with underscores or wikipedia url?  So like Barack_Obama or
 en.wikipedia.org/wiki/Barack_Obama.

 On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer <
 re...@remkodekeijzer.nl> wrote:

> I think this: http://tinyurl.com/o5lahko is what you wanted. It's got
> the item ID, the label and the value of P2002.
>
> --
> Mbch331
>
> When you have eliminated the impossible, whatever remains, however 
> improbable, must be the truth.
> (Sir Arthur Conan Doyle)
>
> Op 9-11-2015 om 17:09 schreef Hampton Snowball:
>
> I'm looking to use the  
> https://query.wikidata.org/ interface to export to csv all wikidatas
> with this property P2002.
>
>
> 
> https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485
>
> I am looking for the Wikipedia Article_Name + the value associated
> with that property.
>
> Thanks in advance!
>
>
>
> ___
> Wikidata mailing 
> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>

>>>
>>>
>>> ___
>>> Wikidata mailing 
>>> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>>
>>>
>>> 

Re: [Wikidata] Query Help

2015-11-09 Thread Tom Morris
Freebase has another 18,000 Twitter handles which are linked to IMDB, G+,
etc which don't have English Wikipedia links (as well as 13K which are
linked to English Wikipedia, although those should be in Wikidata too).
http://tinyurl.com/omb6bxf

I know some Wikipedias actively discourage links to social networking sites
like Twitter [1].  What is Wikidata's position on Twitter handles?  Is the
existence of the property P2002 sufficient justification to fill it in?
(Unlike enwiki's "Yes, we have Twitter template, but you shouldn't be using
it" stance [2]).

Tom

[1]
https://en.wikipedia.org/wiki/Wikipedia:External_links#Links_normally_to_be_avoided
[2] https://en.wikipedia.org/wiki/Template:Twitter
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Tom Morris
This StackExchange answer describes how to get site links:

http://opendata.stackexchange.com/questions/6050/get-wikipedia-urls-sitelinks-in-wikidata-sparql-query#

You could graft in the appropriate clause from that answer to get enwiki
sitelinks.  It looks like there are a total of 37092 Twitter handles, of
which 29945 have links to enwiki and 29930 have an English label as well
(of course you could get the English names for the others by looking at the
title of the Wikipedia article).

Two things to be aware of are that this isn't a one-to-one correspondence
and the quality of the links varies, so, for example, @ADFP_Peru has 7
English wikipedia articles nominally "about" it, but none of them actually
describe the Peruvian professional football association. That article is
the rather generically named
https://en.wikipedia.org/wiki/Professional_Football_Sports_Association

There are only ~160 Twitter accounts with multiple articles though.  All 22
the links for the account @status should be ignored as their the result of
a bots bad parse of things like the link to
https://twitter.com/WAFCA/status/410033143148978176 in
https://en.wikipedia.org/wiki/Tye_Sheridan#cite_note-22

Tom

prefix schema: 
PREFIX wd: 
PREFIX wdt: 
PREFIX rdfs: 

SELECT ?Twitter ?item ?item_label ?article WHERE {
  {?item wdt:P2002 ?Twitter FILTER(?Twitter = "ADFP_Peru") }
  {?item rdfs:label ?item_label filter (lang(?item_label) = "en") .}
  {
  ?article schema:about ?item .
  FILTER (SUBSTR(str(?article), 1, 25) = "https://en.wikipedia.org/";)
  }
}



On Mon, Nov 9, 2015 at 12:25 PM, Hampton Snowball  wrote:

> Thank you for doing what you could! In terms of wikipedia links it does
> look like the wikidata records do link to wikpedia, e.g. at the bottom of
> this record:
>
> https://www.wikidata.org/wiki/Q25369
>
> Since there is multiple wikpedia entries, I guess I'd be going for the
> "en" english one.
>
> Best,
> HS
>
>
> On Mon, Nov 9, 2015 at 12:18 PM, Remko de Keijzer  > wrote:
>
>> No. The label is the label in Wikidata. Not a sitelink. You're not
>> querying Wikipedia, but Wikidata.
>> If you really need the sitelink, I hope someone else can help you
>> further, since I'm a mere beginner at SPARQL and was happy I could get this
>> result.
>>
>> --
>> Mbch331
>>
>> When you have eliminated the impossible, whatever remains, however 
>> improbable, must be the truth.
>> (Sir Arthur Conan Doyle)
>>
>> Op 9-11-2015 om 18:13 schreef Hampton Snowball:
>>
>> Maybe I misunderstood.  I think the item label is actually what's used in
>> the wikipedia article url, just convert spaces to underscores?
>>
>> Thanks!
>>
>> On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball <
>> hamptonsnowb...@gmail.com> wrote:
>>
>>> Thank you. Is there a way to export it though with the Wikipedia Article
>>> name with underscores or wikipedia url?  So like Barack_Obama or
>>> en.wikipedia.org/wiki/Barack_Obama.
>>>
>>> On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer <
>>> re...@remkodekeijzer.nl> wrote:
>>>
 I think this: http://tinyurl.com/o5lahko is what you wanted. It's got
 the item ID, the label and the value of P2002.

 --
 Mbch331

 When you have eliminated the impossible, whatever remains, however 
 improbable, must be the truth.
 (Sir Arthur Conan Doyle)

 Op 9-11-2015 om 17:09 schreef Hampton Snowball:

 I'm looking to use the  
 https://query.wikidata.org/ interface to export to csv all wikidatas
 with this property P2002.


 
 https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485

 I am looking for the Wikipedia Article_Name + the value associated with
 that property.

 Thanks in advance!



 ___
 Wikidata mailing 
 listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata



 ___
 Wikidata mailing list
 Wikidata@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata


>>>
>>
>>
>> ___
>> Wikidata mailing 
>> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>>
>> ___
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
__

Re: [Wikidata] Query Help

2015-11-09 Thread James Heald

Hi Hampton,

The SPARQL syntax needed to extract wiki-sitelinks isn't the best, and 
with luck will get updated when the data design is next reviewed. 
(Something like the proposed new scheme for identifiers would be 
better).  But I think the following should be more or less what you were 
asking for:


tinyurl.com/nsfwr8k

All best,

   James.


On 09/11/2015 17:25, Hampton Snowball wrote:

Thank you for doing what you could! In terms of wikipedia links it does
look like the wikidata records do link to wikpedia, e.g. at the bottom of
this record:

https://www.wikidata.org/wiki/Q25369

Since there is multiple wikpedia entries, I guess I'd be going for the "en"
english one.

Best,
HS


On Mon, Nov 9, 2015 at 12:18 PM, Remko de Keijzer 
wrote:


No. The label is the label in Wikidata. Not a sitelink. You're not
querying Wikipedia, but Wikidata.
If you really need the sitelink, I hope someone else can help you further,
since I'm a mere beginner at SPARQL and was happy I could get this result.

--
Mbch331

When you have eliminated the impossible, whatever remains, however improbable, 
must be the truth.
(Sir Arthur Conan Doyle)

Op 9-11-2015 om 18:13 schreef Hampton Snowball:

Maybe I misunderstood.  I think the item label is actually what's used in
the wikipedia article url, just convert spaces to underscores?

Thanks!

On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball <
hamptonsnowb...@gmail.com> wrote:


Thank you. Is there a way to export it though with the Wikipedia Article
name with underscores or wikipedia url?  So like Barack_Obama or
en.wikipedia.org/wiki/Barack_Obama.

On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer <
re...@remkodekeijzer.nl> wrote:


I think this: http://tinyurl.com/o5lahko is what you wanted. It's got
the item ID, the label and the value of P2002.

--
Mbch331

When you have eliminated the impossible, whatever remains, however improbable, 
must be the truth.
(Sir Arthur Conan Doyle)

Op 9-11-2015 om 17:09 schreef Hampton Snowball:

I'm looking to use the  
https://query.wikidata.org/ interface to export to csv all wikidatas
with this property P2002.



https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485

I am looking for the Wikipedia Article_Name + the value associated with
that property.

Thanks in advance!



___
Wikidata mailing 
listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata



___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata







___
Wikidata mailing 
listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata



___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata






___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata




___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
Thank you for doing what you could! In terms of wikipedia links it does
look like the wikidata records do link to wikpedia, e.g. at the bottom of
this record:

https://www.wikidata.org/wiki/Q25369

Since there is multiple wikpedia entries, I guess I'd be going for the "en"
english one.

Best,
HS


On Mon, Nov 9, 2015 at 12:18 PM, Remko de Keijzer 
wrote:

> No. The label is the label in Wikidata. Not a sitelink. You're not
> querying Wikipedia, but Wikidata.
> If you really need the sitelink, I hope someone else can help you further,
> since I'm a mere beginner at SPARQL and was happy I could get this result.
>
> --
> Mbch331
>
> When you have eliminated the impossible, whatever remains, however 
> improbable, must be the truth.
> (Sir Arthur Conan Doyle)
>
> Op 9-11-2015 om 18:13 schreef Hampton Snowball:
>
> Maybe I misunderstood.  I think the item label is actually what's used in
> the wikipedia article url, just convert spaces to underscores?
>
> Thanks!
>
> On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball <
> hamptonsnowb...@gmail.com> wrote:
>
>> Thank you. Is there a way to export it though with the Wikipedia Article
>> name with underscores or wikipedia url?  So like Barack_Obama or
>> en.wikipedia.org/wiki/Barack_Obama.
>>
>> On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer <
>> re...@remkodekeijzer.nl> wrote:
>>
>>> I think this: http://tinyurl.com/o5lahko is what you wanted. It's got
>>> the item ID, the label and the value of P2002.
>>>
>>> --
>>> Mbch331
>>>
>>> When you have eliminated the impossible, whatever remains, however 
>>> improbable, must be the truth.
>>> (Sir Arthur Conan Doyle)
>>>
>>> Op 9-11-2015 om 17:09 schreef Hampton Snowball:
>>>
>>> I'm looking to use the  
>>> https://query.wikidata.org/ interface to export to csv all wikidatas
>>> with this property P2002.
>>>
>>>
>>> 
>>> https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485
>>>
>>> I am looking for the Wikipedia Article_Name + the value associated with
>>> that property.
>>>
>>> Thanks in advance!
>>>
>>>
>>>
>>> ___
>>> Wikidata mailing 
>>> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>>
>>>
>>> ___
>>> Wikidata mailing list
>>> Wikidata@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>>
>>
>
>
> ___
> Wikidata mailing 
> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Remko de Keijzer
No. The label is the label in Wikidata. Not a sitelink. You're not 
querying Wikipedia, but Wikidata.
If you really need the sitelink, I hope someone else can help you 
further, since I'm a mere beginner at SPARQL and was happy I could get 
this result.


--
Mbch331

When you have eliminated the impossible, whatever remains, however improbable, 
must be the truth.
(Sir Arthur Conan Doyle)

Op 9-11-2015 om 18:13 schreef Hampton Snowball:
Maybe I misunderstood.  I think the item label is actually what's used 
in the wikipedia article url, just convert spaces to underscores?


Thanks!

On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball 
mailto:hamptonsnowb...@gmail.com>> wrote:


Thank you. Is there a way to export it though with the Wikipedia
Article name with underscores or wikipedia url?  So like
Barack_Obama or en.wikipedia.org/wiki/Barack_Obama
.

On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer
mailto:re...@remkodekeijzer.nl>> wrote:

I think this: http://tinyurl.com/o5lahko is what you wanted.
It's got the item ID, the label and the value of P2002.

-- 
Mbch331


When you have eliminated the impossible, whatever remains, however 
improbable, must be the truth.
(Sir Arthur Conan Doyle)

Op 9-11-2015 om 17:09 schreef Hampton Snowball:

I'm looking to use the https://query.wikidata.org/ interface
to export to csv all wikidatas with this property P2002.


https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485

I am looking for the Wikipedia Article_Name + the value
associated with that property.

Thanks in advance!



___
Wikidata mailing list
Wikidata@lists.wikimedia.org

https://lists.wikimedia.org/mailman/listinfo/wikidata



___
Wikidata mailing list
Wikidata@lists.wikimedia.org 
https://lists.wikimedia.org/mailman/listinfo/wikidata





___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
Maybe I misunderstood.  I think the item label is actually what's used in
the wikipedia article url, just convert spaces to underscores?

Thanks!

On Mon, Nov 9, 2015 at 11:47 AM, Hampton Snowball  wrote:

> Thank you. Is there a way to export it though with the Wikipedia Article
> name with underscores or wikipedia url?  So like Barack_Obama or
> en.wikipedia.org/wiki/Barack_Obama.
>
> On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer  > wrote:
>
>> I think this: http://tinyurl.com/o5lahko is what you wanted. It's got
>> the item ID, the label and the value of P2002.
>>
>> --
>> Mbch331
>>
>> When you have eliminated the impossible, whatever remains, however 
>> improbable, must be the truth.
>> (Sir Arthur Conan Doyle)
>>
>> Op 9-11-2015 om 17:09 schreef Hampton Snowball:
>>
>> I'm looking to use the  
>> https://query.wikidata.org/ interface to export to csv all wikidatas
>> with this property P2002.
>>
>>
>> https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485
>>
>> I am looking for the Wikipedia Article_Name + the value associated with
>> that property.
>>
>> Thanks in advance!
>>
>>
>>
>> ___
>> Wikidata mailing 
>> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>>
>> ___
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
Thank you. Is there a way to export it though with the Wikipedia Article
name with underscores or wikipedia url?  So like Barack_Obama or
en.wikipedia.org/wiki/Barack_Obama.

On Mon, Nov 9, 2015 at 11:34 AM, Remko de Keijzer 
wrote:

> I think this: http://tinyurl.com/o5lahko is what you wanted. It's got the
> item ID, the label and the value of P2002.
>
> --
> Mbch331
>
> When you have eliminated the impossible, whatever remains, however 
> improbable, must be the truth.
> (Sir Arthur Conan Doyle)
>
> Op 9-11-2015 om 17:09 schreef Hampton Snowball:
>
> I'm looking to use the  
> https://query.wikidata.org/ interface to export to csv all wikidatas with
> this property P2002.
>
>
> https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485
>
> I am looking for the Wikipedia Article_Name + the value associated with
> that property.
>
> Thanks in advance!
>
>
>
> ___
> Wikidata mailing 
> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Query Help

2015-11-09 Thread Remko de Keijzer
I think this: http://tinyurl.com/o5lahko is what you wanted. It's got 
the item ID, the label and the value of P2002.


--
Mbch331

When you have eliminated the impossible, whatever remains, however improbable, 
must be the truth.
(Sir Arthur Conan Doyle)

Op 9-11-2015 om 17:09 schreef Hampton Snowball:
I'm looking to use the https://query.wikidata.org/ interface to export 
to csv all wikidatas with this property P2002.


https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485

I am looking for the Wikipedia Article_Name + the value associated 
with that property.


Thanks in advance!



___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] weekly summary #183

2015-11-09 Thread Lydia Pintscher
Hey folks :)

Here's the summary of what's been happening around Wikidata over the past
week. If you are working on something interesting or have seen someone else
do something cool please do not hesitate to add it to the next weekly
summary via the link in the last section of this email.
Discussions

   - Open request for adminship: Lakokat
   


Events /Press/Blogs


   - Wikidata/Wikibase Json Dump Reader
   
   - Wikidata project to tackle language barriers in scientific reserach
   

   - Semantic Cities 
   - Q167545: Wikidata celebrated its third birthday
   

   - Slides from talk at UCSD on "Open biomedical knowledge using
   crowdsourcing and citizenscience"
   

   - Past: semwebpro (slides )
   - Past: ODI Summit
   - Past: MozFest (etherpad
   )
   - Upcoming: WikiConference Seoul 

Other Noteworthy Stuff

Highlights of 3 years of Wikidata by Madgalena Wiegner

   - List of Wikipedia articles without an image where Wikidata has one
   
   - Want to use data from Wikidata to enrich data in your own application? S
   wrote a good start.
   
   - Commons misconceptions and how to avoid them
    by
   School of Data

Did you know?

   - Newest properties
   : charted in
   , Danish parish code
   , venous drainage
   , lymphatic drainage
   , CRIStin ID
   , arterial supply
   , periapsis date
   , price
   , uses
   , Groeningemuseum work PID
   , iTunes album ID
   , Austrian Parliament ID
   , ambitus
   , Member of the Hellenic
   Parliament ID , Magdeburger
   Biographisches Lexikon , UEFA
   player code , World Health
   Organisation International Nonproprietary Name
   , Heidelberg Academy for
   Sciences and Humanities member ID
   , Hederich article
   
   - Ever noticed ranks ?

Development

   - Worked on the tests for the ArticlePlaceholder
   - Finished the create article button for the ArticlePlaceholder page
   - From Monday on a bzip2 compressed version of the beta Wikidata TTL
   dumps will be published along the gzip one
   - Getting close to make it possible to add the main value of a statement
   and its reference at the same time
   - Worked on adding a new section to item and property pages for
   identifiers
   - Did backend work for making identifiers useful in our machine-readable
   outputs (by actually linking them instead of just giving the identifier
   string) - more work needed
   - Fixed a bug where dates would have English months on non-English wikis
   (phabricator:T116503 )
   - Fixed a bug when editing labels on mobile (phabricator:T117184
   )
   - Worked more on making search work on mobile
   - Worked on a fix for a visual glitch in the table of content on mobile
   (the box is bigger than its content)

You can see all open tickets related to Wikidata here
.
Monthly Tasks

   - Hack on one of these
   

[Wikidata] Query Help

2015-11-09 Thread Hampton Snowball
I'm looking to use the https://query.wikidata.org/ interface to export to
csv all wikidatas with this property P2002.

https://www.wikidata.org/w/index.php?title=Special:WhatLinksHere/Property:P2002&limit=500&from=21542767&back=20967485

I am looking for the Wikipedia Article_Name + the value associated with
that property.

Thanks in advance!
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Andy Mabbett
On 9 November 2015 at 12:40, Andrew Gray  wrote:

> Please do, if you have a good idea how it would work!

Done:

   
https://www.wikidata.org/wiki/Wikidata:Property_proposal/Property_metadata#Expected_completeness

-- 
Andy Mabbett
@pigsonthewing
http://pigsonthewing.org.uk

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Andrew Gray
Please do, if you have a good idea how it would work!

A.

On 9 November 2015 at 11:00, Andy Mabbett  wrote:
> On 9 November 2015 at 09:28, Andrew Gray  wrote:
>
>> It might be worth thinking about whether we should record
>> these identifier properties as "will always be incomplete", "probably
>> complete", "expected to eventually be complete", etc. If a user
>> queries for an ISBN we don't have, the chances are high that it's a
>> good ISBN we don't cover - but if they query for a country code we
>> don't have, the chances are high that it's an invalid code...
>
> Yes. We can do this with a property, to be applied to other
> properties; and an item for each of the categories you describe.
>
> Will you propose the new property, or shall I?
>
> --
> Andy Mabbett
> @pigsonthewing
> http://pigsonthewing.org.uk
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata



-- 
- Andrew Gray
  andrew.g...@dunelm.org.uk

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Daniel Kinzler
Am 09.11.2015 um 09:45 schrieb Daniel Kinzler:
> Stas: do we have a ticket for this somewhere? All I can find are the notes in
> the etherpad.

Lydia just found the ticket for me: https://phabricator.wikimedia.org/T99899

I'll add some notes from the etherpad.

-- 
Daniel Kinzler
Senior Software Developer

Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Jane Darnell
I'll support it

On Mon, Nov 9, 2015 at 12:00 PM, Andy Mabbett 
wrote:

> On 9 November 2015 at 09:28, Andrew Gray 
> wrote:
>
> > It might be worth thinking about whether we should record
> > these identifier properties as "will always be incomplete", "probably
> > complete", "expected to eventually be complete", etc. If a user
> > queries for an ISBN we don't have, the chances are high that it's a
> > good ISBN we don't cover - but if they query for a country code we
> > don't have, the chances are high that it's an invalid code...
>
> Yes. We can do this with a property, to be applied to other
> properties; and an item for each of the categories you describe.
>
> Will you propose the new property, or shall I?
>
> --
> Andy Mabbett
> @pigsonthewing
> http://pigsonthewing.org.uk
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Andy Mabbett
On 9 November 2015 at 09:28, Andrew Gray  wrote:

> It might be worth thinking about whether we should record
> these identifier properties as "will always be incomplete", "probably
> complete", "expected to eventually be complete", etc. If a user
> queries for an ISBN we don't have, the chances are high that it's a
> good ISBN we don't cover - but if they query for a country code we
> don't have, the chances are high that it's an invalid code...

Yes. We can do this with a property, to be applied to other
properties; and an item for each of the categories you describe.

Will you propose the new property, or shall I?

-- 
Andy Mabbett
@pigsonthewing
http://pigsonthewing.org.uk

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Gerard Meijssen
Hoi,
What we could do for particular searches is to fallback on other resources
that are known to be complete. When we do not find an ISBN, we can fall
back to library systems, local libraries preferably.

There are many ways we can make a difference. When we do this for one field
of knowledge at a time, it will entice people to do more for their own
field.
Thanks,
 GerardM

On 9 November 2015 at 10:28, Andrew Gray  wrote:

> On 9 November 2015 at 08:45, Daniel Kinzler 
> wrote:
>
> >> Also, is this a temporary thing? Will Wikidata eventually have items
> for every
> >> book published, every musical recording, etc. and become a superset of
> all those
> >> unique identifiers?
> >
> > It's highly unlikely that wikidata will become a superset of any and all
> > vocuabularies in existance.
>
> Agree.
>
> *However*, there are some things where we may be able to say with
> confidence "Wikidata has a comprehensive set of X" (eg catalogues such
> as P1186). It might be worth thinking about whether we should record
> these identifier properties as "will always be incomplete", "probably
> complete", "expected to eventually be complete", etc. If a user
> queries for an ISBN we don't have, the chances are high that it's a
> good ISBN we don't cover - but if they query for a country code we
> don't have, the chances are high that it's an invalid code...
>
> --
> - Andrew Gray
>   andrew.g...@dunelm.org.uk
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Andrew Gray
On 9 November 2015 at 08:45, Daniel Kinzler  wrote:

>> Also, is this a temporary thing? Will Wikidata eventually have items for 
>> every
>> book published, every musical recording, etc. and become a superset of all 
>> those
>> unique identifiers?
>
> It's highly unlikely that wikidata will become a superset of any and all
> vocuabularies in existance.

Agree.

*However*, there are some things where we may be able to say with
confidence "Wikidata has a comprehensive set of X" (eg catalogues such
as P1186). It might be worth thinking about whether we should record
these identifier properties as "will always be incomplete", "probably
complete", "expected to eventually be complete", etc. If a user
queries for an ISBN we don't have, the chances are high that it's a
good ISBN we don't cover - but if they query for a country code we
don't have, the chances are high that it's an invalid code...

-- 
- Andrew Gray
  andrew.g...@dunelm.org.uk

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Daniel Kinzler
Am 09.11.2015 um 03:26 schrieb S Page:
> I think these other identifiers are all "Wikidata property representing a 
> unique
> identifier" and there are about 350 of them [2] But surprisingly, I couldn't
> find an easy way to look up a Wikidata item using these other identifiers.

We discussed some loose plans for implementing this in Currus when Stas was in
Berlin a few weeks ago. On Special:Search, you would ask for
property:P212:978-2-07-027437-6, and that would find the item with that ISBN.

Stas: do we have a ticket for this somewhere? All I can find are the notes in
the etherpad.

> Also, is this a temporary thing? Will Wikidata eventually have items for every
> book published, every musical recording, etc. and become a superset of all 
> those
> unique identifiers?

It's highly unlikely that wikidata will become a superset of any and all
vocuabularies in existance. Better integration of external identifiers is high
on our priority list right now. The first step will however be to property
expose URIs for them, so we are no longer a dead end in the linked data web.

But since we need to work on Cirrus integration anyway, I expect that we will
have search-by-property soonish, too. I certrainly hope so.


-- 
Daniel Kinzler
Senior Software Developer

Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] how to map other identifiers to Wikidata entity IDs

2015-11-09 Thread Jan Ainali
You could pass several ISBNs to wdq with OR (but I don't know if it will
support 100 ISBNs in one go):

https://wdq.wmflabs.org/api?q=string%5B957:%222-7071-1620-3%22%5D%20OR%20STRING[957:%222-7071-1562-2%22]


*Med vänliga hälsningar,Jan Ainali*

Verksamhetschef, Wikimedia Sverige 
0729 - 67 29 48


*Tänk dig en värld där varje människa har fri tillgång till mänsklighetens
samlade kunskap. Det är det vi gör.*
Bli medlem. 


2015-11-09 3:26 GMT+01:00 S Page :

> In the article "Presenting Wikidata knowledge" [1], I've Been a bit Bold
> and specified a recipe:
>
> 1. Find existing interesting wiki pages in the domain of your application.
> 2. View the Wikidata information for those pages, choose interesting
> properties.
> 3. Associate Wikidata entity IDs with entities of your application.
> 4. Display their Wikidata information in the user's language.
> 5. Use the Wikidata "sitelinks" information about the item to provide
> links to the full Wikipedia (and Wikiquote, Wikivoyage, etc.) article about
> the entity in the user's language.
>
> But I realize for something like a reference app there won't be Wikidata
> items for every entity in your app for step 3: not every book in print has
> a Wikidata item, nor does every musical recording, etc. For those there are
> already identifiers such as ISBNs and "MusicBrainz release group ID"s (mmm,
> brains). I assume reference app developers already use these more complete
> identifiers and so I'm inviting them to add Wikidata entity IDs where
> available.
>
> I think these other identifiers are all "Wikidata property representing a
> unique identifier" and there are about 350 of them [2] But surprisingly, I
> couldn't find an easy way to look up a Wikidata item using these other
> identifiers.
>
> I found you can do it one-by-one in Wikidata Query [3] and in Wikidata
> Query Serivce [4] but neither seems amenable to doing a query on the fly
> "Get me the Wikidata item for each of these 100 ISBNs "2-7071-1620-3", ...
>
> Also, is this a temporary thing? Will Wikidata eventually have items for
> every book published, every musical recording, etc. and become a superset
> of all those unique identifiers?
>
> Thanks!
>
> [1] https://www.mediawiki.org/wiki/API:Presenting_Wikidata_knowledge
> [2]
> https://www.wikidata.org/wiki/Special:WhatLinksHere/Q19847637?limit=500
> [2] https://wdq.wmflabs.org/api?q=string%5B957:"2-7071-1620-3"%5D and
> [3] https://query.wikidata.org with the the SPARQL (mmm, sparkly)
> PREFIX wdt: 
>
> SELECT ?book WHERE {
>?book  wdt:P957 "2-7071-1620-3"
> }
>
> --
> =S Page  WMF Tech writer
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata