Re: [Wikidata-l] Counting sitelinks - period

2013-09-24 Thread legoktm
On Tue, Sep 24, 2013 at 5:10 PM, Cristian Consonni wrote > > Quick question, the file says that in Wikidata there are: > 1276758 items link it.wiki, but it.wiki has "only" > 1066230 articles > So there are many (~210k) Wikidata items pointing to non-article > pages? Or maybe there is some double

Re: [Wikidata-l] Counting sitelinks - period

2013-09-24 Thread Cristian Consonni
2013/9/24 Luca Martinelli : > 2013/9/24 Magnus Manske : >> https://tools.wmflabs.org/magnustools/static_data/items_per_site.20130924.tab >> >> SQL query used: >> select ips_site_id,count(*) from wb_items_per_site group by ips_site_id >> >> For a list of all

Re: [Wikidata-l] Counting sitelinks of subclasses.

2013-09-24 Thread Andy Mabbett
What about alcohol-free beer? -- Andy Mabbett @pigsonthewing http://pigsonthewing.org.uk On Sep 24, 2013 3:49 PM, "Andrea Zanni" wrote: > Not sure if I udnerstood it well, but this could be a counterexample: > > Beer (https://www.wikidata.org/wiki/Q44) is a subclass of Alcoholic > beverage (htt

Re: [Wikidata-l] Counting sitelinks - period

2013-09-24 Thread Luca Martinelli
2013/9/24 Magnus Manske : > For now: > https://tools.wmflabs.org/magnustools/static_data/items_per_site.20130924.tab > > SQL query used: > select ips_site_id,count(*) from wb_items_per_site group by ips_site_id > > For a list of all items with these links, now that might be a

Re: [Wikidata-l] Counting sitelinks - period

2013-09-24 Thread Magnus Manske
For now: https://tools.wmflabs.org/magnustools/static_data/items_per_site.20130924.tab SQL query used: select ips_site_id,count(*) from wb_items_per_site group by ips_site_id For a list of all items with these links, now that might be a little long to put in an attachment... On Tue, Sep 24

Re: [Wikidata-l] Counting sitelinks - period

2013-09-24 Thread Federico Leva (Nemo)
Luca Martinelli, 24/09/2013 19:40: Hi all, sorry if I burst in with a simple question, but... is there a tool to count how many sitelinks for a single project are there in Wikidata? I mean, if I want to know how many (and which) items in Wikidata have a sitelink to, say, Yoruba Wikipedia, what

[Wikidata-l] Counting sitelinks - period

2013-09-24 Thread Luca Martinelli
Hi all, sorry if I burst in with a simple question, but... is there a tool to count how many sitelinks for a single project are there in Wikidata? I mean, if I want to know how many (and which) items in Wikidata have a sitelink to, say, Yoruba Wikipedia, what tool do I have to use, if exists? Th

Re: [Wikidata-l] 'Person' or 'human', upper ontologies and migrating 4 million claims

2013-09-24 Thread Gerard Meijssen
Hoi, When you read about upper ontologies, it says that the answer is highly political. We have already suffered the pain of using the DNB library system. For all our items with a GND identifier we can lookup what the "main type (GND)" is. We have identifiers to many external sources and for all

Re: [Wikidata-l] Counting sitelinks of subclasses.

2013-09-24 Thread Klein,Max
This is really useful, thanks Magnus, otherwise I thought I was going to have to put Wikidata in RAM myself. Maximilian Klein Wikipedian in Residence, OCLC +17074787023 From: wikidata-l-boun...@lists.wikimedia.org on behalf of Magnus Manske Sent: Tuesday, Sep

Re: [Wikidata-l] Counting sitelinks of subclasses.

2013-09-24 Thread Magnus Manske
This: http://208.80.153.172/api?q=claim[279]&props=279 will give you all the items that have the "subclass of" property, and the respective item they are a subclass of. Enough to make a subclass tree for all of Wikidata. You'll have to get the labels and page counts yourself ;-) On Tue, Sep 24

Re: [Wikidata-l] Counting sitelinks of subclasses.

2013-09-24 Thread Denny Vrandečić
I would be surprised if that theory held true. I expect that both very abstract (fruit) and extremely specific (golden delicious) items would have a lower sitelink count than the "golden layer of most useful terms" (apple) in the hierarchy (I am reminded of the theory of word length and term freque

Re: [Wikidata-l] 'Person' or 'human', upper ontologies and migrating 4 million claims

2013-09-24 Thread Benjamin Good
Tom, I totally agree with your sentiments here. Two questions. Do you believe there is any valuable use for upper ontologies in the wikidata system at all at this stage? Could you describe how you see a bubble-up classification scheme working in this context in a little detail? I can imagine s

Re: [Wikidata-l] Counting sitelinks of subclasses.

2013-09-24 Thread Andrea Zanni
Not sure if I udnerstood it well, but this could be a counterexample: Beer (https://www.wikidata.org/wiki/Q44) is a subclass of Alcoholic beverage (https://www.wikidata.org/wiki/Q154) Beer: 142 links Alcoholic beverage: 73 links Aubrey On Tue, Sep 24, 2013 at 1:12 AM, Klein,Max wrote: > Hel

Re: [Wikidata-l] 'Person' or 'human', upper ontologies and migrating 4 million claims

2013-09-24 Thread Tom Morris
On 22 September 2013 at 21:24:48, Antoine Isaac (ais...@few.vu.nl) wrote: First, getting a clean hierarchy won't make things easier, if you end up with a too static/formal view on the world. Second, the feeling about the W3C recommendations is wrong. W3C has actually pushed SKOS to allow 'softer