Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-22 Thread Denny Vrandecic
According to your definition, then LODStats is misnamed. It should be LOD Datasets Stats. Or am I misunderstanding something? On 22 Jun 2012, at 01:30, Sören Auer wrote: Am 21.06.2012 17:08, schrieb Hugh Glaser: Hi. On 21 Jun 2012, at 11:40, Sören Auer wrote: Am 21.06.2012 12:03, schrieb

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-22 Thread Sören Auer
Am 22.06.2012 11:30, schrieb Denny Vrandecic: According to your definition, then LODStats is misnamed. It should be LOD Datasets Stats. Or am I misunderstanding something? Maybe you are right Denny, but there is never a perfect name. Actually LODStats is both, a tool and a service. The

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Denny Vrandecic
This is really cool. On 2 Feb 2012, at 12:04, Sören Auer wrote: A demo installation collecting statistics from all LOD datasets registered on CKAN is available from: http://stats.lod2.eu Are you missing this one? http://thedatahub.org/dataset/linked-open-numbers Since you say all LOD

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Sören Auer
Am 21.06.2012 11:33, schrieb Denny Vrandecic: This is really cool. On 2 Feb 2012, at 12:04, Sören Auer wrote: A demo installation collecting statistics from all LOD datasets registered on CKAN is available from: http://stats.lod2.eu Are you missing this one?

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Hugh Glaser
Good work Sören and team. Interesting question from Denny. I guess you don't do http://thedatahub.org/dataset/sameas-org for the same reason. And http://thedatahub.org/dataset/dbpedia-lite (Or at least I couldn't find them.) I'm not sure you should claim all LOD datasets registered on CKAN if

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Sören Auer
I am starting to use LODStats and I think it is a very useful tool. Actually I would be interested on using it over SPARQL endpoints but I dont know how to do that. Does anybody knows whether it is possible? We don't have a SPARQL endpoint available (yet), but you can obtain a complete dump of

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Sören Auer
Am 21.06.2012 12:03, schrieb Hugh Glaser: Interesting question from Denny. I guess you don't do http://thedatahub.org/dataset/sameas-org for the same reason. And http://thedatahub.org/dataset/dbpedia-lite (Or at least I couldn't find them.) I'm not sure you should claim all LOD datasets

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Sarven Capadisli
On 2012-06-21 12:40, Sören Auer wrote: Am 21.06.2012 12:03, schrieb Hugh Glaser: Interesting question from Denny. I guess you don't do http://thedatahub.org/dataset/sameas-org for the same reason. And http://thedatahub.org/dataset/dbpedia-lite (Or at least I couldn't find them.) I'm not sure

Re: [pedantic-web] Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Kingsley Idehen
On 6/21/12 6:36 AM, Sören Auer wrote: I am starting to use LODStats and I think it is a very useful tool. Actually I would be interested on using it over SPARQL endpoints but I dont know how to do that. Does anybody knows whether it is possible? We don't have a SPARQL endpoint available (yet),

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Hugh Glaser
Hi. On 21 Jun 2012, at 11:40, Sören Auer wrote: Am 21.06.2012 12:03, schrieb Hugh Glaser: Interesting question from Denny. I guess you don't do http://thedatahub.org/dataset/sameas-org for the same reason. And http://thedatahub.org/dataset/dbpedia-lite (Or at least I couldn't find them.)

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Miguel Tinte
Hi Sören, Thanks for your answer. I think my question was not very clear because I am not looking for an SPARQL endpoint for lodstats: what I need is to run lodstats over datasets SPARQL endpoints. It seems that it is possible like this: (lodstats-env)root@ubuntu:/home/LODStats# lodstats -f sparql

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread miguel . tinte
El jueves, 2 de febrero de 2012 12:32:03 UTC+1, Richard Cyganiak escribió: Congrats, this is awesome. So you're automatically harvesting 200+ datasets by starting with the LOD Cloud metadata we're collecting on the Data Hub (ex CKAN), leading to a total of almost 2B triples. Also

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-06-21 Thread Sören Auer
Am 21.06.2012 17:08, schrieb Hugh Glaser: Hi. On 21 Jun 2012, at 11:40, Sören Auer wrote: Am 21.06.2012 12:03, schrieb Hugh Glaser: Interesting question from Denny. I guess you don't do http://thedatahub.org/dataset/sameas-org for the same reason. And

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-21 Thread Rinke Hoekstra
Hi Sören, others, LODStats is certainly great work. Congratulations! However... is it me, or isn't the 'almost 2B triples' a very disappointing number? If you go through all datasets advertised on the Data Hub, the advertised number of triples is over 40B ! This means that only one out of 20

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-21 Thread Sören Auer
Am 21.02.2012 15:38, schrieb Rinke Hoekstra: However... is it me, or isn't the 'almost 2B triples' a very disappointing number? If you go through all datasets advertised on the Data Hub, the advertised number of triples is over 40B ! This means that only one out of 20 triples in the linked

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-03 Thread Richard Cyganiak
On 2 Feb 2012, at 23:58, Bernard Vatant wrote: More than 60 [vocabularies] are either 404, time out or access denied, which does not come as a surprise, but is nevertheless a big issue. It means that data using those vocabularies are relying on semantics no one can check. The rest is

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-03 Thread Bernard Vatant
Hello Richard All in all, almost half of the vocabularies used in LOD are not meeting a minimal quality requirement : be published at their namespace. Now, if there was a list of these, annotated with some stats (used in how many datasets? occurring in how many triples?), then we could start

[Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Sören Auer
Dear all, We are happy to announce the first public *release of LODStats*. LODStats is a statement-stream-based approach for gathering comprehensive statistics about datasets adhering to the Resource Description Framework (RDF). LODStats was implemented in Python and integrated into the CKAN

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Michael Hausenblas
We are happy to announce the first public *release of LODStats*. Very nice! Does it output VoID [1]? Didn't find it skimming the source ... Cheers, Michael [1] http://www.w3.org/TR/void/ -- Dr. Michael Hausenblas, Research Fellow LiDRC - Linked Data Research Centre DERI -

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Richard Cyganiak
Congrats, this is awesome. So you're automatically harvesting 200+ datasets by starting with the LOD Cloud metadata we're collecting on the Data Hub (ex CKAN), leading to a total of almost 2B triples. Also fascinating is the list of 250 datasets that couldn't be automatically harvested due to

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Richard Cyganiak
On 2 Feb 2012, at 11:04, Sören Auer wrote: A demo installation collecting statistics from all LOD datasets registered on CKAN is available from: http://stats.lod2.eu One more thing. Can I search for the stats for a particular datasets somehow? Let's say I want to see the stats for the

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Sören Auer
Am 02.02.2012 12:18, schrieb Michael Hausenblas: We are happy to announce the first public *release of LODStats*. Very nice! Does it output VoID [1]? Didn't find it skimming the source ... It does, might not be directly linked yet, but we will add the links soon. However, not all LODStats

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Sören Auer
Am 02.02.2012 12:18, schrieb Michael Hausenblas: We are happy to announce the first public *release of LODStats*. Very nice! Does it output VoID [1]? Didn't find it skimming the source ... Have to correct myself, the VoID is already there, see for example:

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Sören Auer
Am 02.02.2012 12:32, schrieb Richard Cyganiak: Congrats, this is awesome. Thanks Richard, we are happy you like it ;-) So you're automatically harvesting 200+ datasets by starting with the LOD Cloud metadata we're collecting on the Data Hub (ex CKAN), leading to a total of almost 2B

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Bernard Vatant
Hello Sören Great work! Of course as you can imagine I jumped right away to http://stats.lod2.eu/vocabularies. Interesting to see the broad figures (205 vocabularies) vs 189 harvested as of today at http://labs.mondeca.com/dataset/lov So I would like to compare, see the overlap ... and complete

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Bernard Vatant
Hello all I've started comparing http://stats.lod2.eu/vocabularies with what we have in store in LOV. A few preliminary stats are available. Those who prefer raw data can go directly to the shared GDocs (waiting for better formats)

Re: [Ann] LODStats - Real-time Data Web Statistics

2012-02-02 Thread Sören Auer
Richard, These are all great suggestions, which we will try to implement in the next days. The LODSTats logo in the header was supposed to serve as a link to the About page (http://aksw.org/projects/LODStats ), but I guess we should place that more prominently. Thanks for your valuable feedback,