Richard, all

I've done my homework and added a voiD description of the lingvoj.org dataset at http://www.lingvoj.org/void
It's still minimal, but it at least has statistics. Link information will be added ASAP.
For those who might care, note that it links to a new FOAF profile at http://www.lingvoj.org/foaf.rdf

Bernard

Richard Cyganiak wrote:
The problem at hand is: How to get reasonably accurate and up-to-date statistics about the LOD cloud?

I see three workable methods for this.

1. Compile the statistics from voiD descriptions published by individual dataset maintainers. This is what Hugh proposes below. Enabling this is one of the main reasons why we created voiD. There have to be better tools for creating voiD before this happens. The tools could be, for example, manual entry forms that spit out voiD (voiD-o-matic?), or analyzers that read a dump and spit out a skeleton voiD file.

2. Hand-compile the statistics by watching public-lod, trawling project home pages, emailing dataset maintainers, and fixing things when dataset maintainers complain. This is how I created the original LOD cloud diagram in Berlin, and after I left Berlin, Anja has done a great job keeping it up to date despite its massive growth. We will continue to update it on a best-effort basis for the foreseeable future. A voiD version of the information underlying the diagram is in the pipeline. Others can do as we did.

3. Anyone who has a copy of a big part of the cloud (e.g. OpenLink and we at Sindice) can potentially calculate the statistics. This is non-trivial for three reasons: we just have triples, so we need to reverse-engineer datasets and linksets from them; it involves computation over quite serious amounts of data; and in the end you still won't have good labels or homepages for the datasets. While this approach is possible, it seems to me that there are better uses of engineering and research resources.

There is a fourth process that, IMO, does NOT work:

4. Send an email to public-lod asking "Everyone please enter your dataset in this wikipage/GoogleSpreadsheet/fancyAppOfTheWeek."
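
As a rough illustration of the "analyzer that reads a dump and spits out a skeleton voiD file" idea from method 1, here is a minimal Python sketch. It is not an existing tool. It assumes the dump is in N-Triples, uses a deliberately naive line pattern rather than a real parser, and treats any object IRI on a different host as an outbound link, which is only a crude stand-in for proper linkset detection. The function names and the example.org URIs are hypothetical, and the property names (void:triples, void:distinctSubjects) should be checked against the voiD vocabulary version you target.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Naive N-Triples pattern (an assumption, not a full parser): IRI subject,
# IRI predicate, then either an IRI object or anything else (e.g. a literal).
TRIPLE = re.compile(r'^\s*<([^>]*)>\s+<([^>]*)>\s+(?:<([^>]*)>|(.+?))\s*\.\s*$')


def dump_stats(lines, home_host):
    """Count triples, distinct subjects, and object IRIs on foreign hosts."""
    triples = 0
    subjects = set()
    outlinks = Counter()
    for line in lines:
        match = TRIPLE.match(line)
        if not match:
            continue  # skips comments, blank lines, blank-node subjects
        triples += 1
        subjects.add(match.group(1))
        obj_iri = match.group(3)
        if obj_iri:
            host = urlparse(obj_iri).netloc
            if host and host != home_host:
                outlinks[host] += 1
    return {
        "triples": triples,
        "distinct_subjects": len(subjects),
        "outlinks": dict(outlinks),
    }


def skeleton_void(stats, dataset_uri):
    """Render a Turtle skeleton; labels and homepages are left to a human."""
    out = [
        "@prefix void: <http://rdfs.org/ns/void#> .",
        "",
        f"<{dataset_uri}> a void:Dataset ;",
        f"    void:triples {stats['triples']} ;",
        f"    void:distinctSubjects {stats['distinct_subjects']} .",
    ]
    # Outbound links are only reported as hints for hand-written linksets.
    for host, count in sorted(stats["outlinks"].items()):
        out.append(f"# TODO: describe a void:Linkset for {count} links to {host}")
    return "\n".join(out)


if __name__ == "__main__":
    sample = [
        '<http://a.example.org/s1> <http://a.example.org/p> <http://b.example.org/o1> .',
        '<http://a.example.org/s1> <http://a.example.org/name> "one" .',
    ]
    print(skeleton_void(dump_stats(sample, "a.example.org"),
                        "http://a.example.org/void#dataset"))
```

Note that such a skeleton covers only the countable part. As pointed out under method 3, good labels and homepages cannot be derived from the triples and still have to come from the maintainer.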

Best,
Richard



--

Bernard Vatant
Senior Consultant
Vocabulary & Data Engineering
Tel:       +33 (0) 971 488 459
Mail:     [email protected]
----------------------------------------------------
Mondeca
3, cité Nollez 75018 Paris France
Web:    www.mondeca.com <http://www.mondeca.com>
Blog:    Leçons de Choses <http://mondeca.wordpress.com/>
----------------------------------------------------

