Re: Size matters -- How big is the danged thing

Richard Light Thu, 20 Nov 2008 04:29:18 -0800

In message <[EMAIL PROTECTED]>, Matthias Samwald<[EMAIL PROTECTED]> writes

Rather than trying to do a rapid expansion over the whole web throughvery light-weight, loose RDFization of all kinds of data, it might bemore rewarding to focus on creating rich, relatively consistent andinteroperable RDF/OWL representations of the information resources thatmatter the most. Of course, this is not an either-or decision, as bothprocesses (the improvement in quality and the increase in quantity)will happen in parallel. But I think that quality should have higherpriority than quantity, even if it might be harder to, uhm, quantify quality.

This is the sort of issue I am trying to get my head around in relationto my particular area of interest: the museums community. I'm trying toform a view on what museum collections information systems couldcontribute to the Linked Data effort, and my current thinking is"objects in a historical context".

I've had a go at putting up one museum's 60,000 objects asnot-very-linked-data, see e.g.:


http://collections.wordsworth.org.uk/object/rdf/GRMDC.C104.15

which gives an idea of the sort of information that might be present(for Fine Art materials, anyway).

One no-brainer is that this sort of exercise allows museums to assignpersistent URIs to their own objects, as I have done here.

Another obvious conclusion is that the museum community ought to get itsact together and agree on a vocabulary/ontology for the predicates inthese object descriptions. I'm currently using DBpedia properties, butthere are frameworks like the CIDOC Conceptual Reference Model whichmight serve better.


After that it all gets a bit hazy.

I've made a hook-up to Geonames if there is an "exact match" on theplace name in the data (which is done dynamically in the XSLT transformwhich generates the RDF). I could, in principle, go to resources likethe Getty AAT for techniques, etc., as and when it has an API whichallows me to query it and get XML back.

However, my biggest query is about people - in a museum/historicalcontext, you're talking about all the people who ever lived, whetherfamous or not. I could invent URIs for each person mentioned in theWordsworth Trust data, and publish those, but then they would be lockedinto a single silo with no prospect of interoperability with any othermuseum's personal data. Mapping names across thousands of museum triplestores is not a scalable option.

So ... is there a case for "deadpeople.org", a site which does forhistorical people what Geonames does for place names? ("dead" = "nodata protection issues": I'm not just being macabre.) The site shouldexpect a constant flood of new people (and should issue a unique URI foreach as it creates the central record), but should also allow queriesagainst existing entries, so that the matching process can happen on acase-by-case basis in a central place, rather than being done after theevent.


Richard Light

--
Richard Light

Re: Size matters -- How big is the danged thing

Reply via email to