Hi Milorad,
I need pairs of data sets that have been already linked following the
Linked Data principles [1]. For example, a data set containing data
about all the books published in Germany in the last 10 years, and
DBpedia as the second data set. I am interested in data
interlinking-connecting instances.
I need this as a gold standard (or reference interlinking), in order to
evaluate my own interlinking process (i.e. to compute its precision and
recall). So I would need a reference interlinking which was created
either manually, or applying tools like LogMap, or Silk, and was
afterwards reviewed by humans to add the links that the tools did not
discovered correctly or did not discover at all. I hope this clarifies
my previous email.
Kind regards,
Cristina
[1] http://linkeddatabook.com/editions/1.0/#htoc56
Am 26.08.2013 08:47, schrieb Milorad Tosic:
Hi,
Linked Data sets are linked by definition since they use URIs for data as well as meta
data identification. What do you exactly mean by linking when you say "I would need
pairs of data sets which have been manually linked, or ...". Do you mean data and
record linkage as given for example in [1] or something else?
Regards,
Milorad Tosic
Faculty of Electronic Engineering
University of Nis, Serbia
________________________________
From: Cristina Sarasua <[email protected]>
To: [email protected]
Sent: Thursday, August 22, 2013 5:06 PM
Subject: Linked data sets for evaluating interlinking?
Hi,
I am looking for pairs of linked data sets that can be used as gold standard
for evaluations. I would need pairs of data sets which have been manually
linked, or data sets which have been (semi-)automatically linked with
interlinking tools, and afterwards reviewed (to include the links which are not
identified by tools). I have looked into the DataHub catalogue and queried VoiD
descriptions, but unfortunately the information about how the interlinking
process was carried out is often missing.
Apart from the data sets which have been used in the OAEI-instance
matching track, could anyone recommend (based on past experience)
good data sets for evaluating data interlinking processes?
Thanks in advance.
Kind regards,
Cristina
--
Cristina Sarasua Institute for Web Science and Technologies (WeST) Universität
Koblenz-Landau
Universitätsstraße 1
56070 Koblenz
Germany e: [email protected] p: +49 261 287 2772
f: +49 261 287 100 2772
w: http://west.uni-koblenz.de
--
Cristina Sarasua
Institute for Web Science and Technologies (WeST)
Universität Koblenz-Landau
Universitätsstraße 1
56070 Koblenz
Germany
e: [email protected]
p: +49 261 287 2772
f: +49 261 287 100 2772
w: http://west.uni-koblenz.de