On Jul 29, 2009, at 9:41 AM, Kingsley Idehen wrote:
Hugh Glaser wrote:
On 29/07/2009 12:35, "Kingsley Idehen" <[email protected]>
wrote:
Hugh Glaser wrote:
On 28/07/2009 14:46, "Kingsley Idehen" <[email protected]>
wrote:
Hugh Glaser wrote:
Good stuff.
However, I don't think that Named Graphs are the answer.
I get my Linked Data by resolving URIs over http.
If I ask your Linked Data Space (I hope that is the right use
of your
terminology) for something like
curl -H "Accept: application/rdf+xml" http://dbpedia.org/resource/London
and follow the redirect don't I still get the non-wikipedia
data with the
wikipedia data?
Or am I not understanding something?
The link chain shouldn't be broken. Named Graphs should have
zero impact
on HTTP URIs.
That is what I thought.
So how is the linkage data kept separate when I do URI resolution?
Cheers
Hugh,
The Linkage isn't what needs separating.
Its when you make a data set that is 100% entity to entity links
triples (i.e., a linkset or linkbase) that needs separating (as
good
practice) from the main KB. Remember, there are times when the
main KB
and the source of cross links to external entities are produced by
separate parties. Thus, the linksets end up in their own Named
Graphs.
Purely for organization and maintenance. This kind of partitioning
allows the use SPARUL scoped to Named Graphs when fixing triple
statement errors (e.g. owl:sameAs triples), for instance.
That's great - I think you agree with me. :-)
Yes.
As I think when you say Named Graph you mean a different URI scheme
for
linkage information.
No, I mean a Named Collection of Triples :-)
An RDF graph *is* a set of triples, and a set *is* a collection, so...
Pat Hayes
Kingsley
Cheers
Hugh
Kingsley
I think Alan is saying: put what is best described as a linkbase
dump in
a separate Named Graph. Doing this shouldn't break the tapestry
inherent
in the HTTP URIs (the data conductors). We have tons of data in
<http://lod.openlinksw.com> partitioned across named graphs, and
none of
that breaks the "follow-your-nose" pattern. Remember, I am a
stickler
for keeping the HTTP URIs of entities in full scope of user
agents :-)
The only time you might have an issue is when performing SPARQL,
where
explicitly identifying the Named Graph in the FROM Clause may aid
performance (and even here this depends on the indexing in
placece re,
the RDF DBMS insta, these days re. Virtuoso that doesn't even
matter
since the default indexing scheme has been changed).
Kingsley
Best
Hugh
On 28/07/2009 11:17, "Kingsley Idehen" <[email protected]>
wrote:
Hugh Glaser wrote:
For the record ( © Alan!).
I consider it bad practice to keep the knowledge about linking
in the same
KB as the substantive knowledge you are representing.
You need two KBs: one for the knowledge you are publishing,
and one for
the
linkage you are working on.
These have very different provenance, maintenance patterns,
etc..
And you can include a link from URIs that you generate to the
linkage KB.
For terminology consolidation purposes, what you call a KB is a
"Linked Data Space" in my parlance :-)
Yes, the partitioning suggested above is very important. Thus,
you need
purpose specific Linked Data Spaces (hosing many Named Graphs)
if you
seek to make things a little clearer to data consumers and
their agents.
In fact, this would then help Alan's problem about sameAs:- he
could
simply
decide not to get your view of the linkage, whereas with
sameAs in the
resources he has no choice but to accept your view, and even
your
predicate
when he resolves a URI or queries the SPARQL.
And I do agree with you about minting URIs to your local
stuff, including
authors; it is error-prone to try to re-use things like
dbpedia for this,
on
any scale. And this is why you need to tackle the linkage
problem as a
separate engineering activity.
Best
Hugh
(Of course I do have some software and architecture that
supports separate
linkage KBs (our CRS) so I would say this, but nevertheless I
think it is
the correct engineering approach, however it is done.
Separation of
Concerns.)
Note, we've partitioned DBpedia in such a way that you now have
a Graph
IRI for each data set within this particular Linked Data Space.
Kingsley
On 28/07/2009 02:23, "Eric Lease Morgan" <[email protected]
>
wrote:
On Jul 25, 2009, at 5:09 AM, Bill Roberts wrote:
Regarding linking to external resources, what it seems you want
to do is to identify the dc:creator of the book, hence say that
the creator is the person whose name was Thomas More. You could
create your own URI and if you are managing a whole bunch of
data
about books and authors, then there could be reasons to do
that,
but in general if there is a satisfactory existing URI, it is
preferable to use it. Dbpedia seems to have become the de facto
standard...
Okay, then how's this for a recipe to create rich linked data of
electronic books and authors within my own site as well as to
the
outside world:
1. Mint URIs pointing to representations of local etexts
2. Mint URIs pointing to representations of authors of local
etexts
3. In resources of etexts, include owl:sameAs links to DBpedia
resources
4. In resources of etexts, point to local URIs of authors
5. In resources of authors, include owl:sameAs links to
DBpedia
resources
6. In resources of authors, include owl:creatorOf links to
local
etexts
7. For extra credit, do the same thing for subjects/keywords
For example, the following resource descriptions:
<!-- etext #1; points to local author and remote title -->
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:owl="http://www.w3.org/2002/07/owl#">
<rdf:Description
rdf:about="http://infomotions.com/etexts/id/more-utopia-221"
owl:sameAs="http://dbpedia.org/resource/Utopia_(book)">
<dcterms:title>Utopia</dcterms:title>
<dcterms:creator
rdf:resource="http://infomotions.com/etexts/authors/resource/thomas-more
" />
</rdf:Description>
</rdf:RDF>
<!-- etext #2; points to local author and remote title -->
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:owl="http://www.w3.org/2002/07/owl#">
<rdf:Description
rdf:about="http://infomotions.com/etexts/id/more-
reality-404"
owl:sameAs="http://dbpedia.org/resource/Reality_(book)">
<dcterms:title>Reality</dcterms:title>
<dcterms:creator
rdf:resource="http://infomotions.com/etexts/authors/resource/thomas-more
" />
</rdf:Description>
</rdf:RDF>
<!-- author; points to local etexts and remote author -->
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:owl="http://www.w3.org/2002/07/owl#">
<rdf:Description
rdf:about="http://infomotions.com/etexts/authors/resource/thomas-more
"
owl:sameAs="http://dbpedia.org/resource/Thomas_More">
<owl:creatorOf
rdf:resource="http://infomotions.com/etexts/id/more-utopia-221
"/>
<owl:creatorOf
rdf:resource="http://infomotions.com/etexts/id/more-reality-404
" />
</rdf:Description>
</rdf:RDF>
--
Eric Lease Morgan
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/
~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/
~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/
~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO OpenLink Software Web: http://www.openlinksw.com
------------------------------------------------------------
IHMC (850)434 8903 or (650)494 3973
40 South Alcaniz St. (850)202 4416 office
Pensacola (850)202 4440 fax
FL 32502 (850)291 0667 mobile
phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes