On 9/26/14 9:52 AM, Neubert Joachim wrote:
The uriburner seems to bring up data mostly from a lookup of the original uri. It includes (via sioc:links_to) the link from the English wikipedia page, yet misses that from the German one. So it also seems to cover only parts of the data hidden somewhere on the web.

It depends on what you are seeking; there's a little more to the URIBurner instance (and other Virtuoso instances for that matter). For instance, subject to ACLs, a Virtuoso SPARQL endpoint will allow you to crawl the LOD cloud for additional relations in which your URI is either the subject or the object [1][2]. To do that, you simply need to invoke a SPARQL query in which the Virtuoso Web crawl pragmas are enabled.
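
As a rough sketch of what such a query might look like, the Python snippet below builds a SPARQL query prefixed with Virtuoso crawl pragmas and turns it into a request URL. The endpoint address, the choice of pragmas and their values, and the helper names build_crawl_query and build_request_url are assumptions for illustration; the queries behind [1] and [2] may differ.

```python
from urllib.parse import urlencode

# Assumed endpoint for illustration; any Virtuoso endpoint whose ACLs
# permit crawling would be addressed the same way.
ENDPOINT = "http://linkeddata.uriburner.com/sparql"
# The URI discussed in this thread.
TARGET = "http://d-nb.info/gnd/120273152"


def build_crawl_query(uri, depth=2, limit=50):
    """Return a SPARQL query with Virtuoso crawl pragmas (hypothetical helper)."""
    pragmas = [
        'DEFINE input:grab-all "yes"',      # de-reference URIs met while solving
        f"DEFINE input:grab-depth {depth}",  # number of crawl iterations (assumed value)
        f"DEFINE input:grab-limit {limit}",  # cap on documents fetched (assumed value)
        # also follow owl:sameAs links found for the resources involved
        "DEFINE input:grab-seealso <http://www.w3.org/2002/07/owl#sameAs>",
    ]
    body = (
        "SELECT DISTINCT ?s ?p ?o\n"
        "WHERE {\n"
        f"  {{ <{uri}> ?p ?o }}\n"   # the URI as relation subject
        "  UNION\n"
        f"  {{ ?s ?p <{uri}> }}\n"   # the URI as relation object
        "}\n"
        "LIMIT 100"
    )
    return "\n".join(pragmas) + "\n" + body


def build_request_url(endpoint, query):
    """Encode the query as a SPARQL protocol GET request URL."""
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    # Printing only; nothing is sent over the network here.
    print(build_request_url(ENDPOINT, build_crawl_query(TARGET)))
```

The printed URL can be pasted into a browser or passed to curl. Whether the pragmas are honored depends on the ACLs of the endpoint in question.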

The trouble is that there need to be Linked Data sources in the mix for the crawler to de-reference, which is problematic here, since a request for Turtle is redirected to an HTML page:

curl -IH "Accept: text/turtle" http://d-nb.info/gnd/120273152
HTTP/1.1 303 See Other
Date: Sat, 27 Sep 2014 19:52:17 GMT
Server: Apache
Location: http://d-nb.info/gnd/120273152/about/html
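
The same check can be scripted. The Python sketch below parses a response head like the one above and flags a redirect whose target looks like an HTML page. The helper names parse_head and redirects_to_html are hypothetical, and judging by the "/html" path suffix is only a heuristic assumed here; a more reliable test would inspect the Content-Type of the final response.

```python
# Response head copied from the curl output above.
RAW_HEAD = """\
HTTP/1.1 303 See Other
Date: Sat, 27 Sep 2014 19:52:17 GMT
Server: Apache
Location: http://d-nb.info/gnd/120273152/about/html
"""


def parse_head(raw):
    """Split a raw HTTP response head into (status code, header dict)."""
    lines = [ln.strip() for ln in raw.strip().splitlines() if ln.strip()]
    status = int(lines[0].split()[1])  # "HTTP/1.1 303 See Other" -> 303
    headers = {}
    for ln in lines[1:]:
        # Split on the first colon only, so values containing ":" stay intact.
        name, _, value = ln.partition(":")
        headers[name.strip().lower()] = value.strip()
    return status, headers


def redirects_to_html(status, headers):
    """Heuristic: a redirect whose Location path ends in '/html'."""
    location = headers.get("location", "")
    is_redirect = status in (301, 302, 303, 307, 308)
    return is_redirect and location.rstrip("/").endswith("/html")


if __name__ == "__main__":
    status, headers = parse_head(RAW_HEAD)
    print(status, headers.get("location"), redirects_to_html(status, headers))
```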

Bottom line: you can incorporate crawling into SPARQL when using Virtuoso endpoints, but that doesn't negate the need for URIs that adhere to Linked Data principles with regard to the pathways available for crawling.

One thing I need to investigate further is why the owl:sameAs relation object, from the German DBpedia dataset, isn't being de-referenced as part of this SPARQL query solution processing pipeline.

Links:

[1] http://bit.ly/ZhXoBS -- SPARQL crawl example (scoped to relation predicate and objects)
[2] http://bit.ly/1pysvhu -- ditto scoped to relation subject, predicate, and object

--
Regards,

Kingsley Idehen 
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog 1: http://kidehen.blogspot.com
Personal Weblog 2: http://www.openlinksw.com/blog/~kidehen
Twitter Profile: https://twitter.com/kidehen
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Personal WebID: http://kingsley.idehen.net/dataspace/person/kidehen#this
