Re: A(nother) Guide to Publishing Linked Data Without Redirects

Kingsley Idehen Thu, 11 Nov 2010 05:31:38 -0800

On 11/11/10 8:07 AM, David Wood wrote:

On Nov 11, 2010, at 07:44, Kingsley Idehen wrote:
On 11/11/10 4:54 AM, Richard Light wrote:
In message <[email protected]>, Harry Halpin <[email protected]> writes
The question is how to build Linked Data on top of *only* HTTP 200 -
the case where the data publisher either cannot alter their server
set-up (.htaccess) files or does not care to.
Might it help to look at this problem from the other end of the telescope? So far, the discussion has all been about what is returned. How about considering what is requested?
Good idea.
I assume that we're talking about the situation where a user (human or machine) is faced with a URI to resolve. The implication is that they have acquired this URI through some Linked Data activity such as a SPARQL query, or reading a chunk of RDF from their own triple store. (If we're not - if we're talking about auto-magically inferring Linked Data-ness from random URLs, then I would agree that sticking RDFa into said random pages is a way to go, and leave the discussion.)
The Linked Data guidelines make the assumption that said user is willing and able to indicate what sort of content they want, in this case via the Accept header mechanism. This makes it reasonable to further specify that the fallback response, in the absence of a suitable Accept header, is to deliver a human-readable resource, i.e. an HTML web page. Thus the web of Linked Data behaves like part of the web of documents, if users take no special action when dereferencing URLs.
If we agree that it is reasonable for user agents to take some action to indicate what type of response they want, then one very simple solution for the content-negotiation-challenged data publisher would be to establish a convention that adding '.rdf' to a URL should deliver an RDF description of the NIR signified by that URL.
Richard
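The two request-side options described above (an Accept header, or a '.rdf' suffix convention) can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names and the example.org URL are hypothetical, `application/rdf+xml` is assumed as the requested media type, and the suffix rule is the convention being proposed here, not an existing standard. Nothing is sent over the network.

```python
from urllib.request import Request

def rdf_description_url(url, suffix=".rdf"):
    """Apply the suggested convention: append '.rdf' to get the RDF description URL.

    Hypothetical helper; it does not handle query strings or fragments.
    """
    return url if url.endswith(suffix) else url + suffix

def negotiated_request(url, media_type="application/rdf+xml"):
    """Build (but do not send) a request that asks for RDF via the Accept header."""
    return Request(url, headers={"Accept": media_type})

# A publisher who cannot do content negotiation would serve the suffixed URL
# as a plain static file with a 200 response.
thing = "http://example.org/id/paris"  # hypothetical URI
static_url = rdf_description_url(thing)
conneg_request = negotiated_request(thing)
```

The first helper needs nothing from the server beyond static files; the second relies on the server honoring the Accept header.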
Richard,
Yes, we should look at this differently. We should honor the fact that the burgeoning Web of Linked Data is an evolution of the Web of Linked Documents. To do this effectively, I believe we need to fix the Document Web and Data Web false dichotomy.
There is no Linked Data to exploit without Documents at HTTP Addresses from which content is streamed.
Kingsley, your analysis is solid except for one part: You seem to forget that the issue that brought us to this point was that the address of an information resource describing something is not the same as the address of the thing itself. It is that problem that is still worth solving.


David,

I do believe Ian's solution solves the matter of Name / Address disambiguation. Using a Document URL (Address) as a Name requires the aforementioned disambiguation.

Question is: who has to do the disambiguation? The user agent or the data server? I believe a user agent should perform Name / Address disambiguation via its semantic-fidelity choice. If high, then Ian's solution works i.e., the data is self-describing and the user agent should interpret accordingly. The semantic fidelity of HTTP stops at the Document; the problem at hand takes us into the realm of content interpretation. In a sense, like "beauty" this too lies in the eye of the beholder (the user agent).

I don't think a new code is necessary since HTTP is doing its job as a document location and content access protocol.

Thus, if we reference document URLs from browsers and follow links, everything will be fine. If we even go as far as taking a descriptor document's Subject URI (slash terminated) and then place that in a browser, we will be sorta fine too, depending on which user agent we use.

If today's small pool of Linked Data aware user agents adopt Ian's option, then I'll drop "sorta" from the paragraph above :-)


Hope this helps.

Kingsley

Regards,
Dave
If we put the Web aside for a second, I am hoping we can accept that in the real world we have Documents with different surface structure e.g. Blank Paper and Graph Paper.
We can scribble and doodle on blank paper. We can even describe things in sentences and paragraphs on blank paper, but when it comes to Observations ("Data") Graph Paper is better i.e., it delivers high-fidelity expression of Observation by letting us place Subject Identifier, Subject Attributes, and Attribute values into cells.
In the real world, we've been able to make References across both types of paper (Documents):
1. Reference one Document from another
2. Reference a cell in one Document from a cell in another.
Enter the luxury of computers and hypermedia. These innovations allow us to replicate what I've outlined above using hyperlinks. Some examples:
1. Word processors -- you could reference across Microsoft Word documents on a computer, but never across Word and WordPerfect
2. Spreadsheets -- you could use Reference values (Names or Addresses) to connect cell content within a single spreadsheet or across several spreadsheets and workbooks, but you couldn't reference data across Excel and Lotus 1-2-3
3. Database Tables -- you could use Unique Keys to identify records, with Foreign Keys as the Reference mechanism, but in the case of relational databases (the majority) the tables didn't accept Reference values i.e., content was typed-literal oriented; you could reference a table in Oracle from a table in Microsoft SQL Server etc.
As you can see from the above:
#1 is still about scribbling on blank paper. References are scoped to entire documents or fragments.
#2-3 are about graph paper oriented observation (data) capture and reference that leverages the fidelity of cells.
Enter the luxury of computers, hypermedia, and a network protocol (HTTP):
#1 loses its operating system and application specific scope. We have blank paper, so when we scribble we do so in HTML which leverages HTTP for referencing other documents.
#2-3 lose their operating system and application specific scope. We have graph paper, so when we capture observation, leveraging the fidelity of cell level references, we do so via an EAV/SPO graph.
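The graph-paper picture can be sketched in a few lines of Python. This is only an illustration: the example.org identifiers, the attribute names, and the `describe` and `follow` helpers are assumptions made for the sketch, not part of any system mentioned in this thread.

```python
# Each "cell" on the graph paper is one (subject, attribute, value) statement,
# i.e. one EAV/SPO triple. All identifiers are hypothetical example.org names.
PARIS = "http://example.org/thing/paris"
FRANCE = "http://example.org/thing/france"
LABEL = "http://example.org/attr/label"
COUNTRY = "http://example.org/attr/country"

graph = [
    (PARIS, LABEL, "Paris"),
    (PARIS, COUNTRY, FRANCE),  # the value is itself a reference to another subject
    (FRANCE, LABEL, "France"),
]

def describe(subject, triples):
    """Collect the attribute/value pairs recorded for one subject."""
    return {attr: value for s, attr, value in triples if s == subject}

def follow(subject, attribute, triples):
    """Follow a cell-level reference: read the value, then describe what it names."""
    target = describe(subject, triples).get(attribute)
    return describe(target, triples) if target is not None else {}
```

Because both subjects and values are HTTP identifiers, the reference in the second triple is not tied to one application or file format, which is the loss of scope described above.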
As you can see, the Document hasn't gone anywhere; its structure has evolved with reference scope becoming more granular.
Thus, when you HTTP GET and a server responds with 200 OK, it's safe and sound to assume that a Document has been located. It is also safe and sound for a user agent to express what type of Content it would expect from a Document, and then interpret the Content retrieved at varying levels of semantic fidelity.
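A rough sketch of what "varying levels of semantic fidelity" could mean for a user agent follows. The `interpret` function, its parameters, and the "low"/"high" labels are hypothetical, and the statements are assumed to have been parsed from the response body already; this is not a description of any particular agent.

```python
def interpret(url, status, content_type, triples, fidelity="low"):
    """Return what a user agent concludes from one HTTP response.

    triples: (subject, attribute, value) statements already parsed from the
    body; an empty list if the body carried no structured data.
    """
    if status != 200:
        return {"located": False}
    # At the HTTP level, a 200 only says that a Document was located at url.
    result = {"located": True, "document": url, "content_type": content_type}
    if fidelity == "high":
        # A high-fidelity agent also reads the Content and notes which
        # subjects the document describes.
        result["subjects"] = sorted({s for s, _, _ in triples})
    return result

# Hypothetical response data for illustration.
statements = [("http://example.org/thing/paris",
               "http://example.org/attr/label", "Paris")]
low = interpret("http://example.org/doc/paris", 200, "text/html", statements)
high = interpret("http://example.org/doc/paris", 200, "text/html", statements,
                 fidelity="high")
```

Both agents receive the same 200 response; only the high-fidelity one goes on to distinguish the document from the things it describes.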
Back to the point of looking at this differently re. user interaction. I've held the position for a while that the Linked Data narrative is back to front. I say this for the following reasons:
1. Document vs Data false dichotomy
2. Assumption that anytime soon people will think URIs when they are already used to URLs.
Orderly Linked Data narrative in steps for Humans:
1. Users continue to enter Document URLs into Browsers e.g. <http://dbpedia.org/page/Paris> instead of <http://dbpedia.org/resource/Paris>
2. Users will see a human comprehensible document with a clearly identified subject and all its associated attributes and attribute values
3. They will follow their noses to wherever the links in the document take them, enjoying the power of serendipitous discovery of relevant things
4. They will bookmark without confusion i.e. no magical changes in the Browser address bar
5. They will also discover human limitations as time, data volume, and data disparity intersect
6. They will be happy and ultimately wiser (i.e., delegate stuff to smart agents that can exploit these links without human limitations).
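The two DBpedia URLs in step 1 differ only in one path segment. A minimal sketch of moving from the Subject URI to the Document URL, assuming the /resource/ to /page/ pattern seen in that example (the `document_url` helper is hypothetical):

```python
RESOURCE_PREFIX = "http://dbpedia.org/resource/"
PAGE_PREFIX = "http://dbpedia.org/page/"

def document_url(uri):
    """Map a DBpedia Subject URI to the Document URL a person would bookmark.

    URIs that do not start with the /resource/ prefix are returned unchanged.
    """
    if uri.startswith(RESOURCE_PREFIX):
        return PAGE_PREFIX + uri[len(RESOURCE_PREFIX):]
    return uri

bookmark = document_url("http://dbpedia.org/resource/Paris")
```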
To conclude, Ian is suggesting a solution for high-semantic-fidelity user-agents that doesn't break anything, and actually accentuates the Document vs Data false dichotomy. HTTP is a document location and content retrieval protocol :-)
--

Regards,

Kingsley Idehen 
President & CEO
OpenLink Software
Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca: kidehen



