Re: httpRange-14 Change Proposal

Nathan Wed, 28 Mar 2012 14:25:57 -0700

Jeni,

First, thanks for confirming - many responses in line from here:


Jeni Tennison wrote:
> The server *can* return the same content from the /uri URI and from
> the /uri-documentation URI, but it does not have to, and it wouldn't
> be sensible to do so for an image. Your first question asked if the
> server could return the same content, your second asked if it must.

Apologies for any confusion from my wording, however I did mean "can" rather than "must".

In a nutshell then, this proposal says that you can return a 200 OK for a GET request on any URI, but if you return "a representation of a description of the thing referred to by <uri>" rather than "a representation of the thing referred to by <uri>" then you should say it is so by including the special "<uri> :describedby <uri-documentation>" triple.

Additionally, rather than special-casing this so that the rule lets a publisher override the default (a 200 OK returns a representation of the resource), the proposal also aims to change web arch and the HTTP specification such that a 200 OK in response to a GET no longer returns a representation of the requested URI; rather, it just returns a representation which you must consult to find out what it is.


That's quite a large change to the web / web arch / http.

On 28 Mar 2012, at 16:07, Nathan wrote:

Jeni Tennison wrote:

Yes, that's correct. With no constraining Accept headers, it could alternatively return HTML 
with embedded RDFa with a <link rel="describedby"> element, for example.

Is that universally true?

Suppose /uri identified a PDF formatted ebook, or a digital image of a monkey 
in JPEG format, or even an RDF document.


Then it would return those things. I think that you may have leapt to the 
conclusion that /uri *always* returns the same as /uri-documentation. There's 
nothing to my knowledge that says that, indeed given that you can have several 
:describedby links it would be impossible.

Sorry no, not *always* just *always could* or *always can*. As in, it would be universally true that for any successful GET request you would receive a representation, and that representation may be a representation of the <target-uri>, or it may be a representation of <some-other-uri> which describes the target-uri.

Question A:

Currently we have:
<http://example.org/uri> - a JPEG image of a monkey.

When you issue a GET on that URI the server currently responds
200 OK
Content-Type: image/jpeg
Link: <http://example.org/uri-documentation>; rel="describedby"

So under this new proposal, the server can return the contents of 
/uri-documentation with a status of 200 OK for a GET on /uri?

Under the proposal, the server would return the JPEG with a 200 OK for a GET on /uri. http://example.org/uri-documentation would return a description of the JPEG in some machine-readable format.

Or more accurately, the server MAY return the JPEG with a 200 OK for a GET on /uri, or it may return the same result as a successful GET on /uri-documentation (a description of the /uri in some machine-readable format).
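To illustrate the two outcomes just described, here is a rough Python sketch of what a client could conclude from each 200 OK. It is only a sketch of the proposal as read in this thread: the `Response` type, the `classify` helper and the tuple encoding of triples are all hypothetical, not anything the proposal defines.

```python
from dataclasses import dataclass, field

URI = "http://example.org/uri"
DOC = "http://example.org/uri-documentation"


@dataclass
class Response:
    """Hypothetical, simplified HTTP response."""
    status: int
    content_type: str
    # (subject, predicate, object) triples parsed from the body, if it was RDF
    triples: set = field(default_factory=set)


def classify(target, resp):
    """What a client may conclude about a response to GET <target>."""
    if resp.status != 200:
        return "no conclusion"
    for subject, predicate, obj in resp.triples:
        if subject == target and predicate == ":describedby":
            return "description of " + target + " (see " + obj + ")"
    # No :describedby triple: the body may be the content of <target>
    # or a description of it; the 200 OK alone does not say which.
    return "unknown: content or description"


jpeg = Response(200, "image/jpeg")
turtle = Response(200, "text/turtle", {(URI, ":describedby", DOC)})

print(classify(URI, jpeg))
print(classify(URI, turtle))
```

Note that the JPEG case falls through to "unknown": a non-RDF body has nowhere to carry the triple.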


Is this limited to machine readable format, why not human readable too?

It appears that if one can return text/turtle for a GET request on </foo>, where { </foo> a :Horse } then one should also be able to return an image/jpeg which visually describes the horse.

If yes, this seems like massively unexpected functionality, like a proposal to treat 
"Accept: some/meta-data" like a DESCRIBE verb, and seems to exaggerate the URI 
substitution problem (as in /uri would be taking as naming the representation of 
/uri-documentation).

If no, where's the language which precludes this? (and how would that language 
go, given that it's exactly the same protocol flow and nothing has changed - 
other than the reader presuming that /uri now identifies something that does 
have a representation that can be transferred over HTTP vs identifying 
something that doesn't have a representation that can be transferred over HTTP).


I don't really understand what you think it needs to say I'm afraid.

Question B:

How would conneg work, and what would the presence of a Content-Location 
response header mean? Would HTTPBis need to be updated?


I can't see any way in which any of that would work differently from currently.

Okay, given the use-case of a GET on </uri> returning 200 OK, and the response containing a representation of </uri-documentation> in text/turtle:


What would the value of the Content-Location header be? /uri-documentation?

Short version: this proposal would mean many sections of httpbis would need to be reworded and changed, as it conflicts to the point of saying the opposite.
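For context on the Content-Location question, here is a minimal Python sketch of how a client might read that header. It assumes my understanding of the HTTPbis drafts: with no Content-Location, a 200 OK payload for a GET is a representation of the requested URI, and a Content-Location that differs from the request URI is the server's claim that the payload represents that other resource. The helper name `payload_identity` is made up for illustration.

```python
from urllib.parse import urljoin


def payload_identity(request_uri, content_location=None):
    """Return the URI that the 200 OK payload is claimed to represent."""
    if content_location is None:
        # No header: the payload represents the requested resource.
        return request_uri
    # Content-Location may be relative; resolve it against the request URI.
    return urljoin(request_uri, content_location)


print(payload_identity("http://example.org/uri"))
print(payload_identity("http://example.org/uri", "/uri-documentation"))
```

Under that reading, a server answering GET /uri with the description and "Content-Location: /uri-documentation" would be saying the payload represents /uri-documentation, not /uri.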

Question C:

Currently 303 "indicates that the requested resource does not have a representation of its own 
that can be transferred by the server over HTTP", and the Link header makes it clear that you 
are dealing with two different things (/uri and /uri-documentation), but where does this proposal 
make it clear at transfer protocol level that the representation included in the http response is a 
representation of another resource which describes the requested resource (rather than it being as 
the spec defines "a representation of the target resource")?


The proposal says that applications can draw no conclusions from information at 
the transfer protocol level about /uri. In particular, it can't tell whether 
the representation that is returned with /uri is *the content* of /uri or *the 
description* of /uri. Further information about /uri (eg that it is a 
foaf:Person) may help the application work out that the representation was *a 
description*.

Wow, so every URI no longer refers to anything unless it's explicitly stated in some RDF somewhere, and if one looks it up in a browser and sees a picture of a monkey, they are incorrect for saying it refers to a picture of a monkey if some RDF document somewhere describes it as a :SpaceShuttle.

Can the TAG really just say "okay, all http:// URIs no longer refer to anything"?

However, an application can draw conclusions about /uri-documentation, assuming 
it gives a 2XX response, because it has been retrieved as the result of 
following a :describedby link (or if it were the target of a 303 redirection). 
The application can tell that the representation from /uri-documentation is 
*the content* of /uri-documentation and *the description* of /uri.

I can't see how it could tell that "the representation from /uri-documentation is *the content* of /uri-documentation and *the description* of /uri". Perhaps that it's *a* description of /uri, but certainly not that it's "the content of /uri-documentation"; the proposal itself removes all notion of a representation being a representation of the current state of the requested uri.

If <a> is described by <b>, and <b> is described by <c>, then a GET on <a> can now return <b>, whilst a GET on <b> can return <c>, and so forth, and if that :describedby triple is missing, or you don't get back RDF in some form, then you don't know what you retrieved or if the requested uri refers to it at all.
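A small Python sketch of that chain, assuming three resources a, b and c where a is described by b and b is described by c. The `DESCRIBED_BY` table and `may_return` helper are hypothetical names used only to show the point.

```python
# Hypothetical :describedby links: a is described by b, b is described by c.
DESCRIBED_BY = {"a": "b", "b": "c"}


def may_return(uri):
    """Resources whose representation a 200 OK for GET <uri> might carry."""
    candidates = [uri]  # the resource's own content
    if uri in DESCRIBED_BY:
        candidates.append(DESCRIBED_BY[uri])  # or a description of it
    return candidates


for uri in ("a", "b", "c"):
    print(uri, "->", may_return(uri))
```

A client that receives b's representation therefore cannot tell from the response alone whether it asked for a or for b.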

Either way, there is no implication that what you've got from 
http://example.org/uri is the content of http://example.org/uri (or that 
http://example.org/uri identifies an information resource), but there is an 
implication that what you get from http://example.org/uri-documentation is the 
content of http://example.org/uri-documentation (and that 
http://example.org/uri-documentation is an information resource).

Sorry I don't follow, how is there an implication from a 200 OK for <uri-a> that it's 
not an IR and for <uri-b> that it is an IR?


Because /uri-documentation was reached through a :describedby link. This extra 
information allows the application to draw the conclusion that the 
representation from /uri-documentation is *the content* of /uri-documentation.

And when you don't reach it via a ":describedby" link (as in 99.99% of cases on the web)? Also see above, same points.

If there was a Set of all Things (Set-A), then that set would have two sets, "the 
set of all things which can be transferred via a transfer protocol like HTTP" 
(Set-B), and then everything else (Set-C) which comprises Set-A minus Set-B. As far as I 
can tell, the one thing that determines whether something is a member of the Set-B or 
Set-C, for HTTP, is that 200 OK in response to a GET, hence why we need the 303.

This proposal appears to try and override that "rule" (fact) by saying let the 
content of a representation define what is a member of Set-B or Set-C, however the act of 
dereferencing itself is what determines whether an identified thing is a member of Set-B, 
as Set-B is the set of all things that can be dereferenced. Hence my confusion at this 
proposal.


The "fact" that a 200 OK determines whether something is a member of Set-A or 
Set-B is a design choice made by httpRange-14, not a fundamental truth of the universe. 
The proposal makes a different design choice, in saying that you need more than just a 
200 OK response to say, beyond all doubt, that a URI refers to something that is member 
of Set-B.

Apologies, but I have to disagree completely here. I can say I'm a goldfish, but I have the properties of a human and belong in the Set of Humans; no matter how much I say it, I'm never going to be a goldfish - there's no design choice there. Similarly, if a representation of something was retrieved via HTTP, then it belongs to the set of things which can have their representations retrieved via HTTP; that just is a fact, not a design decision.

Sorry this appears so negative, but... well the above hopefully explains. Personally I see it as ripping the foundational constraints of the web/uris/http away to try and save an extra GET request in a few cases.


Nathan
