On 15/09/2008, at 11:13 PM, Julian Reschke wrote:

Mark Nottingham wrote:
On 12/09/2008, at 5:25 PM, Julian Reschke wrote:
As far as I can tell, if you use the ETag based approach, and multiple clients try to post to the same collection (POST URI), then you'll have to disambiguate the requests. That problem would go away if each of them would use a different URL.
I read the doc as saying that the server would provide unique ETags somehow...

Disambiguating by ETag probably would work, but that doesn't feel right to me. If multiple resumable transfers can be in progress at the same point of time, then this really sounds like multiple resources (thus multiple URIs), not multiple variants of the same resource to me.
Huh. That's very revealing, I think (if unintentional :) POST can already create a new resource with a new, server-selected URI, and the pattern for doing so is already described with POST, 201 and Location.

Of course POST can do that. Did anybody argue something else?

No, just laying the groundwork...


Question: If I want to make this sort of request resumeable, do I do this?
REQ: POST /a
REQ: Content-Range: bytes */100
RES: 308 Resume Incomplete
RES: Location: /b
REQ: POST /b
REQ: Content-Range: 0-100/100
REQ: [bytes]
RES: 200 OK
or this?
REQ: POST /a
REQ: Content-Range: bytes */100
RES: 308 Resume Incomplete
RES: Location: /z
REQ: POST /z
REQ: Content-Range: 0-100/100
REQ: [bytes]
RES: 201 Created
RES: Location: /b
?
The important part here is: is this protocol defining a "temporary" resource (with a very specific interface) for the Location in a 308 refers to, or is the Location in a 308 referring to a "regular" resource that's used for more than that?

If I understand correctly, in the first example the server immediately assigns the URI for the resource-to-be-created, let's the client know it, and lets it transfer the remaining bytes to that resource. In this case, the server would need to make sure that this resource is only available to the client until the transfer is completed.

... or use ETags on the requests to differentiate them.

In the second case, the server assigns a temporary URI which is just used to complete the transfer. Once that's done, the "final" resource is being created. This looks similar to what Roy proposed in <http://lists.w3.org/Archives/Public/ietf-http-wg/2008AprJun/0082.html >.

I think the second approach is more versatile, because it also covers cases where the server wouldn't create a new URI upon POST.

... as would using ETags to differentiate requests. :) Understand I'm not pushing that solution strongly (yet), It's just that they're functionally equivalent so far.


It's interesting to note that the second approach (with the temp resource) preserves the 201 status code in the interchange, while in the former approach, it's not there (308 usurps it).
Now look at it with PUT (to a not-yet-existent resource);
REQ: PUT /a
REQ: Content-Range: bytes */100
RES: 308 Resume Incomplete
RES: Location: /b
REQ: PUT /b
REQ: Content-Range: 0-100/100
REQ: [bytes]
RES: 201 Created
RES: Location: /a
Here, if we use URIs, /b *has* to be a "temporary" resource with a very specifically defined behaviour; it accepts PUTs and has a side effect of having its bytes copied to /a (presumably when the final 201 is sent).

Yes.

BTW: it wouldn't necessarily be PUT; it could be anything that allows "appending", such as POST or PATCH.

Good point.


My point here is that there are actually some pretty deep differences between the URI approach and the ETag approach; the URI approach is much more intrusive and needs to be specified in a different way (e.g., talking about what methods to use, the nature of the resource created, etc.).

Well, not entirely.

Let's say you've got three concurrent resumable transfers started to /a, and the server has assigned the etags "C1-1", "C2-1" and "C3-1" to the three clients.

Client 1 starts its upload:

REQ: POST /a
REQ: If-Match: "C1-1"
REQ: Content-Range: 0-50/100
REQ: [bytes]
RES: 200 OK
RES: ETag: "C1-2"

...what I'm concerned if is that we're essentially introduce ETag- based variant-selection here.

So what do these ETags represent in requests *other* than resumable uploads, such as in:

REQ: GET/a
REQ: If-Match: "C1-1"

?

Will it have an effect?

So just avoiding new URIs may look simpler first, but it also requires additional specification work.

It will require additional spec work if this use of ETags is incompatible with existing ones. If it is, they shouldn't be used for this purpose at all, but I don't *think* they are (happy to be proven wrong, as always).

I.e., yes, if-match will work as specified; what else needs to be said?


Back to your comment;
Disambiguating by ETag probably would work, but that doesn't feel right to me. If multiple resumable transfers can be in progress at the same point of time, then this really sounds like multiple resources (thus multiple URIs), not multiple variants of the same resource to me.
I don't know that I agree; with PUT, it's very natural to use ETags (you avoid creating the temporary resource, and have the option of 409'ing any concurrent PUTs after the first), whereas with POST, you're just

That assumes that PUT with Content-Range can be used today, which really isn't the case, unless the client can be confident that the server actually understands PUT with ranges.

That seems to be a problem with all the approaches on the table, according to the flows in the current document. By the letter of the law, if the server doesn't understand a Content-* header on a PUT request, it should refuse it, but we already have an open issue or two (#79, #102) on that...


pushing the assignment of a final identifier for the created resource until the entire request entity is received (which is the case with the URI-based approach anyway, unless you're arguing that POST is a special case and *doesn't* create a temporary resource, unlike PUT), and you still have the option of not assigning it any identity (just as many POST processors do today). So, I'm firmly leaning in the direction of the ETags-only approach now; I think the selection of a URI for created resources is separable, and should be separate.

I agree with that part; the URI assigned for the upload really should be temporary.

With that, the approaches are almost identical; in both cases unique identifiers are minted (ETags or URIs), the server needs to deal with house keeping, and the impact of other methods must be understood and specified.

BR, Julian




BTW, we should really be talking about an Internet-Draft at this point, rather than a wiki page. Google guys, when will that happen?


--
Mark Nottingham       [EMAIL PROTECTED]


Reply via email to