Re: HTTP transport?

Doug Cutting Fri, 09 Oct 2009 12:56:58 -0700

Sanjay Radia wrote:

Will the RPC over HTTP be transparent so that we can replace with a
different layer if needed?


Yes.

My worry was the separation of data and checksums; someone had mentioned
that one could do this over 2 RPCs - that is not transparent.

That was suggested as a possibility if we did not want to use RPC for data, but rather raw HTTP, e.g., with a separate URL per block. The zerocopy support built into most HTTP servers only supports entire responses from a single file, so if we wanted to take advantage of these zerocopy implementations we'd not use RPC for block access, but could use HTTP and hence share security, etc. Using raw HTTP for block access might also perform better, since it can use TCP flow control, rather than RPC call/response. In my microbenchmarks, RPC call/response was fast enough to easily saturate disks and networks, so that might be moot, although RPC call/response for file data may use more CPU than we'd like. With our own transport implementation we could get RPC call/response to use zerocopy for file data.
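For readers unfamiliar with zerocopy, here is a minimal sketch of the idea in Java, assuming the block lives in a single local file. The class name ZeroCopySketch and the method sendBlock are hypothetical and are not taken from any existing code discussed in this thread. FileChannel.transferTo lets the JVM hand the copy to the operating system (for example via sendfile) when the target is a socket channel and the platform supports it; whether a real zero-copy path is actually taken depends on the OS and the channel type, so treat this as an illustration rather than a measured implementation.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.WritableByteChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical sketch: stream one whole block file to a channel
// without copying the bytes through a user-space buffer.
public class ZeroCopySketch {

    // Sends the entire file to 'out' and returns the number of bytes sent.
    static long sendBlock(Path blockFile, WritableByteChannel out) throws IOException {
        try (FileChannel in = FileChannel.open(blockFile, StandardOpenOption.READ)) {
            long size = in.size();
            long sent = 0;
            // transferTo may move fewer bytes than requested, so loop.
            while (sent < size) {
                long n = in.transferTo(sent, size - sent, out);
                if (n == 0) {
                    // No progress (e.g. the file shrank underneath us); stop.
                    break;
                }
                sent += n;
            }
            return sent;
        }
    }
}
```

Note that this transfers an entire file as one response, which matches the single-file restriction mentioned above; interleaving data with per-call RPC framing or checksums would need a different arrangement.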

I assume that we are
going to create a branch that moves the data transfer protocols to RPC and
test the performance and if it is good then we commit and move to RPC?

Yes. We obviously cannot change the file data transfer protocol without benchmarking. Ideally file data transfer can share as much as possible with other protocols. The most optimistic approach would be to use HTTP-based RPC call/response, so we ought to benchmark that. This was the purpose of my recently-reported microbenchmarks.

We also need to determine whether both TCP flow-control and zerocopy are critical to data file performance. If both are indeed critical, and HTTP proves sufficient for everything else, then we should consider using non-RPC HTTP for file data transfer, since it supports both zerocopy and TCP-based flow control, and the implementation of security, etc. could be shared. But, on the other hand, if HTTP is deemed inappropriate for security and we develop our own RPC transport that permits zerocopy, and TCP flow-control over entire blocks is not required, then we might use RPC for file data. What I'm hoping we can avoid is, as today, using different transports for different protocols, re-implementing security, connection pooling, async request processing, etc. for each, requiring separate configuration and ports for each, etc. But even that might be required. We don't know yet.

I think starting with HTTP as a hypothesis permits us to make progress without a lot of up-front investment.


Doug
