[OMPI devel] fast path MPI_Sendrecv

Eugene Loh Sun, 21 Dec 2008 02:12:43 -0500

I've been looking at a "fast path" for sends and receives. This is like the sendi function, which attempts to send "immediately", without creating a bulky PML send request (which would be needed if, say, the send had to be progressed over multiple user MPI calls). One can do something similar on the receive side, and I have a workspace in which each BTL has the option of defining a "recvi" (receive immediate) function. The speedups I see in the prototype are gratifying: np=2 pingpong latencies are down 30%-2x, and they stay flat as np is increased. (OMPI, straight out of the box, sees pingpong latencies climb as np climbs due to the costs of polling.)

I'd like to have MPI_Sendrecv see the same performance benefits as well, but the MPI layer performs an MPI_Sendrecv as an Irecv/Send/Wait. The Irecv necessarily involves a receive request. So, the Send might be fast, but you lose most of the benefit of doing a fast path. I think the real way of doing a fast Sendrecv would be to do an immediate send (if you can) followed by an immediate receive.


It seems to me, there are two approaches here:

*) Teach the MPI layer about "fast path" sends and receives (sendi and recvi).
*) Teach the PML layer about "Sendrecv". That is, have MPI_Sendrecv call something like mca_pml_ob1_sendrecv(). (This is the approach I'd prefer.)


Either way, the MPI/PML interface would need a new function (or two).
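
To make the second approach concrete, here is a toy sketch in C of what a PML-level sendrecv might do: try an immediate send, then an immediate receive, and fall back to requests only for whichever half cannot complete right away. Everything in it is hypothetical and for illustration only: toy_btl_t, toy_sendi(), toy_recvi() and toy_sendrecv() are made-up names modeling a one-slot mailbox, not Open MPI code, and the request-based fallback is merely counted rather than implemented.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a BTL endpoint: a one-slot mailbox. */
typedef struct {
    char   slot[64];
    size_t len;               /* 0 means the slot is empty */
} toy_btl_t;

/* "Send immediate": succeeds only if the peer's slot is free right now. */
static bool toy_sendi(toy_btl_t *peer, const void *buf, size_t len)
{
    assert(peer != NULL && buf != NULL);
    if (peer->len != 0 || len == 0 || len > sizeof peer->slot)
        return false;         /* caller must fall back to a send request */
    memcpy(peer->slot, buf, len);
    peer->len = len;
    return true;
}

/* "Receive immediate": succeeds only if a message is already waiting. */
static bool toy_recvi(toy_btl_t *self, void *buf, size_t cap, size_t *got)
{
    assert(self != NULL && buf != NULL && got != NULL);
    if (self->len == 0 || self->len > cap)
        return false;         /* caller must fall back to a recv request */
    memcpy(buf, self->slot, self->len);
    *got = self->len;
    self->len = 0;            /* slot is free again */
    return true;
}

/*
 * Hypothetical PML-level sendrecv: immediate send, then immediate receive.
 * Returns how many halves (0, 1 or 2) would still need a full request.
 * A real implementation would create and progress those requests here;
 * this sketch only counts them.
 */
static int toy_sendrecv(toy_btl_t *peer, toy_btl_t *self,
                        const void *sbuf, size_t slen,
                        void *rbuf, size_t rcap, size_t *rlen)
{
    int requests_needed = 0;

    if (!toy_sendi(peer, sbuf, slen))
        requests_needed++;    /* slow path: send request */

    if (!toy_recvi(self, rbuf, rcap, rlen)) {
        *rlen = 0;
        requests_needed++;    /* slow path: receive request */
    }
    return requests_needed;
}
```

When both halves take the fast path the function returns 0 and no request is ever allocated, which is the case the Irecv/Send/Wait decomposition cannot reach because the Irecv always creates a receive request.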

Any suggestions/comments?

Any guidelines on how I add a new function to the MPI/PML interface?
