Re: [OMPI devel] failure withzero-lengthReduce()andbothsbuf=rbuf=NULL

Christian Siebert Thu, 11 Feb 2010 12:19:22 -0500


Jeff Squyres wrote:

There's no synchronization *guarantee* in MPI collectives except forMPI_BARRIER. [...] BCAST *can* synchronize; I'm not saying it has to.

I fully agree with Jeff and would even go a step further.

As has already been noted, there are also some implicit datadependencies due to the fact that we do "message passing". This meansthat a receiver can only get a message after the sender has posted it.So yes, all processes get their broadcast message only after the rootcalled MPI_Bcast and the like. But does this necessarily imply thatall processes block in such a call and return only after the sendersjoined the communication? In my opinion, no correct and portable MPIprogram should rely on anything that is not explicitly stated in thestandard.

Example to think about: I developed an MPI wrapper several years ago(for a slow interconnect), which almost immediately returned fromblocking MPI calls. Instead of wasting time to wait for the senders,it utilized features of the virtual memory subsystem to protect thegiven message buffers from not-yet-allowed accesses (i.e., writeaccess for send buffers and read access for receive buffer), andstarted the communication in the background like the nonblockingvariants. The blocking (if at all) happened only at the time the datawas actually accessed by the processor (so this implicitsynchronization point we are taking about was just delayed). Thisenabled communication and computation overlap without rewriting theapplication (even for send operations or large messages due topipelining) - just relink and see if it gets faster. I'm not totallysure that this is 100% MPI conform - but as long as programmers don'trely on anything that is not explicitly stated in the standard, theycould benefit from such implementations...

Re: [OMPI devel] failure withzero-lengthReduce()andbothsbuf=rbuf=NULL

Reply via email to