On 23 Mar 2009, at 23:36, Shaun Jackman wrote:
loop {
  MPI_Ibsend (for every edge of every leaf node)
  MPI_Barrier
  MPI_Iprobe/MPI_Recv (until no messages pending)
  MPI_Allreduce (number of nodes removed)
} until (no nodes removed by any node)

Previously, I attempted to use a single MPI_Allreduce without the MPI_Barrier:

You need both the MPI_Barrier and the synchronisation semantics of the MPI_Allreduce in this example. It's important that each send matches a recv from the same iteration, so you need to ensure all sends have been posted before you call probe, and a Barrier is one way of doing this. You also need the synchronisation semantics of the Allreduce to ensure the MPI_Iprobe doesn't match a send from the next iteration of the loop.
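
For reference, the pattern above comes out something like this in C; the neighbour list, the single-int payload and the removal bookkeeping are placeholders I've made up, and I'm assuming a large enough buffer has already been attached with MPI_Buffer_attach for the buffered sends:

#include <mpi.h>

/* Sketch of the loop quoted above: buffered sends, a barrier so all
 * sends have been posted, drain pending messages with Iprobe/Recv,
 * then Allreduce the number of nodes removed this pass. */
static void prune_loop(MPI_Comm comm, const int *neighbours, int nneighbours)
{
    int removed_globally;
    do {
        int removed_here = 0;      /* nodes this rank removed this pass */

        /* One buffered send per edge of every leaf node (just one
         * dummy message per neighbour here for brevity). */
        for (int i = 0; i < nneighbours; ++i) {
            int edge = 0;          /* placeholder payload */
            MPI_Request req;
            MPI_Ibsend(&edge, 1, MPI_INT, neighbours[i], 0, comm, &req);
            MPI_Wait(&req, MPI_STATUS_IGNORE);  /* completes locally once buffered */
        }

        MPI_Barrier(comm);         /* every rank has now posted its sends */

        /* Receive whatever is pending for this iteration. */
        int pending;
        MPI_Status status;
        MPI_Iprobe(MPI_ANY_SOURCE, MPI_ANY_TAG, comm, &pending, &status);
        while (pending) {
            int edge;
            MPI_Recv(&edge, 1, MPI_INT, status.MPI_SOURCE, status.MPI_TAG,
                     comm, MPI_STATUS_IGNORE);
            /* ... process the edge, possibly ++removed_here ... */
            MPI_Iprobe(MPI_ANY_SOURCE, MPI_ANY_TAG, comm, &pending, &status);
        }

        /* Global count of removed nodes; also the second sync point. */
        MPI_Allreduce(&removed_here, &removed_globally, 1, MPI_INT,
                      MPI_SUM, comm);
    } while (removed_globally > 0);
}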

Perhaps there is a better way of accomplishing the same thing, however. MPI_Barrier synchronises all processes, so it is potentially a lot more heavyweight than it needs to be; in this example you only need to synchronise with your neighbours, so it might be quicker to use a send/receive for each of your neighbours containing a true/false value rather than to rely on the existence or not of a message. In other words, the barrier is needed because you don't know how many messages there are; it may well be quicker to have a fixed number of point-to-point messages than an extra global synchronisation. The added advantage of doing it this way is that you could remove the Probe as well.
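
Something along these lines, say, with the number of edges carried in the first element of a fixed-size message so the receiver knows exactly one message is coming from each neighbour; the buffer layout and bounds are invented for illustration:

#include <mpi.h>

enum { MAX_NEIGHBOURS = 64, MAX_EDGES = 64 };   /* assumed bounds */

/* One message to and one message from each neighbour per iteration,
 * even when there are no edges to send, so no Barrier and no Iprobe
 * are needed. */
static void exchange_edges(MPI_Comm comm, const int *neighbours, int nneighbours)
{
    int sendbuf[MAX_NEIGHBOURS][1 + MAX_EDGES];
    int recvbuf[1 + MAX_EDGES];
    MPI_Request reqs[MAX_NEIGHBOURS];

    for (int i = 0; i < nneighbours; ++i) {
        sendbuf[i][0] = 0;         /* how many edges follow; zero is fine */
        /* ... fill sendbuf[i][1..] with edges destined for neighbours[i] ... */
        MPI_Isend(sendbuf[i], 1 + MAX_EDGES, MPI_INT, neighbours[i], 0,
                  comm, &reqs[i]);
    }

    /* Exactly one receive per neighbour. */
    for (int i = 0; i < nneighbours; ++i) {
        MPI_Recv(recvbuf, 1 + MAX_EDGES, MPI_INT, neighbours[i], 0,
                 comm, MPI_STATUS_IGNORE);
        for (int e = 0; e < recvbuf[0]; ++e) {
            /* ... process recvbuf[1 + e] ... */
        }
    }

    MPI_Waitall(nneighbours, reqs, MPI_STATUSES_IGNORE);
}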

Potentially it would be possible to remove the Allreduce as well and use the tag to identify the iteration count, assuming of course you don't need to know the global number of branches at any iteration. One problem with this approach can be that one process gets very slow and swamped with unexpected messages, but assuming your neighbour count is small this shouldn't be a problem. I'd expect not only a net gain from changing to this approach but for the application to scale better as well.
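
A rough sketch of the tag idea, with a made-up single-flag payload; in a real code you'd also want to keep the tag below MPI_TAG_UB:

#include <mpi.h>

enum { MAX_NEIGHBOURS = 64 };      /* assumed bound */

/* Carry the iteration count in the tag so a receive posted for
 * iteration 'it' can never match a send from iteration 'it + 1'. */
static void exchange_flags(MPI_Comm comm, int it,
                           const int *neighbours, int nneighbours)
{
    int sendflag[MAX_NEIGHBOURS], recvflag[MAX_NEIGHBOURS];
    MPI_Request reqs[MAX_NEIGHBOURS];

    for (int i = 0; i < nneighbours; ++i) {
        sendflag[i] = 0;           /* e.g. "I removed a node" flag */
        MPI_Isend(&sendflag[i], 1, MPI_INT, neighbours[i],
                  /* tag = */ it, comm, &reqs[i]);
    }
    for (int i = 0; i < nneighbours; ++i)
        MPI_Recv(&recvflag[i], 1, MPI_INT, neighbours[i],
                 /* tag = */ it, comm, MPI_STATUS_IGNORE);

    MPI_Waitall(nneighbours, reqs, MPI_STATUSES_IGNORE);
}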

Finally, I've always favoured Irecv/Send over Ibsend/Recv as in the majority of cases this tends to be faster; you'd have to benchmark your specific setup, however.
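
For what it's worth, the Irecv/Send version of the neighbour exchange looks roughly like this; pre-posting every receive before any send means incoming data can land straight in the user buffer rather than going through the attached buffer an Ibsend needs (buffers and bounds are again just placeholders):

#include <mpi.h>

enum { MAX_NEIGHBOURS = 64, LEN = 16 };   /* assumed bounds */

static void exchange_irecv_send(MPI_Comm comm,
                                const int *neighbours, int nneighbours)
{
    int sendbuf[MAX_NEIGHBOURS][LEN], recvbuf[MAX_NEIGHBOURS][LEN];
    MPI_Request reqs[MAX_NEIGHBOURS];

    /* Post every receive first ... */
    for (int i = 0; i < nneighbours; ++i)
        MPI_Irecv(recvbuf[i], LEN, MPI_INT, neighbours[i], 0, comm, &reqs[i]);

    /* ... then the plain sends; each rank's receives are posted
     * unconditionally before it sends, so there is no deadlock. */
    for (int i = 0; i < nneighbours; ++i) {
        /* ... fill sendbuf[i] ... */
        MPI_Send(sendbuf[i], LEN, MPI_INT, neighbours[i], 0, comm);
    }

    MPI_Waitall(nneighbours, reqs, MPI_STATUSES_IGNORE);
}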

Ashley,
