Hi Eugene,
Eugene Loh wrote:
> At 2500 bytes, all messages will presumably be sent "eagerly" -- without
> waiting for the receiver to indicate that it's ready to receive that
> particular message. This would suggest congestion, if any, is on the
> receiver side. Some kind of congestion could, I suppose, still occur
> and back up on the sender side.
Can anyone chime in on what the message-size limit is for an "eager"
transmission?
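(For what it's worth, in Open MPI this appears to be a per-transport MCA parameter rather than anything mandated by the MPI standard, so the limit differs between, say, TCP and shared memory. I believe ompi_info can report it; the exact parameter names vary by BTL:)

```shell
# List the TCP transport's tunables and pick out its eager limit.
# Other transports (sm, openib, ...) have their own *_eager_limit params.
ompi_info --param btl tcp | grep eager_limit
```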
> On the other hand, I assume the memory imbalance we're talking about is
> rather severe. Much more than 2500 bytes to be noticeable, I would
> think. Is that really the situation you're imagining?
The memory imbalance is drastic. I'm expecting 2 GB of memory use per
process. The well-behaved processes (13/16) use the expected amount of
memory; the remaining misbehaving processes (3/16) use more than twice
as much. The specifics vary from run to run, of course. So, yes, there
are gigabytes of unexpected memory use to track down.
> There are tracing tools to look at this sort of thing. The only one I
> have much familiarity with is Sun Studio / Sun HPC ClusterTools. Free
> download, available on Solaris or Linux, SPARC or x64, plays with OMPI.
> You can see a timeline with message lines on it to give you an idea if
> messages are being received/completed long after they were sent.
> Another interesting view is constructing a plot vs time of how many
> messages are in-flight at any moment (including as a function of
> receiver). Lots of similar tools out there... VampirTrace (tracing side
> only, need to analyze the data), Jumpshot, etc. Again, though, there's
> a question in my mind if you're really backing up 1000s or more of
> messages. (I'm assuming the memory imbalances are at least Mbytes.)
I'll check out Sun HPC ClusterTools. Thanks for the tip.
Assuming the problem is congestion and that messages are backing up,
is there an accepted method of dealing with this situation? It seems
to me the general approach would be

    if (number of outstanding messages > high-water mark)
        wait until (number of outstanding messages < low-water mark)

where I suppose the "number of outstanding messages" is defined as the
number of messages that have been sent but not yet received by the
other side. Is there a way to get this number from MPI without having
to code it at the application level?
Thanks,
Shaun