Shaun Jackman wrote:
> Wow. Thanks, Eugene. I definitely have to look into the Sun HPC
> ClusterTools. It looks as though it could be very informative.
Great. And, I didn't mean to slight TotalView. I'm just not familiar
with it.
> What's the purpose of the 400 MB that MPI_Init has allocated?
It's for... um, I don't know. Let's see...
About a third of it appears to come from
    vt_open() -> VTThrd_open() -> VTGen_open
which I'm guessing is the VampirTrace instrumentation (presumably
allocating the buffers into which the MPI tracing data is collected).
It seems to go away if one doesn't collect message-tracing data.
Somehow, I can't see further into the library. Hmm. It does seem like
a lot, though. The shared-memory area (which MPI_Init allocates for
on-node message passing) is much smaller. The remaining roughly
130 MB per process seems to be independent of the number of processes.
An interesting exercise for the reader.
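If you want to take a crack at that exercise yourself, here is a rough
sketch of how I might start (my own quick hack, not anything from
ClusterTools). It's Linux-specific, since it just parses VmSize/VmRSS
out of /proc/self/status, and it only shows how much the whole process
grows across MPI_Init, not who inside the library asked for it:

/* mpiinit_mem.c -- rough check of how much memory MPI_Init adds.
 * Linux-only: parses VmSize/VmRSS from /proc/self/status.
 * Build:  mpicc mpiinit_mem.c -o mpiinit_mem
 * Run:    mpirun -np 2 ./mpiinit_mem
 */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Return the value (in kB) of one field of /proc/self/status, or -1. */
static long status_kb(const char *key)
{
    FILE *f = fopen("/proc/self/status", "r");
    char line[256];
    size_t n = strlen(key);
    long kb = -1;

    if (f == NULL)
        return -1;
    while (fgets(line, sizeof line, f) != NULL) {
        if (strncmp(line, key, n) == 0) {
            kb = strtol(line + n, NULL, 10);
            break;
        }
    }
    fclose(f);
    return kb;
}

int main(int argc, char **argv)
{
    long vsz0 = status_kb("VmSize:");
    long rss0 = status_kb("VmRSS:");
    int rank;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    printf("rank %d: MPI_Init grew VmSize by %ld kB, VmRSS by %ld kB\n",
           rank, status_kb("VmSize:") - vsz0, status_kb("VmRSS:") - rss0);

    MPI_Finalize();
    return 0;
}

One caveat: RSS only counts pages that have actually been touched, so a
tool that tracks malloc calls can easily report a larger figure than the
RSS delta here.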
> The figure of in-flight messages vs. time when the receiver sleeps is
> particularly interesting. The sender appears to stop sending and block
> once there are 30,000 in-flight messages. Has Open MPI detected
> congestion and begun waiting for the receiver to catch up? Or is it
> something simpler, such as the underlying write(2) call to the TCP
> socket blocking? If it's the first case, perhaps I could tune this
> threshold to behave better for my application.
This particular case is for two on-node processes, so no TCP is
involved. There appear to be about 55K allocations, which looks like
the 85K peak minus the 30K at which the sender stalls. So maybe some
resource gets exhausted at that point. Dunno.
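If you want to poke at the stall point yourself, something along these
lines (my guess at the pattern behind your figure, not your actual code)
ought to reproduce it: rank 0 blasts small blocking sends while rank 1
sleeps before receiving. Small messages go out eagerly, so the sends
return immediately until some resource fills up and a send finally
blocks; the last count printed is roughly the in-flight ceiling:

/* flood.c -- sender floods small messages at a sleeping receiver.
 * Rough reproducer for the "sender stalls at N in-flight messages"
 * behavior; where it stalls depends on the transport's buffering,
 * not on anything in this code.
 * Build:  mpicc flood.c -o flood
 * Run:    mpirun -np 2 ./flood
 */
#include <mpi.h>
#include <stdio.h>
#include <unistd.h>

#define NMSG 200000               /* more than the sender can buffer */

int main(int argc, char **argv)
{
    int rank, i, payload = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        for (i = 0; i < NMSG; i++) {
            MPI_Send(&payload, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
            if (i % 1000 == 0) {
                printf("sent %d\n", i);   /* last line printed ~ stall point */
                fflush(stdout);
            }
        }
    } else if (rank == 1) {
        sleep(30);                        /* let the sender run way ahead */
        for (i = 0; i < NMSG; i++)
            MPI_Recv(&payload, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}

If that count moves when you switch transports or fiddle with their
buffer settings, that would point at a resource limit rather than any
deliberate congestion-detection threshold.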
Anyhow, this may be starting to get into more detail than you (or I)
need to understand to address your problem. It *is* interesting stuff,
though.