Re: [OMPI users] mpirun only works when -np <4 (Gus Correa)

Eugene Loh Thu, 10 Dec 2009 18:29:25 -0500

Gus Correa wrote:

Why wouldn't shared memory work right on Nehalem?

We don't know exactly what is driving this problem, but the issueappears to be related to memory fences. Messages have to be posted to areceiver's queue. By default, each process (since OMPI 1.3.2) has onlyone queue. A sender acquires a lock to the queue, writes a pointer toits message, advances the queue index, and releases the lock. If thereare problems with memory barriers (or our use of them), messages can getlost, overwritten, etc. One manifestation could be hangs. Oneworkaround, as described on this mail list, is to increase the number ofqueues (FIFOs) so that each sender gets its own.

I think that's what's happening, but we don't know the root cause. Thetest case in 2043 on the node I used for testing works like a gem forGCC versions prior to 4.4.x, but with 4.4.x variants it falls hard onits face. Is the problem with GCC 4.4.x? Or, does that compiler exposea problem with OMPI? Etc.

It is amazing to me that this issue hasn't surfaced on this list before.

The trac ticket refers to a number of e-mail messages that might berelated. At this point, however, it's hard to know what's related andwhat isn't.


Gus Correa wrote:

FYI, I do NOT see the problem reported by Matthew et al. on our AMDOpteron Shanghai dual-socket quad-core. They run a quite outdatedCentOS kernel 2.6.18-92.1.22.el5, with gcc 4.1.2. and OpenMPI 1.3.2.

In my mind, GCC 4.1.2 may well be the ticket here. I find strongcorrespondence with GCC rev (< 4.4.x vs >= 4.4.x).

Moreover, all works fine if I oversuscribe up to 256 processes on onenode.Beyond that I get segmentation fault (not hanging) sometimes, but notalways.
I understand that extreme oversubscription is a no-no.


Sounds like another set of problems.

Re: [OMPI users] mpirun only works when -np <4 (Gus Correa)

Reply via email to