Re: [O-MPI users] LAM vs OPENMPI performance

Patrick Geoffray Wed, 4 Jan 2006 16:48:46 -0500

Hi Tom,

users-requ...@open-mpi.org wrote:

I am pretty sure that LAM exploits the fact that the virtual processorsare all

sharing the same memory,  so communication is via memory and/or the PCI bus
of the system, while my OPENMPI configuration doesn't exploit this.  Is this
a reasonable diagnosis of the dramatic difference in performance?  More

It would be more likely that OpenMPI is using shared memory and pollingon it whereas LAM is using sockets, or at least blocking on something.

Polling is a bad thing when oversubscribing processor. When you block ona socket (or any OS handle), the process immediately yield the CPU andis removed from the scheduler. When you poll waiting for a send orreceive to complete, you are burning cycles on the CPU and the schedulerwill wait for the next quantum of time before running another process.

So, if you send a message between 2 processes sharing the sameprocessor, the latency will be in the order of half of the schedulerquantum (10ms on Linux) if they are both polling. Things are much fasterwhen processes are polling on different CPUs (1-2 us) but the blockingsocket overhead (~20us) is way better than the quantum of time when youdon't have several processors.

importantly, how to I reconfigure OPENMPI to match the LAM performance.

Try disabling the shared memory device in OpenMPI. Unfortunately, I haveno clue how to do it.


Patrick
--
Patrick Geoffray
Myricom, Inc.
http://www.myri.com

Re: [O-MPI users] LAM vs OPENMPI performance

Reply via email to