Re: [OMPI users] How to use Multiple links withOpenMPI??????????????????

2009-06-01 Thread Jeff Squyres
On May 29, 2009, at 12:31 AM, shan axida wrote: Is it true to use bidirectianal communication with MPI in ethernet Cluster? Are you asking if Open MPI uses bi-direction TCP sockets? Yes, it does: we open one TCP socket between the MPI sender and receiver, and if the order is reversed

Re: [OMPI users] mpi trace visualization

2009-06-01 Thread Eugene Loh
Roman Martonak wrote: I would like to profile the MPI code using the vampir trace integrated in openmpi-1.3.2. In order to visualize the trace files, apart from commercial vampir, is there some free viewer for the OTF files ? I'm rusty on this stuff. If you go to

Re: [OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Joe Landman
Jeff Layton wrote: Jeff Squyres wrote: On Jun 1, 2009, at 2:04 PM, Jeff Layton wrote: error: executing task of job 3084 failed: execution daemon on host "compute-2-2.local" didn't accept task This looks like an error message from the resource manager/scheduler -- not from OMPI (i.e., OMPI

Re: [OMPI users] Performance testing software?

2009-06-01 Thread Eugene Loh
HPL can "stress test" the MPI, but it is typically relatively insensitive to MPI performance. The usual use produces a measure of the peak floating-point performance of the system. A broader set of system performance measurements are found in the HPCC (HPC Challenge) tests, which include

Re: [OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Jeff Squyres
On Jun 1, 2009, at 2:58 PM, Jeff Layton wrote: >> error: executing task of job 3084 failed: execution daemon on host >> "compute-2-2.local" didn't accept task > > This looks like an error message from the resource manager/scheduler > -- not from OMPI (i.e., OMPI tried to launch a process on a

Re: [OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Rolf Vandevaart
On 06/01/09 14:58, Jeff Layton wrote: Jeff Squyres wrote: On Jun 1, 2009, at 2:04 PM, Jeff Layton wrote: error: executing task of job 3084 failed: execution daemon on host "compute-2-2.local" didn't accept task This looks like an error message from the resource manager/scheduler -- not

Re: [OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Jeff Layton
Jeff Squyres wrote: On Jun 1, 2009, at 2:04 PM, Jeff Layton wrote: error: executing task of job 3084 failed: execution daemon on host "compute-2-2.local" didn't accept task This looks like an error message from the resource manager/scheduler -- not from OMPI (i.e., OMPI tried to launch a

Re: [OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Jeff Squyres
On Jun 1, 2009, at 2:04 PM, Jeff Layton wrote: error: executing task of job 3084 failed: execution daemon on host "compute-2-2.local" didn't accept task This looks like an error message from the resource manager/scheduler -- not from OMPI (i.e., OMPI tried to launch a process on a node and

[OMPI users] Problem getting OpenMPI to run

2009-06-01 Thread Jeff Layton
Good morning, I think I sent this out last week but I did some "experimentation" and kind-of/sort-of got my OpenMPI application to run. But I do have a weird problem. I can get the application (build with OpenMPI-1.3.2 with gcc and the app is built with Intel 10.2) to run on the IB network (not

Re: [OMPI users] make vt_tracefilter.cc:133: internal compilererror: Segmentation fault - openmpi-1.3.2

2009-06-01 Thread Jeff Squyres
This looks like your compiler seg faulted. I think you should contact your compiler vendor and find out why. Additionally, you can disable the optional/3rd-party-add-on VampirTrace package with --enable-contrib-no-build=vt. This is the part of the code where your compiler seg faulted, so

[OMPI users] overlapping communicators?

2009-06-01 Thread tsilva
Hi, I have a Multiple Program Multiple Data with three programs running in parallel, say A, B and C. C is much slower so in order to balance the load I want to parallelize C into C0 to Cn (SPMD). There are very frequent communications between Ci processes and not so frequent, but still

Re: [OMPI users] sync problem

2009-06-01 Thread Gus Correa
Hi Danesh Make sure you have 700GB of RAM on the sum of all nodes you are using. Otherwise context switching and memory swapping may be the problem. MPI doesn't perform well in this conditions (and may break, particularly on large problems, I suppose). A good way to go about it is to look at