Re: [OMPI users] bin/orted: Command not found.

2009-08-07 Thread Ralph Castain
Not that I know of - I don't think we currently have any way for you to specify a location for individual nodes. Is there some reason why you installed it this way? On Fri, Aug 7, 2009 at 11:27 AM, Kenneth Yoshimoto wrote: > > Hello, > > I have three sets of nodes, each

Re: [OMPI users] Automated tuning tool

2009-08-07 Thread Edgar Gabriel
Gus Correa wrote: Terry Frankcombe wrote: There's been quite some discussion here lately about the effect of OMPI tuning parameters, particularly w.r.t. collectives. Is there some tool to probe performance on any given network/collection of nodes to aid optimisation of these parameters? (I'm

[OMPI users] x4100 with IB

2009-08-07 Thread Michael Di Domenico
I have several Sun x4100 with Infiniband which appear to be running at 400MB/sec instead of 800MB/sec. It a freshly reformatted cluster converting from solaris to linux. We also reset the bios settings with "load optimal defaults". Does anyone know which bios setting i changed to dump the BW?

Re: [OMPI users] Automated tuning tool

2009-08-07 Thread john casu
I'm not sure that there is a general "best set" of parameters, given the dependence of that set on comms patterns, etc... Still, this *is* a classic parameter sweep and optimization problem (unlike ATLAS), with a small number of parameters, and is the sort of thing one should be able to hook

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread Gus Correa
Hi Craig, Terry, Neeraj, list Craig: A fellow here runs WRF. I grep the code and there is plenty of collectives there: MPI_[All]Gather[v], MPI_[All]Reduce, etc. Domain decomposition code like WRF, MITgcm, and other atmosphere and ocean codes has point-to-point communication to exchange

Re: [OMPI users] pipes system limit

2009-08-07 Thread Yann JOBIC
Rolf Vandevaart wrote: This message is telling you that you have run out of file descriptors. I am surprised that the -mca parameter setting did not fix the problem. Can you run limit or ulimit on your shell and send the information? I typically set my limit to 65536 assuming the system allows

Re: [OMPI users] pipes system limit

2009-08-07 Thread Rolf Vandevaart
This message is telling you that you have run out of file descriptors. I am surprised that the -mca parameter setting did not fix the problem. Can you run limit or ulimit on your shell and send the information? I typically set my limit to 65536 assuming the system allows it. burl-16 58

[OMPI users] pipes system limit

2009-08-07 Thread Yann JOBIC
Hello all, I'm using hpc8.2 : Lidia-jobic% ompi_info Displaying Open MPI information for 32-bit ... Package: ClusterTools 8.2 Open MPI: 1.3.3r21324-ct8.2-b09j-r40 [...] And i've got a X4600 machine (8*4 cores). When i'm trying to run a 32 processor jobs, i've got

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread Craig Tierney
Terry Dontje wrote: Craig, Did your affinity script bind the processes per socket or linearly to cores. If the former you'll want to look at using rankfiles and place the ranks based on sockets. TWe have found this especially useful if you are not running fully subscribed on your machines.

Re: [OMPI users] init failing

2009-08-07 Thread Jeff Squyres
I'm way behind on my mail; apologies for the delay in replying. Did you figure this out? As a pure guess, it sounds like you have a heterogeneous setup -- nodes have different distros and/or versions. As such, your glibc's may be different, etc. In such situations, it is definitely

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread Craig Tierney
nee...@crlindia.com wrote: Hi Craig, How was the nodefile selected for execution? Whether it was provided by scheduler say LSF/SGE/PBS or you manually gave it? With WRF, we observed giving sequential nodes (Blades which are in the same order as in enclosure) gave us some performance

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread Craig Tierney
Terry Dontje wrote: Craig, Did your affinity script bind the processes per socket or linearly to cores. If the former you'll want to look at using rankfiles and place the ranks based on sockets. TWe have found this especially useful if you are not running fully subscribed on your machines.

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
Hi Terry, I feel hierarchical collectives are slower compare to tuned one. I had done some benchmark in the past specific to collectives, and this is what i feel based on my observation. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
Hi Craig, WRF has pattern of talking to nearest neighbours like p+1, p-1, p+3 and p-3, where p is the particular process. But in addition to that, it also uses collective calls like MPI_Bcast, MPI_AlltoAllv, MPI_Allgather, MPI_Gather, MPI_Gatherv, MPI_Scatterv. Apparently

Re: [OMPI users] Bus Error (7) on PS3 running HPL (OpenMPI 1.2.8)

2009-08-07 Thread vipin kumar
may be because of insufficient storage space...???( I mean hard disk space.) On Thu, Aug 6, 2009 at 11:23 PM, Jeff Squyres wrote: > Any chance you could re-try the experiment with Open MPI 1.3.3? > > > On Aug 4, 2009, at 11:10 AM, Hoelzlwimmer Andreas -