Re: [OMPI users] slurm and all-srun orterun

2008-03-03 Thread Ralph Castain
Hello I don't monitor the user list any more, but a friendly elf sent this along to me. I'm not entirely sure what problem might be causing the behavior you are seeing. Neither mpirun nor any orted should be impacted by IB problems as they aren't MPI processes and thus never interact with IB. Onl

Re: [OMPI users] eager limits?

2008-03-03 Thread Brian W. Barrett
On Mon, 3 Mar 2008, Brock Palen wrote: I was writing a simple code to demonstrate deadlock. I create an array of some number of doubles, when sizeof(double)*SIZE is > btl_sm_eager_limit (on a smp machine) I expect the code to deadlock but not until the buffer size is greater than the eager limi

[OMPI users] eager limits?

2008-03-03 Thread Brock Palen
I was writing a simple code to demonstrate deadlock. I create an array of some number of doubles, when sizeof(double)*SIZE is > btl_sm_eager_limit (on a smp machine) I expect the code to deadlock but not until the buffer size is greater than the eager limit. Now in 1.2.3 sm_eager_limit is

[OMPI users] slurm and all-srun orterun

2008-03-03 Thread Sacerdoti, Federico
Hi, We are migrating to openmpi on our large (~1000 node) cluster, and plan to use it exclusively on a multi-thousand core infiniband cluster in the near future. We had extensive problems with parallel processes not dying after a job crash, which was largely solved by switching to the slurm resour

Re: [OMPI users] ScaLapack and BLACS on Leopard

2008-03-03 Thread Terry Dontje
What kind of system lib errors are you seeing and do you have a stack trace? Note, I was trying something similar with Solaris and 64-bit on a SPARC machine and was seeing segv's inside the MPI Library due to a pointer being passed through an integer (thus dropping the upper 32 bits). Funny t