Re: [OMPI users] gadget-3 locks up using openmpi and infiniband (or myrinet)

2010-05-17 Thread Jaime Perea
El Lunes 17 Mayo 2010, Scott Atchley escribió: > On May 16, 2010, at 1:32 PM, Lydia Heck wrote: > > When running over gigabit using -mca btl tcp,self,sm the code runs > > alright, which is good as the largest part of our cluster is over > > gigabit, and as Gadget-3 scales rather well, the penalty

Re: [OMPI users] gadget-3 locks up using openmpi and infiniband (or myrinet)

2010-05-17 Thread Barrett, Brian W
I don't know if it's the same problem or not (and we haven't tested on Myrinet), but we have one code which frequently hangs on smallish (64 node) runs. I unfortunately haven't been able to deep dive into the problem, but the hang is in a bcast call, where peers are doing sendrecv calls. All b

Re: [OMPI users] gadget-3 locks up using openmpi and infiniband (or myrinet)

2010-05-16 Thread Scott Atchley
On May 16, 2010, at 1:32 PM, Lydia Heck wrote: > When running over gigabit using -mca btl tcp,self,sm the code runs alright, > which is good as the largest part of our cluster is over gigabit, and as > Gadget-3 scales rather well, the penalty for running over gigabit is not > prohibitive. > We