El Lunes 17 Mayo 2010, Scott Atchley escribió:
> On May 16, 2010, at 1:32 PM, Lydia Heck wrote:
> > When running over gigabit using -mca btl tcp,self,sm the code runs
> > alright, which is good as the largest part of our cluster is over
> > gigabit, and as Gadget-3 scales rather well, the penalty
I don't know if it's the same problem or not (and we haven't tested on
Myrinet), but we have one code which frequently hangs on smallish (64 node)
runs. I unfortunately haven't been able to deep dive into the problem, but the
hang is in a bcast call, where peers are doing sendrecv calls. All b
On May 16, 2010, at 1:32 PM, Lydia Heck wrote:
> When running over gigabit using -mca btl tcp,self,sm the code runs alright,
> which is good as the largest part of our cluster is over gigabit, and as
> Gadget-3 scales rather well, the penalty for running over gigabit is not
> prohibitive.
> We