Ok. FWIW, Pasha and I think that openib has supported "send-to-self"
for a while (we don't know exactly when; but Pasha thinks it is very
old code that we don't check for self in add_procs). But it only
broke recently.
On Jul 29, 2008, at 9:31 AM, George Bosilca wrote:
I ran few tests and the only combination leading to a deadlock is
openib and self. As openib is the only BTL supporting self
communications (except self of course), I guess it interfere with
self in some more or less strange ways. I didn't had the time to dig
deeper yet to see what exactly happens there, I'll schedule this
later today.
george.
On Jul 29, 2008, at 8:52 AM, Pavel Shamis (Pasha) wrote:
Jeff Squyres wrote:
This used to be true, but I think we changed it a while ago
(Pasha: do you remember?) because Mellanox HCAs are capable of
send-to-self (process) and there were no code changes necessary to
enable it. So it allowed a slightly simpler command line. This
was quite a while ago, IIRC.
Yep, Correct.
FYI. In my MTT testing I also see a lot of killed tests.
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Jeff Squyres
Cisco Systems