Yeah. I think it is a problem I introduced. I am trying to track it down now. Probably messed up one of the conversions in ob1.
-Nathan On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote: > I have noticed that I am seeing some tests hang on the trunk. For > example: > > > > $ mpirun --mca btl_tcp_if_include eth0 --host > drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self > --mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix 0 > send > > > > It is not unusual for this test to take several minutes, particularly on > slow networks. > > Please be patient. > > NOTICE: Using max message size: 10485760 > > Progress: [===== > > > > Is anyone else seeing this? (This is really a hang in spite of the > message saying it should take a few minutes) > > > > This started with the changes Nathan did for renaming the descriptor > fields - r32196 through r32202. > > From what I can tell, it looks like it hangs the second time the > rendezvous protocol is used to send the data. > > > > Rolf > > ---------------------------------------------------------------------- > > This email message is for the sole use of the intended recipient(s) and > may contain confidential information. Any unauthorized review, use, > disclosure or distribution is prohibited. If you are not the intended > recipient, please contact the sender by reply email and destroy all copies > of the original message. > > ---------------------------------------------------------------------- > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2014/07/15136.php
pgpghCbh7lgyD.pgp
Description: PGP signature