I dug into this a bit and think the patch wasn't quite complete, so I modified the approach to ensure this race condition gets resolved in every scenario. Hopefully, r32643 takes care of it for you.
On Aug 29, 2014, at 1:08 AM, Gilles Gouaillardet <gilles.gouaillar...@iferc.org> wrote:

> Ralph and all,
>
> The following trivial test hangs
> /* it hangs at least 99% of the time in my environment, 1% is a race
> condition and the program behaves as expected */
>
> mpirun -np 1 --mca btl self /bin/false
>
> same behaviour happen with the following trivial but MPI program :
>
> #include <mpi.h>
>
> int main (int argc, char *argv[]) {
>     MPI_Init(&argc, &argv);
>     MPI_Finalize();
>     return 1;
> }
>
> The attached patch fixes the hang (e.g. the program nicely abort with
> the correct error message)
>
> i did not commit it since i am not confident at all
>
> could you please review it ?
>
> Cheers
>
> Gilles
> <hang.patch>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15751.php