Hi Ralph,
Thanks for that. That would also explain why it works with OMPI 1.10.7. In
which case, I’ll just suggest they continue using 1.10.7 for now.
I just went back over the doMPI R code, and it looks like it’s using
MPI_Comm_spawn to create it’s “cluster” of MPI worker processes but then
Comm_connect and Comm_disconnect are both broken in OMPI v2.0 and above,
including OMPI master - the precise reasons differ across the various releases.
From what I can tell, the problem is in the OMPI side (as opposed to PMIx).
I’ll try to file a few issues (since the problem is different in
Hi,
Moving this over to the devel list... I’m not sure if it's is a problem with
PMIx or with OMPI’s integration with that. It looks like wait_cbfunc callback
enqueued as part of the PMIX_PTL_SEND_RECV at pmix_client_connect.c:329 is
never called, and so the main thread is never woken from the