Re: [OMPI devel] [OMPI users] 3.x - hang in MPI_Comm_disconnect

2018-05-21 Thread Ben Menadue
Hi Ralph, Thanks for that. That would also explain why it works with OMPI 1.10.7. In which case, I’ll just suggest they continue using 1.10.7 for now. I just went back over the doMPI R code, and it looks like it’s using MPI_Comm_spawn to create it’s “cluster” of MPI worker processes but then

Re: [OMPI devel] [OMPI users] 3.x - hang in MPI_Comm_disconnect

2018-05-21 Thread r...@open-mpi.org
Comm_connect and Comm_disconnect are both broken in OMPI v2.0 and above, including OMPI master - the precise reasons differ across the various releases. From what I can tell, the problem is in the OMPI side (as opposed to PMIx). I’ll try to file a few issues (since the problem is different in

Re: [OMPI devel] [OMPI users] 3.x - hang in MPI_Comm_disconnect

2018-05-21 Thread Ben Menadue
Hi, Moving this over to the devel list... I’m not sure if it's is a problem with PMIx or with OMPI’s integration with that. It looks like wait_cbfunc callback enqueued as part of the PMIX_PTL_SEND_RECV at pmix_client_connect.c:329 is never called, and so the main thread is never woken from the