Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Ben Menadue
Hi All, This looks very much like what I reported a couple of weeks ago with Rmpi and doMPI — the trace looks the same. But as far as I could see, doMPI does exactly what simple_spawn.c does — use MPI_Comm_spawn to create the workers and then MPI_Comm_disconnect them when you call

Re: [OMPI users] need help installing mpi4py on openmpi

2018-06-04 Thread Konstantinos Konstantinidis
Here are some instructions I have put together. I am using Python 2 and Open MPI 2.1.2 so I changed the commands to work for Python 3 and I tested them. Hope it helps. Regards, Kostas On Sun, Jun 3, 2018 at 1:56 PM, Neil k8it wrote: > thanks, to all on this list for getting me this far. > my

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
Yes, that does sound like a bug - the #connects must equal the #disconnects. > On Jun 4, 2018, at 1:17 PM, marcin.krotkiewski > wrote: > > huh. This code also runs, but it also only displays 4 connect / disconnect > messages. I should add that the test R script shows 4 connect, but 8 >
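The rule above (the number of connects must equal the number of disconnects) can be checked mechanically against verbose output. The following is only a rough sketch: the line layout `[host:PID] ... connect|disconnect` and the sample lines are assumptions made up for illustration, not real Open MPI or PMIx output, and the helper names `count_ops` and `unbalanced` are hypothetical.

```python
import re
from collections import Counter

# ASSUMPTION: each log line starts with "[host:PID]" and contains the word
# "connect" or "disconnect". Adjust the pattern to the real output.
LINE_RE = re.compile(
    r"^\[(?P<host>[^:\]]+):(?P<pid>\d+)\].*?\b(?P<op>connect|disconnect)\b"
)


def count_ops(lines):
    """Count connect and disconnect messages per PID."""
    counts = {}
    for line in lines:
        m = LINE_RE.search(line)
        if m is None:
            continue  # ignore lines that do not match the assumed layout
        per_pid = counts.setdefault(m.group("pid"), Counter())
        per_pid[m.group("op")] += 1
    return counts


def unbalanced(counts):
    """Return {pid: (connects, disconnects)} for PIDs whose counts differ."""
    return {
        pid: (c["connect"], c["disconnect"])
        for pid, c in counts.items()
        if c["connect"] != c["disconnect"]
    }


# Hypothetical sample lines, invented for this sketch.
sample = [
    "[node1:101] ext2x:client connect",
    "[node1:101] ext2x:client disconnect",
    "[node1:101] ext2x:client disconnect",
    "[node1:102] ext2x:client connect",
    "[node1:102] ext2x:client disconnect",
]

if __name__ == "__main__":
    print(unbalanced(count_ops(sample)))
```

Any PID reported by `unbalanced` has called disconnect more or fewer times than connect, which is the mismatch described in this thread.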

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Bennet Fauber
Marcin, If you are interested, I can send you the R examples I use to test things offline. -- bennet On Mon, Jun 4, 2018 at 4:17 PM, marcin.krotkiewski wrote: > huh. This code also runs, but it also only displays 4 connect / disconnect > messages. I should add that the test R script shows 4

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread marcin.krotkiewski
huh. This code also runs, but it also only displays 4 connect / disconnect messages. I should add that the test R script shows 4 connect, but 8 disconnect messages. Looks like a bug to me, but where? I guess we will try to contact R forums and ask there. Bennet: I tried to use doMPI +

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
It might call disconnect more than once if it creates multiple communicators. Here’s another test case for that behavior: [attachment: intercomm_create.c] > On Jun 4, 2018, at 7:08 AM, Bennet Fauber wrote: > > Just out of curiosity, but would using Rmpi and/or doMPI help in any

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Bennet Fauber
Just out of curiosity, but would using Rmpi and/or doMPI help in any way? -- bennet On Mon, Jun 4, 2018 at 10:00 AM, marcin.krotkiewski wrote: > Thanks, Ralph! > > Your code finishes normally, I guess then the reason might be lying in R. > Running the R code with -mca pmix_base_verbose 1 i see

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread marcin.krotkiewski
Thanks, Ralph! Your code finishes normally, so I guess the cause may lie in R. Running the R code with -mca pmix_base_verbose 1, I see that each rank calls ext2x:client disconnect twice (each PID prints the line twice) [...]     3 slaves are spawned successfully. 0 failed.

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
Try running the attached example dynamic code - if that works, then it likely is something to do with how R operates. [attachment: simple_spawn.c] > On Jun 4, 2018, at 3:43 AM, marcin.krotkiewski > wrote: > > Hi, > > I have some problems running R + Rmpi with OpenMPI 3.1.0 +

[OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread marcin.krotkiewski
Hi, I have some problems running R + Rmpi with OpenMPI 3.1.0 + PMIx 2.1.1. A simple R script, which starts a few tasks, hangs at the end on disconnect. Here is the script: library(parallel) numWorkers <- as.numeric(Sys.getenv("SLURM_NTASKS")) - 1 myCluster <- makeCluster(numWorkers, type =