Re: [OMPI users] [EXTERNAL] Shmem errors on Mac OS Catalina

2020-02-07 Thread Jin Tao via users
Hi, Sorry to raise this issue again but now I still receive the following error after a while: posix: file name search - max attempts exceeded.cannot continue with posix. I am compiling and running with the following command: *make && mpiexec --oversubscribe -np 10 main.out* My make file

Re: [OMPI users] openmpi/pmix/ucx

2020-02-07 Thread Ray Muno via users
We're using MLNX_OFED 4.7.3. It supplies UCX 1.7.0. We have OpenMPI 4.0.2 compiled against the Mellanox OFED 4.7.3 provided versions of UCX, KNEM and HCOLL, along with HWLOC 2.1.0 from the OpenMPI site. I mirrored the build to be what Mellanox used to configure OpenMPI in HPC-X 2.5. I have

[OMPI users] openmpi/pmix/ucx

2020-02-07 Thread Michael Di Domenico via users
I haven't compiled openmpi in a while, but I'm in the process of upgrading our cluster. The last time I did this there were specific versions of mpi/pmix/ucx that were all tested and supposed to work together. My understanding was that this was because pmi/ucx was under rapid development and the APIs

[OMPI users] Question about UCX progress throttling

2020-02-07 Thread Joseph Schuchart via users
Today I came across the two MCA parameters osc_ucx_progress_iterations and pml_ucx_progress_iterations in Open MPI. My interpretation of the description is that in a loop such as below, progress in UCX is only triggered every 100 iterations (assuming opal_progress is only called once per

Re: [OMPI users] [EXTERNAL] Shmem errors on Mac OS Catalina

2020-02-07 Thread Jin Tao via users
OK, it seems to be working now if I remove the quotation marks from the config file. Thank you all for your help! I am looking forward to using Open MPI for my work! On Fri, 7 Feb 2020 at 09:42, Jin Tao wrote: > Hi, > > Thank you for the guidance. I rebooted my computer but now the program >
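
For readers following this thread, the reported fix can be sketched as the config fragment below. It assumes the quotation marks in question were the ones around the path on the orte_tmpdir_base line of openmpi-mca-params.conf, which the earlier message in this thread quotes; the preview does not say so explicitly. The path is the placeholder used in that message.

```
# openmpi-mca-params.conf
# Assumption: the value is written without surrounding quotation marks,
# as the follow-up message reports that removing them resolved the error.
orte_tmpdir_base = /Users/myname/Desktop/shared/tmp
```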

Re: [OMPI users] [EXTERNAL] Shmem errors on Mac OS Catalina

2020-02-07 Thread Jin Tao via users
Hi, Thank you for the guidance. I rebooted my computer but now the program fails to compile. I then tried changing the tmp directory by adding the following line to *openmpi-mca-params.conf*: *orte_tmpdir_base = "/Users/myname/Desktop/shared/tmp"* But now I get the following error: *PMIX ERROR: