Re: [OMPI users] Open MPI installation problem

2019-01-25 Thread Shrader, David Lee via users
The error about not finding shared libraries can be fixed by adding the proper path to LD_LIBRARY_PATH, which Ralph has already mentioned. If Open MPI is installed in $HOME/openmpi, LD_LIBRARY_PATH needs to be set like this: export LD_LIBRARY_PATH=$HOME/openmpi/lib:$LD_LIBRARY_PATH The mpi

Re: [OMPI users] [EXTERNAL] strange pml error

2021-11-02 Thread Shrader, David Lee via users
As a workaround for now, I have found that setting OMPI_MCA_pml=ucx seems to get around this issue. I'm not sure why this works, but perhaps there is different initialization that happens such that the offending device search problem doesn't occur? Thanks, David

Re: [OMPI users] [EXTERNAL] strange pml error

2021-11-02 Thread Shrader, David Lee via users
I too have been getting this using 4.1.1, but not with the master nightly tarballs from mid-October. I still have it on my to-do list to open a github issue. The problem seems to come from device detection in the ucx pml: on some ranks, it fails to find a device and thus the ucx pml

Re: [OMPI users] [EXTERNAL] strange pml error

2021-11-03 Thread Shrader, David Lee via users
I opened an issue, and a fix looks like it went in to the 4.1.2 release branch already. I tested the patch on my 4.1.1 release tarball, and the error no longer occurs. Here is the link to the issue: https://github.com/open-mpi/ompi/issues/9617 Thanks, David