Re: [OMPI users] MPI failing on Infiniband (queue pair error)

2019-05-09 Thread Jeff Squyres (jsquyres) via users
You might want to try two things: 1. Upgrade to Open MPI v4.0.1. 2. Use the UCX PML instead of the openib BTL. You may need to download/install UCX first. Then configure Open MPI: ./configure --with-ucx --without-verbs --enable-mca-no-build=btl-uct ... This will build the UCX PML, and that

[OMPI users] MPI failing on Infiniband (queue pair error)

2019-05-09 Thread Koutsoukos Dimitrios via users
Hi all, I am trying to run MPI on a distributed mode. The cluster setup is an 8-machine cluster with Debian 8 (Jessie), Intel Xeon E5-2609 2.40 GHz and Mellanox-QDR HCA Infiniband. My MPI version is 3.0.4. I can successfully run a simple command on all nodes that doesn’t use the infiniband but