Thanks for the suggestion -- I filed https://github.com/open-mpi/ompi/issues/7240 to have Intel / Nvidia do this.
On Dec 15, 2019, at 5:01 PM, PADIOLEAU Thomas via devel <devel@lists.open-mpi.org<mailto:devel@lists.open-mpi.org>> wrote: Hello, I recently figured out that when running multi-GPU MPI application (one MPI process to one GPU) on a computer using Intel Omni-Path, you need to do the GPU binding before MPI initialization, according to Intel documentation<https://www.intel.com/content/dam/support/us/en/documents/network-and-i-o/fabric-products/Intel_PSM2_PG_H76473_v13_0.pdf>. If this seems correct to you, could you update your "Running CUDA-aware" web page accordingly ? This would help people to know what is the correct order. Sincerely Thomas -- Jeff Squyres jsquy...@cisco.com<mailto:jsquy...@cisco.com>