Hello,
I'm using CUDA-aware OMPIv4.0.3 with UCX to run some apps. Most of them have worked seamlessly, but one breaks and returns the error:

memtype_cache.c:299  UCX  ERROR failed to set UCM memtype event handler: Unsupported operation
--------------------------------------------------------------------------
No components were able to be opened in the pml framework.

This typically means that either no components of this type were
installed, or none of the installed components can be loaded.
Sometimes this means that shared libraries required by these
components are unable to be found/loaded.

  Host:      laghos-2
  Framework: pml
--------------------------------------------------------------------------
[laghos-2:00328] PML ucx cannot be selected 

The full discussion is posted at https://github.com/openucx/ucx/issues/4988. I don't fully understand the internals of OpenMPI and my question is specific to the 'pml' layer. Does it make any difference if the network is either ethernet or IB on how OpenMPI handles the data and, more specifically, memory access?
Thanks.
Arturo
 

Reply via email to