Hi Collin,

On 10.12.25 at 15:36, 'Collin Strassburger' via Open MPI users wrote:
/opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so (libcuda.so.1: cannot open shared object file: No such file or directory)

Is it only the second host that cannot find libcuda.so? Do you have the library installed on both nodes?

What is the output for:

mpirun --host node1,node2 ldd /opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so
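
If the ldd output is long, a small filter can reduce it to the unresolved dependencies. This is only a sketch: the function name missing_libs is made up for illustration, and it assumes the usual glibc ldd format in which a missing library is printed as "name => not found".

```shell
# Print the name of every library that ldd reports as "not found".
# Reads ldd output on stdin; prints nothing when all dependencies resolve.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}
```

On each node, something like "ldd /opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so | missing_libs" should then list only the libraries the dynamic loader cannot find there.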

- Joachim
--
Dr. rer. nat. Joachim Jenke
Deputy Group Lead

IT Center
Group: HPC - Parallelism, Runtime Analysis & Machine Learning
Division: Computational Science and Engineering
RWTH Aachen University
Seffenter Weg 23
D 52074  Aachen (Germany)
Tel: +49 241 80- 24765
Fax: +49 241 80-624765
[email protected]
www.itc.rwth-aachen.de
