Hi Collin,

On 10.12.25 at 15:36, 'Collin Strassburger' via Open MPI users wrote:
> /opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so (libcuda.so.1: cannot open shared object file: No such file or directory)
Is it only the second host that cannot find libcuda.so? Do you have the library installed on both nodes?
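To narrow this down on each node, a small filter over the ldd output can list only the libraries the loader cannot resolve. This is just a sketch: the helper name `missing_libs` is made up for illustration, and it assumes the usual glibc ldd output format, where unresolved entries are printed as `name => not found`.

```shell
# Hypothetical helper (not part of Open MPI or HPC-X): reads ldd output
# on stdin and prints the name of every dependency marked "not found".
missing_libs() {
    awk '/not found/ { print $1 }'
}

# Intended use, run locally on each node:
#   ldd /opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so | missing_libs
```

If this prints libcuda.so.1 on one node only, the driver library is missing there or is not on that node's loader search path.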
What is the output for:

mpirun --host node1,node2 ldd /opt/hpcx/ucc/lib/ucc/libucc_tl_cuda.so

- Joachim

-- 
Dr. rer. nat. Joachim Jenke
Deputy Group Lead
IT Center
Group: HPC - Parallelism, Runtime Analysis & Machine Learning
Division: Computational Science and Engineering
RWTH Aachen University
Seffenter Weg 23
D 52074 Aachen (Germany)
Tel: +49 241 80-24765
Fax: +49 241 80-624765
[email protected]
www.itc.rwth-aachen.de
