barry-jin commented on issue #20494: URL: https://github.com/apache/incubator-mxnet/issues/20494#issuecomment-897179376
Looks like the CentOS tests are still failing: https://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Fcentos-gpu/detail/PR-20512/4/pipeline

I think the root cause is that libcuda.so is linked to `/usr/local/cuda/targets/x86_64-linux/lib/stubs/libcuda.so`, which should only be used at build time, but mxnet is now trying to dlopen it at runtime.

I cannot reproduce it because in my environment libcuda.so is linked to the libcuda.so used at runtime:

```
[root@1dc28a587f4f build]# ldconfig -p | grep cuda
...
	libcuda.so.1 (libc6,x86-64) => /lib64/libcuda.so.1
	libcuda.so (libc6,x86-64) => /lib64/libcuda.so
...
```
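To make the same check repeatable on a CI image, here is a rough sketch that scans `ldconfig -p` output for libcuda entries resolving into a `stubs` directory. The helper names `parse_ldconfig` and `find_stub_entries` are hypothetical, and the `/stubs/` substring test is an assumption based on the path quoted above, not something taken from the mxnet build scripts.

```python
import re

# Matches one entry line of `ldconfig -p` output, for example:
#   libcuda.so.1 (libc6,x86-64) => /lib64/libcuda.so.1
_ENTRY = re.compile(r"^\s*(\S+)\s+\(([^)]*)\)\s+=>\s+(\S+)\s*$")


def parse_ldconfig(text):
    """Return (soname, path) pairs parsed from `ldconfig -p` output.

    Lines that are not cache entries (header, prompts, "...") are skipped.
    """
    entries = []
    for line in text.splitlines():
        match = _ENTRY.match(line)
        if match:
            entries.append((match.group(1), match.group(3)))
    return entries


def find_stub_entries(text, prefix="libcuda.so"):
    """Return entries for `prefix` whose resolved path is inside a stubs directory.

    Assumption: a stub library is recognizable by "/stubs/" in its path.
    """
    return [
        (name, path)
        for name, path in parse_ldconfig(text)
        if name.startswith(prefix) and "/stubs/" in path
    ]
```

Passing the text printed by `ldconfig -p` to `find_stub_entries` should give an empty list for an environment like the one shown above, and a non-empty list if the cache points libcuda.so at the build-time stub. It only inspects the linker cache, so it would not catch a stub picked up through `LD_LIBRARY_PATH` or an rpath.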
