On 03/12/2019 15:50, Loris Bennett wrote:
Hi,
Installing
TensorFlow-1.13.1-fosscuda-2019a-Python-3.7.2.eb
fails with
bazel-out/k8-opt/bin/tensorflow/python/gen_ragged_conversion_ops_py_wrappers_cc: error while loading shared libraries: libcuda.so.1: cannot open shared object file: No such file or directory
An earlier step in the build includes:
cp -f "/trinity/shared/easybuild/software/CUDA/10.1.105-GCC-8.2.0-2.31.1/targets/x86_64-linux/lib/stubs/libcuda.so" "bazel-out/k8-opt/genfiles/external/local_config_cuda/cuda/cuda/lib/libcuda.so"
So it seems to me as if bazel is perhaps failing to add libcuda.so.1 as a link
to libcuda.so.
Does anyone have an idea about how to proceed here?
The libcuda.so in the CUDA installation (under lib/stubs) is only a stub for
the actual libcuda.so & libcuda.so.1, which should be provided through the OS
(because they're tied to the GPU drivers).
On our GPU system they're provided via an RPM:
$ rpm -qf /usr/lib64/libcuda.so.1
nvidia-driver-latest-dkms-cuda-libs-418.87.01-1.el7.x86_64
which seems to come from
https://developer.download.nvidia.com/compute/cuda/repos/rhel7/x86_64/ .
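To check whether a host has the driver-provided libcuda.so.1 (as opposed to only the stub libcuda.so from the CUDA installation), something along these lines might help. This is only a sketch: find_libcuda_so1 is a made-up helper name, and the directories searched are assumed typical locations that may differ on your system.

```shell
# Sketch: print the first libcuda.so.1 found in the given directories.
# find_libcuda_so1 is a hypothetical helper, not part of EasyBuild or CUDA.
find_libcuda_so1() {
    for dir in "$@"; do
        if [ -e "$dir/libcuda.so.1" ]; then
            printf '%s\n' "$dir/libcuda.so.1"
            return 0
        fi
    done
    # Nothing found in any of the directories.
    return 1
}

# Assumed typical locations for the driver libraries; adjust as needed.
find_libcuda_so1 /usr/lib64 /usr/lib/x86_64-linux-gnu /usr/lib \
    || echo "libcuda.so.1 not found in the usual locations"
```

If this only prints the "not found" message on the build host, the NVIDIA driver libraries are probably not installed there, which would be consistent with the error above. Running `ldconfig -p | grep libcuda` is another way to see what the dynamic linker can find.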
regards,
Kenneth
Cheers,
Loris