Hello
Le 16.12.2019 à 19:09, Åke Sandgren a écrit :
On 12/16/19 6:05 PM, Yann Sagon wrote:
My $MODULEPATH contains after loading fosscuda/2019b:
/opt/ebmodules/all/MPI/GCC/8.3.0/OpenMPI/3.1.4:/opt/ebmodules/all/Compiler/GCC/8.3.0:/opt/ebmodules/all/Compiler/GCCcore/8.3.0
For a standard HMNS loading fosscuda/2019b should show (in your case)
/opt/ebmodules/all/MPI/GCC-CUDA/8.3.0/OpenMPI/3.1.4:/opt/ebmodules/all/Compiler/GCC-CUDA/8.3.0:...
So, take a look at your modules/all/Compiler/GCC/8.3.0/CUDA/10.1.243.lua
and verify that it does extend MODULEPATH with
prepend_path("MODULEPATH",
"/opt/ebmodules/all/Compiler/GCC-CUDA/8.3.0-10.1.243")
I don't have this file (at this place).
[sagon@node025 TensorFlow] $ find /opt/ebmodules/ -name \*10.1.243\*.lua
/opt/ebmodules/all/Core/CUDA/10.1.243.lua
And this is the module loaded by fosscuda/2019b when I do module show CUDA.
If not, or if it doesn't exist, you need to make sure you're using
EasyBuild 4.1.0 and then do a rebuild.
eb gcccuda-2019b.eb fossacuda-2019b.eb --rebuild --robot
I did that, and It wasn't working. Then I removed the gcccuda-2019b.eb
and fosscuda-2019b.eb from the repositorypath and rebuild again and this
time it's working!
A previous version of fosscuda/2019b was incorrect for HMNS
I did a diff between the two fosscuda-2019b.eb files and the obvious
difference is that CUDA is the cuda for GCC instead of a plain CUDA in
the old eb file.
Many thanks for your help, it's working fine now!
Best
--
Logo UNIGE Yann Sagon
Référent HPC
Division du système et des technologies de l'information et de la
communication
Université de Genève | 24 rue Général-Dufour
Tél 022 379 77 37 | Bureau 151
www.unige.ch/stic <http://www.unige.ch/stic>