Hi Huette,
 
If you mean a node equipped with NVIDIA GPU(s),
 
could you have a look at this doc  https://xcat-docs.readthedocs.io/en/latest/advanced/gpu/index.html
 
Hope these steps can help you.
 
 
Best Regards
--------------------------------------------------
Yuan Bai (白媛)

CSTL HPC System Management Development
Tel:86-10-82451401
E-mail: by...@cn.ibm.com
Address: IBM ZGC Campus. Ring Building 28,
ZhongGuanCun Software Park,No.8 Dong Bei Wang West Road, Haidian District,
Beijing P.R.China 100193

IBM环宇大厦
北京市海淀区东北旺西路8号,中关村软件园28号楼
邮编:100193
 
 
----- Original message -----
From: "Heckes Frank (CI/OSB4)" <frank.hec...@de.bosch.com>
To: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Cc:
Subject: Re: [xcat-user] nvidia driver on stateless cluster
Date: Fri, Nov 16, 2018 9:28 PM
 

Hello,

 

I suppose you mean a node equipped with NVIDIA GPU(s).

 

There’s one option I currently use to install the driver in image of a rhel/centos node.

 

On a node with the kernel-devel RPM of the target node installed (might be MN or a build host of sorts),
run the downloaded driver:

 

./NVIDIA-Linux-x86_64-390.87.run --add-this-kernel 

 

The node don’t have to be the target node.
This will create a self extracting file customized with the kernel running on your target node. ./NVIDIA-Linux-x86_64-390.87-custom.run .
In case the kernel isn’t running on the ‘build’ node you  can specify the kernel version and src dir via command-line

Options (see –advanced-options output)

Now can start this version from a postscript. The file might be in a network FS share or inside the image and deleted afterwards by running:

 

NVIDIA-Linux-x86_64-390.87-custom.run –x; ./nvidia-install -s

You need to blacklist the noveau in the diskless boot before.

There’s another possibility to use dkms with the nvidia installer. You’d need to chroot (and bind /dev/, /proc/, sys) manually and run the installer with –dkms option.

 

Mit freundlichen Grüßen / Best regards

Frank Heckes

CI Operations - Server Services Sun Solaris, Linux (CI/OSB4)

frank.hec...@de.bosch.com


Von: Huette, Antoine <antoine.hue...@bechtle.com>
Gesendet: Freitag, 16. November 2018 12:36
An: xCAT Users Mailing list <xcat-user@lists.sourceforge.net>
Betreff: [xcat-user] nvidia driver on stateless cluster

 

Hello,

 

On a stateless CentOS 7.5 cluster with Quadro GPUs, I need to install the Nvidia driver. I’m using the runfile downloaded from the Nvidia website.

What is the suggested procedure ? Is it better to install the driver in the osimage, or should I make the installer run when the nodes start ?

 

The problem I see with the first option is the fact that the driver checks if a GPU is present in the system, so I’m not sure if this method can work.

 

The problem with the second method is that, after trying it, it’s very difficult to have a working X server with a Gnome desktop. The driver installer needs the node to be in runlevel 3 (multi-user.target) but once it is installed, I need to switch to runlevel 5 (graphical.target) which almost never works. So far the only way I’ve found is by installing the driver manually on a freshly booted node, run nvidia-xconfig to fill the Xorg.conf file, and then restarting the gnome services.

 

Any help on this subject would be much appreciated ! 😊

 

 

Best regards,

 

Antoine Huette
HPC Engineer

antoine.hue...@bechtle.com | 03.67.07.97.37/07.72.31.82.12 |  bechtle.fr |

 

Bechtlehttp://www.bechtle-contenu.fr/signature/facebook.png   http://www.bechtle-contenu.fr/signature/twitter.png   http://www.bechtle-contenu.fr/signature/linkedin.png   http://www.bechtle-contenu.fr/signature/youtube.png   Résultat de recherche d'images pour "instagram logo" 

 

_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user
 

_______________________________________________
xCAT-user mailing list
xCAT-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xcat-user

Reply via email to