Just released a new version of the plugin. Our cluster has been upgraded to
21.08.6 and the cgroups structure is different. Fixed in latest release:
* Tested on 21.08 and 20.11
Regards
> On 4 Apr 2022, at 09:20, Bas van der Vlies wrote:
>
> We have the exact same request for our GPUS that
Thank you all for the help!
The plugin seems to be thing I'm looking for.
I'll try to test it with a spare server/GPUs.
Thank again!
--
Kamil Wilczek
W dniu 04.04.2022 o 09:20, Bas van der Vlies pisze:
We have the exact same request for our GPUS that are not A100 and we
have developed a lua
We have the exact same request for our GPUS that are not A100 and we
have developed a lua plugin for our needs (The new slurm version will
also allow the 22.XX). Bu tfor earlier version:
* https://github.com/basvandervlies/surf_slurm_mps
On 03/04/2022 23:19, Kamil Wilczek wrote:
Hello!
I
Eric F. Alemany wrote:
> Another solution would be the vNVIDIA GPU
> (Virtual GPU manager software).
> You can share GPU among VM’s
You can really *share* one, not just delegate one GPU to one VM?
Another solution would be the vNVIDIA GPU
(Virtual GPU manager software).
You can share GPU among VM’s
._
Eric F. Alemany
System Administrator for Research
EXO - Extended Operations
Stanford
Someone else may see another option, but NVIDIA MIG seems like the
straightforward option. That would require both a Slurm upgrade and the
purchase of MIG-capable cards.
https://slurm.schedmd.com/gres.html#MIG_Management
Would be able to host 7 users per A100 card, IIRC.
On Apr 3, 2022, at
Hello!
I am an administrator of a GPU cluster (Slurm version 19.05.5).
Could someone help me a little bit and explain if a single
GPU can be shared between multiple users? My experience and
documentation tells me that it is not possible. But even after
some time Slurm is still a beast to me and