Re: [slurm-users] Sharing a GPU

2022-04-13 Thread Bas van der Vlies
Just released a new version of the plugin. Our cluster has been upgraded to 21.08.6 and the cgroups structure is different. Fixed in latest release: * Tested on 21.08 and 20.11 Regards > On 4 Apr 2022, at 09:20, Bas van der Vlies wrote: > > We have the exact same request for our GPUS that

Re: [slurm-users] Sharing a GPU

2022-04-05 Thread Kamil Wilczek
Thank you all for the help! The plugin seems to be thing I'm looking for. I'll try to test it with a spare server/GPUs. Thank again! -- Kamil Wilczek W dniu 04.04.2022 o 09:20, Bas van der Vlies pisze: We have the exact same request for our GPUS that are not A100 and we have developed a lua

Re: [slurm-users] Sharing a GPU

2022-04-04 Thread Bas van der Vlies
We have the exact same request for our GPUS that are not A100 and we have developed a lua plugin for our needs (The new slurm version will also allow the 22.XX). Bu tfor earlier version: * https://github.com/basvandervlies/surf_slurm_mps On 03/04/2022 23:19, Kamil Wilczek wrote: Hello! I

Re: [slurm-users] Sharing a GPU

2022-04-03 Thread Gerhard Strangar
Eric F. Alemany wrote: > Another solution would be the vNVIDIA GPU > (Virtual GPU manager software). > You can share GPU among VM’s You can really *share* one, not just delegate one GPU to one VM?

Re: [slurm-users] Sharing a GPU

2022-04-03 Thread Eric F. Alemany
Another solution would be the vNVIDIA GPU (Virtual GPU manager software). You can share GPU among VM’s ._ Eric F. Alemany System Administrator for Research EXO - Extended Operations Stanford

Re: [slurm-users] Sharing a GPU

2022-04-03 Thread Renfro, Michael
Someone else may see another option, but NVIDIA MIG seems like the straightforward option. That would require both a Slurm upgrade and the purchase of MIG-capable cards. https://slurm.schedmd.com/gres.html#MIG_Management Would be able to host 7 users per A100 card, IIRC. On Apr 3, 2022, at

[slurm-users] Sharing a GPU

2022-04-03 Thread Kamil Wilczek
Hello! I am an administrator of a GPU cluster (Slurm version 19.05.5). Could someone help me a little bit and explain if a single GPU can be shared between multiple users? My experience and documentation tells me that it is not possible. But even after some time Slurm is still a beast to me and