Re: [slurm-users] Using cgroups to hide GPUs on a shared controller/node

2019-05-21 Thread John Hearns
Sorry Dave, nothing handy. However look at this writeup from You Know Who: https://pbspro.atlassian.net/wiki/spaces/PD/pages/11599882/PP-325+Support+Cgroups Look at the devices: Subsystem You will need the major device number for the Nvidia devices, for example on my system: crw-rw-rw- 1 root

Re: [slurm-users] final stages of cloud infrastructure set up

2019-05-21 Thread nathan norton
Unfortunately that didn't work, However i modified my slurm.conf to lie and say i had 16 cpu on 1 thread and now everything is working fine. One issue with CLOUD state machines is that is when i run scontrol show nodes they don't show up, is there a way i can get their info when they are

[slurm-users] Is the generic plugin still available?

2019-05-21 Thread Rikimaru Honjo
Hi, I'm trying to use BurstBuffer with the generic plugin according to the latest BurstBufferGuide[1]. But I have failed to do it. The generic plugin was probably loaded correctly. But, it seemed that burst_buffer.conf was not applied. Because script files specified to BurstBuffer operations