Hello everyone,

I’m having trouble getting SLURM to choose the 2nd GPU on this node.

gres.conf:

NodeName=node21 Name=gpu Type=gtx680 File=/dev/nvidia0
NodeName=node21 Name=gpu Type=gtx580 File=/dev/nvidia1

slurm.conf:

NodeName=node21 Gres=gpu:gtx680:1,gpu:gtx580:1

If I then run srun --gres=gpu:gtx580 I get CUDA_VISIBLE_DEVICES=0
If I also run srun --gres=gpu:gtx680 I get CUDA_VISIBLE_DEVICES=0

I also get some errors in the slurmctld log file when I specify 
--gres=gpu:gtx580, which I don’t understand:

[2016-06-21T13:41:27.968] error: gres/gpu: job 228 dealloc node node21 topo 
gres count underflow (0 1)
[2016-06-21T13:41:27.968] error: gres/gpu: job 228 dealloc node node21 type 
gtx680 gres count underflow (0 1)


Can anyone please offer some advice?

Many thanks,

Tom Deakin

Reply via email to