I have a couple nodes with 4xK80 GPU's in them (nvidia0-7). Is there a way to either request peer-to-peer GPU's, or force allocation to 2 GPU's at a time? We'd prefer for the former (run when peer-to-peer is available, unless you don't care) so we can fit more users onto the machine. However, ensuring the peer-to-peer codes get the proper allocation is more important.
User 1 - needs a full K80 with peer-to-peer User 2 - needs a single GPU User 3 - needs a single GPU User 4 - Needs 2 full K80 I.e 0,1 - User 1 2 - User 2 3 - User 3 4,5,6,7 - User 4 Or 0,1 - User 1 2,3 - User 2 4,5 - User 3 QUEUED - User 4 I tried this gres configuration, but it didn't do what I expected. Name=gpu File=/dev/nvidia[0-1] Count=2 CPUs=0-9 Name=gpu File=/dev/nvidia[2-3] Count=2 CPUs=0-9 Name=gpu File=/dev/nvidia[4-5] Count=2 CPUs=10-19 Name=gpu File=/dev/nvidia[6-7] Count=2 CPUs=10-19