I have a couple nodes with 4xK80 GPU's in them (nvidia0-7).

Is there a way to either request peer-to-peer GPU's, or force allocation to
2 GPU's at a time?  We'd prefer for the former (run when peer-to-peer is
available, unless you don't care) so we can fit more users onto the
machine.  However, ensuring the peer-to-peer codes get the proper
allocation is more important.


User 1 - needs a full K80 with peer-to-peer
User 2 - needs a single GPU
User 3 - needs a single GPU
User 4 - Needs 2 full K80

I.e
0,1 - User 1
2    - User 2
3    - User 3
4,5,6,7 - User 4

Or

0,1 - User 1
2,3  - User 2
4,5   - User 3
QUEUED - User 4

I tried this gres configuration, but it didn't do what I expected.

Name=gpu File=/dev/nvidia[0-1] Count=2 CPUs=0-9
Name=gpu File=/dev/nvidia[2-3] Count=2 CPUs=0-9
Name=gpu File=/dev/nvidia[4-5] Count=2 CPUs=10-19
Name=gpu File=/dev/nvidia[6-7] Count=2 CPUs=10-19

Reply via email to