Thanks for the info. The spillover feature would be handy, but I can definitely see the difficulties in coding it. A similar mechanism already exists for partitions, though: you can list multiple partitions and the job will run in whichever one can start it first. Could that be imported into QoS?
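
For reference, the partition mechanism I mean is the comma-separated list accepted by the --partition option; the job runs in whichever listed partition can start it first. The partition names here are just examples:

    sbatch --partition=general,backfill jobscript.sh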

-Paul Edmon-

On 5/21/2014 6:52 PM, [email protected] wrote:

Quoting Paul Edmon <[email protected]>:

We have just started using QoS here and I was curious about a few features which would make our lives easier.

1. Spillover/overflow: Essentially, if you use up one QoS, your jobs would spill over into your next-lower-priority QoS. For instance, if your group's QoS were used up but you still had jobs pending and there were idle cycles, the jobs pending under your high-priority QoS would run under the lower-priority normal QoS instead.

There isn't a great way to do this today. Each job is associated with a single QOS.

One possibility would be to submit one job to each QOS and then whichever job started first would kill the others. A job submit plugin could probably handle the multiple submissions (e.g. if the --qos option has multiple comma-separated names, then submit one job for each QOS). Offhand I'm not sure what would be a good way to identify and purge the extra jobs. Some variation of the "--depend=singleton" logic would probably do the trick.
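
Just as a rough sketch of the manual version of that idea (not the singleton variation; the QOS, script, and job names below are only examples): submit the same script once per QOS under a shared job name, and have the script cancel its still-pending copies as soon as one of them starts. There is an obvious race if two copies start at nearly the same time, so treat this as illustrative only.

    #!/bin/bash
    # job.sh -- whichever copy starts first cancels the copies of this
    # job (same user, same job name) still pending under the other QOS.
    scancel --user="$USER" --name="$SLURM_JOB_NAME" --state=PENDING
    # ... real work follows ...

    # Submit one copy per QOS under a shared job name:
    sbatch --qos=high   --job-name=spill_demo job.sh
    sbatch --qos=normal --job-name=spill_demo job.sh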


2. GRES: Adding limits to a QoS on the number of GPUs or other GRES that can be used.

This has been discussed, but not implemented yet.


3. Requeue/No Requeue: There are some partitions where we want to allow QoS-based requeue and others where we don't. For instance, we have a general queue on which we don't want requeue, but we also have a backfill queue on which we do permit it. If the QoS could kill the backfill jobs first to find space, and just wait on the general queue, that would be great. We haven't experimented with QoS requeue yet, but we may in the future, so this is just looking forward.

You can configure different preemption mechanisms and preempt by either QoS or partition. Take a look at:
http://slurm.schedmd.com/preempt.html
For example, you might enable QoS "high" to requeue jobs in QoS "low", but wait for jobs in QoS "medium". There is no mechanism to configure QoS "high" to preempt jobs by partition.
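
As a rough configuration sketch of that example (exact settings depend on your site, so treat this as illustrative rather than a drop-in config):

    # slurm.conf: preempt based on QOS, requeue preempted jobs
    PreemptType=preempt/qos
    PreemptMode=REQUEUE

    # sacctmgr: let "high" preempt "low"; "medium" is not in the Preempt
    # list, so jobs in "high" simply wait for it.
    sacctmgr modify qos high set Preempt=low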

We were also wondering if jobs asking for GRES could get higher priority on those nodes, so that they can grab the GPUs and leave the CPUs for everyone else. After all, GRES resources are usually scarcer than CPUs, and we would hate for a GRES resource to sit idle just because CPU jobs took up all the slots.


This has also been discussed, but not implemented yet. One option might be to use a job_submit plugin to adjust a job's "nice" value based upon GRES. There is also a partition parameter, MaxCPUsPerNode, that can limit the number of CPUs consumed on each node by jobs in a given partition. You would probably need a separate partition/queue for GPU jobs for that to work well, so it may not work for you.
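
For illustration only (node, partition, and core counts below are made up), the MaxCPUsPerNode idea would look something like this in slurm.conf, with two partitions sharing the GPU nodes:

    # gpu[01-04]: 16 cores and 2 GPUs each (example numbers).
    # Jobs in the "shared" partition may use at most 12 cores per node,
    # leaving 4 cores on each node for jobs in the "gpu" partition.
    PartitionName=shared Nodes=gpu[01-04] MaxCPUsPerNode=12
    PartitionName=gpu    Nodes=gpu[01-04]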

Let me know if you need help pursuing these options.

Moe Jette
