You can use the REQUEUE partition functionality in slurm to accomplish
this. We do that here. Basically we have a high priority partition
that is what the hardware owners use and then a lower priority partition
that backfills it with the requeue option. If the high priority
partition has idle resources, the lower priority uses it. If the high
priority needs the resources it will requeue the jobs from the lower
priority partition.
-Paul Edmon-
On 7/11/2017 4:29 PM, David Perel wrote:
Hello --
Say on a cluster researcher X has his own reserved partition, XP,
where normally only X can run jobs.
Can SLURM be configured so that if some resource metric(s) (e.g.,
load average, memory usage) on XP nodes is below a given level
for a given time period (such nodes are labelled "XP-L"), then:
- if researcher Y is authorized to use XP, Y's jobs can automatically
use the XP-L nodes; and
- as soon as X submits jobs to XP for which XP-L are needed, Y's
jobs on the XP-L nodes are moved off of those nodes to wherever they
can run, or queued
Thanks.
___________
David Perel
IST Academic and Research Computing Systems (ARCS)
New Jersey Institute of Technology
[email protected]