You can use the REQUEUE partition functionality in slurm to accomplish this. We do that here. Basically we have a high priority partition that is what the hardware owners use and then a lower priority partition that backfills it with the requeue option. If the high priority partition has idle resources, the lower priority uses it. If the high priority needs the resources it will requeue the jobs from the lower priority partition.

-Paul Edmon-


On 7/11/2017 4:29 PM, David Perel wrote:
Hello --

  Say on a cluster researcher X has his own reserved partition, XP,
  where normally only X can run jobs.

  Can SLURM be configured so that if some resource metric(s) (e.g.,
  load average, memory usage) on XP nodes is below a given level
  for a given time period (such nodes are labelled "XP-L"), then:

   - if researcher Y is authorized to use XP, Y's jobs can automatically
     use the XP-L nodes; and

   - as soon as X submits jobs to XP for which XP-L are needed, Y's
     jobs on the XP-L nodes are moved off of those nodes to wherever they
     can run, or queued

  Thanks.

___________
David Perel
IST Academic and Research Computing Systems (ARCS)
New Jersey Institute of Technology
[email protected]

Reply via email to