Hi.
I'm currently running Slurm 2.6.3; I was previously running the 2.4 series with no issues. Since switching to 2.6.3, I've noticed that jobs wanting more than 2 nodes never get to run unless nothing else wants the queue, while small jobs will always jump the line and run if enough nodes are free. My slurm.conf has not been changed since 2.1.11, when it was first deployed. I'm using sched/backfill and priority/basic, and the maximum run time on the single queue for this cluster is 48 hours.

As a concrete example, a job requesting 20 nodes for $max_time was submitted on 10/15 and has an estimated start time of 10/19, while 551 single-node jobs requesting $max_time have been submitted and have run or are running in the meantime. The small jobs are actually taking ~14 hours, but the submitter is requesting the full 48.
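For reference, here is a sketch of what I believe are the relevant lines in my slurm.conf (the partition and node names below are placeholders, but the scheduler and limit settings are as deployed):

    SchedulerType=sched/backfill
    PriorityType=priority/basic
    # Single queue for the whole cluster; 48-hour maximum run time
    PartitionName=main Nodes=node[001-NNN] Default=YES MaxTime=48:00:00 State=UP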
How can I tweak my slurm.conf so that, if a large job is at the top of the queue, it blocks everything with an equal or greater run time until it can run? That is the behavior my end users are used to and expect.
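To be concrete, the kind of change I'm imagining is a SchedulerParameters tweak along the lines below, though I'm only guessing that bf_window is the relevant knob (as I understand it, it is the backfill planning window in minutes, and its default of one day is shorter than my 48-hour limit):

    # 2880 minutes = 48 hours, matching the partition MaxTime
    SchedulerParameters=bf_window=2880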
--
Bob Healey
Systems Administrator
Biocomputation and Bioinformatics Constellation and Molecularium
[email protected]
(518) 276-4407
