Just a question on expected behavior of the backfill scheduler. This is an SMP machine if that matters. Scheduler is backfill with no preemption.

I have a number of jobs queued. There are three which matter, ordered by priority. In the current state I have 60 free cores.

job 201 needs 200 cores and will start in 1 hour requiring 24 hours of runtime job 202 needs 250 cores and will start in 5 hours requiring 24 hours of runtime
...
job 300 needs 30 cores and will start in 300 hours requiring 2 hours of runtime

The job completing in 1 hour will free 252 cores.

Clearly, starting job 300 will not impact job 201's start time in any way. Yet it will not start since the time overlaps the expected 1 hour start time of job 201. Is this the expected behavior? I haven't yet checked the source code to verify that this just looks at the trivial impact on the next job but I'd expect the scheduler to be able to look a little deeper than this.

Bill

Reply via email to