Hi Ulf,

Ulf Markwardt <[email protected]> writes:

> Dear all,
>
> I have a problem with a large reservation in a few hours, ~1700
> long-running jobs waiting to start afterwards and my short job (srun -t
> 1 hostname) with priority of 1 that would fill any gap...
>
> "sdiag" always shows a value of about 100 as "Last depth cycle" for
> backfilling. Does that mean that it only looks at the first 100 jobs?
> I thought bf_continue should take care of this, so that the next
> backfilling test starts where the last has finished.
>
> At the moment we have 15.08.6 running with:
> SchedulerParameters=bf_interval=30,bf_max_job_test=2000,bf_window=7200,default_queue_depth=5000,bf_continue,sched_interval=120,defer
> (Some values might be too high for production, but I was desperate to get
> my job running...)

Is your bf_window at least as large as the time limit on the partition in
question? If not, see the info about bf_window on the slurm.conf man
page.

> Can anybody give me a hint on how to change this so that my low priority
> job gets scheduled?
>
> Thanks a lot,
> Ulf
>
> PS. As soon as I give this job a Nice=-200 it starts, but that is not
> the way I want it :-)

Cheers,

Loris

--
Dr. Loris Bennett (Mr.)
ZEDAT, Freie Universität Berlin
Email [email protected]
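
To make the bf_window question concrete, here is a minimal Python sketch that compares the bf_window value from the SchedulerParameters line quoted above with a partition time limit. Both are taken to be in minutes, which is how the slurm.conf man page describes bf_window. The helper names and the 14-day partition limit are hypothetical, chosen only for illustration; the 1440-minute fallback is the default I remember from the man page, so please check it against your version. This is not part of Slurm itself, and in practice you would read the real limit from your own partition definition.

```python
def parse_scheduler_parameters(value):
    """Split a SchedulerParameters string into a dict.

    Items of the form key=number become integers, other key=value items
    stay strings, and bare flags such as bf_continue map to True.
    """
    params = {}
    for item in value.split(","):
        item = item.strip()
        if not item:
            continue
        key, sep, val = item.partition("=")
        if not sep:
            params[key] = True
        elif val.isdigit():
            params[key] = int(val)
        else:
            params[key] = val
    return params


def bf_window_covers(params, max_time_minutes, default_bf_window=1440):
    """Return True if bf_window (minutes) is at least the partition limit.

    default_bf_window is an assumption: the documented default of one day.
    """
    window = params.get("bf_window", default_bf_window)
    return window >= max_time_minutes


# The SchedulerParameters value from the quoted message.
sched = ("bf_interval=30,bf_max_job_test=2000,bf_window=7200,"
         "default_queue_depth=5000,bf_continue,sched_interval=120,defer")
params = parse_scheduler_parameters(sched)

# Hypothetical partition limit of 14 days, expressed in minutes.
partition_max_time = 14 * 24 * 60

if bf_window_covers(params, partition_max_time):
    print("bf_window covers the partition time limit")
else:
    print("bf_window is shorter than the partition time limit")
```

With bf_window=7200 the backfill scheduler looks five days ahead, so any partition whose limit exceeds five days would fail this check.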
