Hi everyone. I'm still testing slurm 2.4 (taken from git today), but this is a 
pattern I've noticed at least since 4-5 months.

I'm scheduling 900 jobs on a partition, using multifactor priorities and 
accounting.

Sometimes (after a few days usually), I noticed that freshly submitted jobs all 
have priority "1":

  JOBID       NAME ST       TIME PRIOR
 459128 MERLQ_2129 PD       0:00     1
 459127 MERLQ_2129 PD       0:00     1
 459126 MERLQ_2129 PD       0:00     1
 459125 MERLQ_2129 PD       0:00     1

Let's pick the first one:

# sprio  -j 459128
Unable to find jobs matching user/id(s) specified

But how is that even possible?

# sacct -j 459128
       JobID    JobName  Partition    Account  AllocCPUS      State ExitCode 
------------ ---------- ---------- ---------- ---------- ---------- -------- 
459128       MERLQ_212+      batch    default          0    PENDING      0:0 

Any clue about what might be causing this?
Older jobs do have a priority and can be queried using sprio.

Reply via email to