Hi Yiri,
The requeued job keeps the same job ID, name, priority, qos, etc. The
requeued job submit time is reset to the current time, so you will see
two records in the accounting logs for that job ID with different
submit times. The job also has an environment variable set for it,
SLURM_RESTART_COUNT.
Moe
Quoting Yuri D'Elia <[email protected]>:
Assuming one node fails (and reboots), how is the job requeued?
In more detail: is it requeued under a new id?
Does it inherit the old priority/qos/name and comment?
Apart from perusing the logs/accounting, how do I find out if a
specific job id (that has now completed) has been, in reality, been
requeued at least once?