Hi Eric,

If you use slurmdbd, that usually means you have runaway jobs in the
Slurm DB, ie. jobs that are not running anymore (don's show up in
squeue), but don't have an end date and/or are still considered
running in sacct.
Phil Eckert posted a perl script to detect such jobs some time ago:
https://groups.google.com/d/msg/slurm-devel/3f1SOGHXwSY/95y1cPwF7M0J

Cheers,
-- 
Kilian

Reply via email to