Hi Eric, If you use slurmdbd, that usually means you have runaway jobs in the Slurm DB, ie. jobs that are not running anymore (don's show up in squeue), but don't have an end date and/or are still considered running in sacct. Phil Eckert posted a perl script to detect such jobs some time ago: https://groups.google.com/d/msg/slurm-devel/3f1SOGHXwSY/95y1cPwF7M0J
Cheers, -- Kilian
