Hi Yair, Yair Yarom <[email protected]> writes:
> Hi, > > > Using slurm 14.03.1-2, with mysql and slurmdbd storage, we've > encountered the error "We have more allocated time than is > possible". Following the suggestions on this thread: > > https://groups.google.com/forum/#!topic/slurm-devel/3f1SOGHXwSY > > we modified the database and marked the phantom jobs and steps' state as > FAILED with appropriate time_end. However, sreport still shows wrong > usage for the times it was active. It appears to be the cluster usage > tables. > > Is there any way to reset or force slurmdbd to recalculate them (or any > other fix for the sreport false usage data)? will deleting the data in > the usage tables work in this case (will it be rebuilt)? > > Is there any utility to check the integrity of the database? e.g. that > there are no "running" steps for non-running jobs, that there are no > dead jobs still running, or that the usage tables are accurate? > > Thanks, > Yair. As I mentioned in the thread you refer to, I have a Perl script which fixes such problems which I can let you have if you send me a mail Regards Loris -- Dr. Loris Bennett (Mr.) ZEDAT, Freie Universität Berlin Email [email protected]
