I can’t get qmaster to respond. Memory is no longer an issue but the queue is 138,000+ jobs long and it’s not responding to any control commands. I need to manually delete the master job list.
Am I correct in assuming that if I delete all the subdirectories in the jobs folder in spool/qmaster, this will reset the master job list and give me back control? Mfg, Juan Jimenez System Administrator, BIH HPC Cluster MDC Berlin / IT-Dept. Tel.: +49 30 9406 2800 On 27.06.17, 11:12, "William Hay" <[email protected]> wrote: On Tue, Jun 27, 2017 at 08:44:30AM +0000, [email protected] wrote: > So, if I reinstall using the Berkeley DB spooler, will this mitigate this kind of problem, or will the qmaster still want to commit hara-kiri by trying to load everything into memory from the DB? > It will still need everything in memory in order to schedule it. If the problem is that you don't have sufficient memory then probably not. If it is just regular performance issues then BDB may help. William _______________________________________________ SGE-discuss mailing list [email protected] https://arc.liv.ac.uk/mailman/listinfo/sge-discuss
