I can’t get qmaster to respond. Memory is no longer an issue, but the queue is
more than 138,000 jobs long and qmaster is not responding to any control
commands. I need to manually delete the master job list.

Am I correct in assuming that if I delete all the subdirectories in the jobs 
folder in spool/qmaster, this will reset the master job list and give me back 
control?
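
To make the question concrete, here is a minimal sketch of what I have in mind. It assumes classic (flat-file) spooling, that qmaster has been stopped first, and it moves the entries aside rather than deleting them so they can be restored. The function name and the example paths are hypothetical, and nothing in this thread confirms that qmaster restarts cleanly afterwards.

```python
import shutil
import time
from pathlib import Path


def set_aside_job_spool(qmaster_spool: Path, backup_root: Path) -> Path:
    """Move every entry under <qmaster_spool>/jobs into a timestamped backup dir.

    Assumption: qmaster is NOT running while this executes. The jobs
    directory itself is left in place (empty) so the spool layout survives.
    """
    jobs_dir = qmaster_spool / "jobs"
    if not jobs_dir.is_dir():
        raise FileNotFoundError(f"no jobs directory at {jobs_dir}")

    # A fresh, timestamped target so repeated runs never overwrite a backup.
    backup_dir = backup_root / time.strftime("jobs-%Y%m%d-%H%M%S")
    backup_dir.mkdir(parents=True, exist_ok=False)

    # Move each subdirectory (or stray file) out of the live spool.
    for entry in sorted(jobs_dir.iterdir()):
        shutil.move(str(entry), str(backup_dir / entry.name))

    return backup_dir


# Example call (hypothetical paths, adjust to the local cell layout):
# set_aside_job_spool(Path("/opt/sge/default/spool/qmaster"),
#                     Path("/root/sge-spool-backup"))
```

Moving instead of deleting keeps the option of putting the job files back if emptying the directory turns out not to be enough.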

Best regards,
Juan Jimenez
System Administrator, BIH HPC Cluster
MDC Berlin / IT-Dept.
Tel.: +49 30 9406 2800

On 27.06.17, 11:12, "William Hay" <[email protected]> wrote:

    On Tue, Jun 27, 2017 at 08:44:30AM +0000, [email protected] wrote:
    > So, if I reinstall using the Berkeley DB spooler, will this mitigate this
    > kind of problem, or will the qmaster still want to commit hara-kiri by
    > trying to load everything into memory from the DB?
    >
    It will still need everything in memory in order to schedule it.  If the
    problem is that you don't have sufficient memory, then BDB probably won't
    help.  If it is just regular performance issues, then BDB may help.
    
    William
    

_______________________________________________
SGE-discuss mailing list
[email protected]
https://arc.liv.ac.uk/mailman/listinfo/sge-discuss
