I have a subordinate queue set up with notification time of 5 minutes, and preempted jobs are terminated (using SIGTERM) after that period. For jobs running in that queue, I've been able to confirm that there is a 5 minute delay between when the notification is sent and when the job is terminated. The idea is to give the job a chance to save state and shut itself down cleanly before being terminated.
The issue that I've been running into is that the job that triggers the preemption begins running when the notification signal is sent. We then end up with both jobs running simultaneously during the notification period. Is there any way to delay that second job so it will not start until the preempted job has either exited on its own or been killed? Any suggestions for how I might configure this differently would be appreciated. Thanks, Jeff _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
