Hi all.

I google this issue but did not see much help on the subject.

I have several queues with hard wall clock limits like this one:

# qconf -sq queue  | grep h_rt
h_rt                  96:00:00

I am running Son of Grid engine 8.1.2 and many jobs run past the hard wall 
clock limit and continue to run.

Look at GE qmaster logs, I see dozens and dozens of these entries:

    10/30/2012 11:23:10|schedu|hpc|W|job 13179.1 should have finished since 
42318s


These entries correspond to the running jobs that should have ended 96 hours 
ago, but they keep on running.

Why is GE not killing these jobs correctly when they run past the 96 hour limit 
but yet complains they should have ended?






_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to