Hi,
On 04.05.2012 at 09:52, Winkler, Ursula ([email protected]) wrote:
> I have a problem with SGE: it looks like (though I'm really not sure)
> running jobs are killed when other jobs are submitted that should otherwise
> wait in the queue because no free cores are available. It happens repeatedly,
> which raises suspicion. Unfortunately, the error in the job log files only
> says that the jobs terminated badly. `qacct -j <job-no>` reports that the exit
> status of the jobs is 7. I could not find out what this error code means.
> Does anybody know?
Well, can you tell us more about your setup? In principle it's possible, for
example, to define a suspend method that kills the job on its own.
Exit status 7 corresponds to SIGBUS, which is signal 7 on Linux (`kill -l`
lists them all), and this might even point to a hardware error.
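
If you want to check the mapping on the execution host itself, here is a
minimal sketch in Python. It assumes the exit status is either a raw signal
number or follows the common shell convention of 128 + signal number; a plain
exit code 7 returned by the program itself would look the same, so treat the
result as a hint only. The function name `describe_exit_status` is made up for
this illustration and is not part of SGE.

```python
import signal


def describe_exit_status(status: int) -> str:
    """Return a hedged guess at which signal an exit status may refer to."""
    # Assumption: values above 128 follow the "128 + signal number" convention;
    # anything else is treated as a raw signal number.
    signum = status - 128 if status > 128 else status
    try:
        # signal.Signals raises ValueError for numbers that are not signals
        # on the current platform.
        name = signal.Signals(signum).name
    except ValueError:
        return f"exit status {status}: no matching signal on this platform"
    return f"exit status {status}: possibly {name} (signal {signum})"


if __name__ == "__main__":
    # 7 is the exit status reported by qacct in the original question.
    print(describe_exit_status(7))
```

Signal numbers differ between platforms, so run this on the same kind of
machine that executed the job.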
-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users