<[email protected]> writes:

> We've got a job that was suspended via:
>
>     qmod -sj $jobid
>
> that's continuing to run. The job consists of a BASH script, which in
> turn submits other jobs in a loop, sleeping for 30 seconds after each loop.
>
> When I examine the job status on the node where it is executing via:
>     ps -e f | grep $JOBID
>
> I see that the process is sleeping (state "S"), which is not unexpected,
> given the 'sleep 30' in the loop, but not suspended (state "T"):

I've just tried that successfully with sge-8.1.8, i.e. the processes
were stopped. How is the job started (e.g. binary, unix mode, or not)?
What do you see in the trace file in the job's spool directory?

If something has stopped working, feel free to put an issue in the
tracker so that it's less likely to be forgotten, but I can't think of
any changes that are likely to be relevant.
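
For checking the process state on the execution host, something like the following may be less error-prone than grepping the full ps listing. It is only a sketch: is_stopped is a hypothetical helper, not part of SGE, and it assumes a ps that accepts "-o stat=" (procps and the BSD ps both do). It tests one PID at a time, so it would need to be run against each process belonging to the job.

```shell
# Hypothetical helper (not an SGE command): succeed if the given PID
# is currently in the stopped ("T") state, fail otherwise.
is_stopped() {
    # "stat=" prints the state column without a header line;
    # strip padding so the case pattern matches cleanly.
    state=$(ps -o stat= -p "$1" 2>/dev/null | tr -d ' ')
    case "$state" in
        T*) return 0 ;;   # stopped, e.g. by SIGSTOP
        *)  return 1 ;;   # running, sleeping, or no such process
    esac
}
```

Usage would be along the lines of "is_stopped $pid && echo stopped", where $pid is one of the job's processes as found on the node.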
