Hi all, I'm using SGE 6.2u5. I built a tightly integrated parallel environment for my application, using "qrsh -inherit". Everything works fine, but at the end of every job using the PE, there is a huge time between the moment when the PE script returns and the moment when the parent qsub returns (approx 2 minutes).
The only case were it returns fast is when I send a SIGINT to the parent qsub. In every other configuration, there is this delay. This happens whatever the result of the PE script is, or whether qrsh processes are cleanly shutdown before returning or not. My PE does not have any stop_proc_args, my queue no epilog. I can't find any relevant trace of what happens in the meantime in the logs. Is this normal behavior ? Is there some kind of polling mechanism ? Regards, Julien
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
