Hi,

Am 03.09.2012 um 17:11 schrieb Julien Nicoulaud:

> I'm using SGE 6.2u5. I built a tightly integrated parallel environment for my 
> application, using "qrsh -inherit". Everything works fine, but at the end of 
> every job using the PE, there is a huge time between the moment when the PE 
> script returns and the moment when the parent qsub returns (approx 2 minutes).
> 
> The only case were it returns fast is when I send a SIGINT to the parent 
> qsub. In every other configuration, there is this delay. This happens 
> whatever the result of the PE script is, or whether qrsh processes are 
> cleanly shutdown before returning or not.
> 
> My PE does not have any stop_proc_args, my queue no epilog.
> 
> I can't find any relevant trace of what happens in the meantime in the logs.
> 
> Is this normal behavior?

Yes.


> Is there some kind of polling mechanism?

It's a safety precaution to be sure all tasks ended.

But it's fixed in some follow up versions of SGE IIRC, but I can't find any 
links about it.

-- Reuti


> Regards,
> Julien
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users


_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to