On 28 February 2012 11:02, Stefano Bridi <[email protected]> wrote:
> Hi list, I have a problem on a SGE setup where the home directory are
> shared trough glusterfs and some job failed to start because of a
> latency on the filesystem propagation between the login node and the
> compute node.
> What happen is that a script create a workdir with some support files,
> "cd" inside and then qsub a script, sometime the script start to run
> on the compute node too quickly and the "workdir" is not  yet visible
> on that node. I know it is a glusterfs problem that must be resolved
> elsewhere but in the meantime, where can I put a "sleep"?
> Does exist a prerun hook that I can use for that? For other use
> (copying files around and cleanup) does exist a similar postrun hook?
The prolog and epilog might be usable provided the output files are
sent somewhere else.  Not sure if they run in the
job's working directory.  Otherwise you could set up a client side jsv
that sleeps for a bit.  This would slow down
every qsub unless you can find a way to make it only sleep if the
current directory is newly created.

William

>
> thanks
> stef
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users
>
>

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to