Hi list, I have a problem on an SGE setup where the home directories are
shared through glusterfs, and some jobs fail to start because of
latency in filesystem propagation between the login node and the
compute node.
What happens is that a script creates a workdir with some support files,
does a "cd" into it and then qsubs a script. Sometimes the script starts
to run on the compute node too quickly, and the workdir is not yet
visible on that node. I know it is a glusterfs problem that must be
resolved elsewhere, but in the meantime, where can I put a "sleep"?
Is there a prerun hook that I can use for that? For other uses
(copying files around and cleanup), is there a similar postrun hook?
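
For illustration, the kind of wait described above could be sketched as a
polling loop rather than a fixed "sleep". This is only a rough sketch under
assumptions: wait_for_dir is a hypothetical helper name, the timeout and
interval values are arbitrary, and it only helps if the job script itself
does the "cd" (if the scheduler changes into the workdir before the script
starts, the loop never gets a chance to run).

```shell
#!/bin/sh
# Hypothetical helper (not part of SGE): poll until a directory becomes
# visible on this node, or give up after a timeout.
#   $1 = directory to wait for
#   $2 = maximum seconds to wait (assumed default: 60)
#   $3 = seconds between checks (assumed default: 2)
wait_for_dir() {
    dir=$1
    timeout=${2:-60}
    interval=${3:-2}
    waited=0
    while [ ! -d "$dir" ]; do
        if [ "$waited" -ge "$timeout" ]; then
            echo "timed out waiting for $dir" >&2
            return 1
        fi
        sleep "$interval"
        waited=$((waited + interval))
    done
    return 0
}
```

The job script could then start with something like
wait_for_dir "$WORKDIR" 60 2 && cd "$WORKDIR" || exit 1
where WORKDIR is an illustrative variable name passed at submit time, for
example with qsub -v WORKDIR="$PWD".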

thanks
stef
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
