Hi list, I have a problem on a SGE setup where the home directory are shared trough glusterfs and some job failed to start because of a latency on the filesystem propagation between the login node and the compute node. What happen is that a script create a workdir with some support files, "cd" inside and then qsub a script, sometime the script start to run on the compute node too quickly and the "workdir" is not yet visible on that node. I know it is a glusterfs problem that must be resolved elsewhere but in the meantime, where can I put a "sleep"? Does exist a prerun hook that I can use for that? For other use (copying files around and cleanup) does exist a similar postrun hook?
thanks stef _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
