Hello all, When I tried to execute an distributed job on a cluster, the job started successfully.
However, after some time, the job was getting hanged by the following process. Can anyone please let me know what could be the issue? /opt/sge/utilbin/lx24-amd64/rsh -n -p 36425 <NodeName> exec '/opt/sge/utilbin/lx24-amd64/qrsh_starter' '/opt/spool/node/active_jobs/41270.1/1.node' FYI, cluster is having both password less ssh and rsh communications between the nodes. Thanks, Britto.
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
