Hello all,

When I tried to execute an distributed job on a cluster, the job started 
successfully.

However, after some time, the job was getting hanged by the following process. 
Can anyone please let me know what could be the issue?

/opt/sge/utilbin/lx24-amd64/rsh -n -p 36425 <NodeName> exec 
'/opt/sge/utilbin/lx24-amd64/qrsh_starter' 
'/opt/spool/node/active_jobs/41270.1/1.node'

FYI, cluster is having both password less ssh and rsh communications between 
the nodes.

Thanks,
Britto.

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to