Hi

I have torque pbs_server running on the headnode, which is also the submit host. There are 32 other compute nodes, mentioned in /var/spool/torque/server_priv/nodes file. There is a single queue at present. Sometimes, mpi jobs requesting for 28/30 nodes, land up running on the head node, though the head node is not a compute node at all. netstat -anp shows several sockets being openend for the job, and eventually the head node hangs up.
Appreciate any help/suggestion on this.

Sutapa
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to