Hi
I have torque pbs_server running on the headnode, which is also the
submit host. There are 32 other compute nodes, mentioned in
/var/spool/torque/server_priv/nodes file. There is a single queue at
present. Sometimes, mpi jobs requesting for 28/30 nodes, land up
running on the head node, though the head node is not a compute node at
all. netstat -anp shows several sockets being openend for the job, and
eventually the head node hangs up.
Appreciate any help/suggestion on this.
Sutapa
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers