Like I hinted at, I am starting to figure out what I can use to intelligently tell me what's going on inside the switch. It's the classic problem of "I'm the only Unix sysadmin here and I really don't have time!" I appreciate the tips about switch monitoring; I think I'll start there.
(I'm already monitoring the NODES closely and have not seen any real issues, resource shortages, or errors.) My follow-up question: my qmaster host is also the NFS server for the SGE management, while the DATA the jobs require is all on our NFS appliance (an Isilon, in our case). Would you recommend I redo my cluster to store ALL the SGE goodies on the NFS appliance? I didn't think Grid Engine beat up the qmaster's NFS too much, but then again...
