I have an 11-node OSCAR cluster (Dell PowerEdge 2650, RedHat 7.3, OSCAR 2.1) up and running for six months now. Recently I have seen repeated nfs messages in the message file:
kernel: nfs: server nfs_oscar not responding, still trying kernel: nfs: server nfs_oscar OK
The load on computing nodes were about 2.0 to 2.5. Among the nodes we have gigabit Ethernet: eth0 NIC Link is UP, 1000 Mbps full duplex.
The cluster has also experiencing node hung/freezing at random times. When it hung, there's no error messages in the log files. I could ping it, but had to power-cycle the node. I don't know whether the nfs issue causing the system hung.
How do I improve the performance of nfs and stabilize the cluster?
Thank you in advance.
Regards, Julia
------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users
