Brent: > ... > Where I work, we got this really weird problem whereby any servers in a > cluster pair may on occasion reboot. Im thinking its due to high IO. But > I cant prove it. > We have systat installed and via sar, nothing really sticks out as to > what the culprit may be. > ...
It can sometimes help to get your system messages redirected to another machine ... hosts will sometimes be unable to write to their messages file, but will manage to send a network packet containing information before they die (add something like "*.* @remote-address" into syslog.conf (assuming a simple syslogd)). You could also enable remote crash dumps. _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
