Brent:

 > ...
 > Where I work, we got this really weird problem whereby any servers in a
 > cluster pair may on occasion reboot. Im thinking its due to high IO. But
 > I cant prove it.
 > We have systat installed and via sar, nothing really sticks out as to
 > what the culprit may be.
 > ...

It can sometimes help to get your system messages redirected to another
machine ... hosts will sometimes be unable to write to their messages file,
but will manage to send a network packet containing information before they
die (add something like "*.* @remote-address" into syslog.conf (assuming a
simple syslogd)). You could also enable remote crash dumps.
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to