Re: [Linux-HA] EXTERNAL: HA servers rebooting

2011-05-06 Thread Dimitri Maziuk
On 5/6/2011 7:34 AM, Lacoco, Joshua wrote:
> It's unlikely that heartbeat itself is causing the rebooting unless you enabled/configured stonith.

DRBD can be configured to halt the machine, though. I've seen Linux miss packets and time out on sockets under high load -- are you monitoring load ave

Re: [Linux-HA] HA servers rebooting

2011-05-06 Thread Max
Brent:
> ...
> Where I work, we've got this really weird problem whereby any server in a
> cluster pair may on occasion reboot. I'm thinking it's due to high IO. But
> I can't prove it.
> We have sysstat installed and via sar, nothing really sticks out as to
> what the culprit may be.
> ...

It

Re: [Linux-HA] EXTERNAL: HA servers rebooting

2011-05-06 Thread Lacoco, Joshua
It's unlikely that heartbeat itself is causing the rebooting unless you enabled/configured stonith. Do you have debugging enabled (the debug directive in the ha.cf config file)? Also enable core dumps (the coredumps directive) in ha.cf so you can see if there's more info on what's going on.
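
For reference, a minimal sketch of what those ha.cf settings might look like. The debug level and the log file paths here are assumptions taken from common sample configurations, not from this cluster, so adjust them to your setup:

```
# ha.cf fragment (illustrative sketch, not the poster's actual config)

# Debug level; 0 disables debugging (level 1 is an assumed starting point)
debug 1

# Where debug messages are written (path is an assumption)
debugfile /var/log/ha-debug

# Where all other heartbeat messages are written (path is an assumption)
logfile /var/log/ha-log

# Allow heartbeat to produce core dumps if it crashes
coredumps true
```

Heartbeat has to be restarted to pick up changes to ha.cf, so on a live pair this is probably best done one node at a time.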

[Linux-HA] HA servers rebooting

2011-05-06 Thread Brent Clark
Hiya

I'm wondering if someone could share some thoughts on a problem that my colleague and I are experiencing. Where I work, we've got this really weird problem whereby any server in a cluster pair may on occasion reboot. I'm thinking it's due to high IO. But I can't prove it. We have sysstat installe