Hi, Summary: Two node cluster running DRBD, IET with a floating IP and stonith enabled.
All this works well, I can kernel panic the machine, kill individual PIDs (for example IET) which then invoke failover. However, when I forkbomb the master, nothing happens. The box is dead, the services stop responding etc, but pacemaker does not recognise this and therefore failover does not occur. Very occasionally it will fence and invoke failover after several minutes or even longer, which is no good at all. To me, it seems extremely odd pacemaker itself does not automatically incorporate system health checks that can detect such a scenario. I've raised this a couple of times, but the suggestion is to run watchdog or create an RA to do resource checking. Watchdog certainly does its job and is easy to configure, but this seems flawed to me. Regards, James _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
