Can someone please explain how I can get my hardware watchdog to monitor processes and reboot the machine if a process hangs or is killed in a non-graceful manner? I have heartbeat running as it should. I have a hardware watchdog set up, and I have confirmed that it is working properly. However, I would like to set things up so that if something happens to a running daemon (such as heartbeat as a whole), the watchdog stops resetting the WDT, thereby rebooting the machine. I might be completely off here, but I was under the impression that the apphbd plugin is supposed to do this. If so, how do I set it up? I have added "watchdog /dev/watchdog" to the ha.cf file and performed a "killall -9 heartbeat", but the machine does not reboot.
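A rough sketch of the behaviour described above, to make the question concrete: a small feeder that writes to the watchdog device only while a given process still exists. This is an illustration under assumptions, not how heartbeat or apphbd work internally. The names process_alive and feed_while_alive, the 10-second interval, and the max_feeds parameter are all hypothetical. It only notices a process that has disappeared, not one that is hung, and whether the timer stays armed after the device is closed depends on the driver (magic close and nowayout settings).

```python
import os
import time


def process_alive(pid: int) -> bool:
    """Return True if a process with this PID exists (signal 0 sends nothing)."""
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        # The process exists but belongs to another user.
        return True
    return True


def feed_while_alive(pid, device="/dev/watchdog", interval=10.0, max_feeds=None):
    """Write to the watchdog device only while `pid` is alive.

    Returns the number of feeds written. Once the process is gone, the
    function stops feeding and never writes the magic 'V' character, so a
    driver that honours magic close should leave the timer armed.
    max_feeds is a hypothetical knob that exists only to bound test runs.
    """
    feeds = 0
    fd = os.open(device, os.O_WRONLY)
    try:
        while process_alive(pid):
            os.write(fd, b"\0")  # any write resets the timer on typical drivers
            feeds += 1
            if max_feeds is not None and feeds >= max_feeds:
                break
            time.sleep(interval)
    finally:
        os.close(fd)
    return feeds
```

For example, feed_while_alive(1234) would keep resetting the timer until PID 1234 goes away and then return, leaving the hardware to time out. The PID here is a placeholder; in practice it would come from the daemon's pid file.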
I am also aware that I can configure watchdog so that a reboot occurs if the load average goes above a certain level. But is there another, more efficient way to reboot the system if a process is killed off unexpectedly or is hung? I can only assume that the load average would naturally go up in that case, so perhaps just setting the load average threshold would do fine. Or maybe I've had too much coffee, have been thinking too fast and too deep about this, and have made it more complicated than it really is... %-| Thanks for your help!
