Can someone please explain to me how I can get my hardware watchdog to
monitor processes and reboot if a process hangs or is killed
ungracefully?  I have heartbeat running as it should.  I have a hardware
watchdog set up and I have confirmed that it is working properly.  However,
I would like to arrange things so that if something happens to a running
daemon (like heartbeat as a whole), watchdog stops resetting the
WDT, thereby rebooting the machine.  I might be completely off when I
mention this, but I was under the impression that the apphbd plugin is
supposed to do this.  If so, how do I set this up?  I have added "watchdog
/dev/watchdog" to the ha.cf file and run killall -9 heartbeat, but the
machine does not reboot.
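
To make the behaviour I am after concrete, here is a rough Python sketch.
It is not how heartbeat or apphbd work internally; it only illustrates the
idea of resetting the WDT while a monitored daemon is alive and stopping
once it is gone.  The function names, the interval, and the pid file path
in the usage note are my own assumptions, and whether the machine really
reboots after the last write depends on the watchdog driver.

```python
import os
import time


def process_alive(pid_path):
    """Return True if the PID stored in pid_path belongs to a live process."""
    try:
        with open(pid_path) as f:
            pid = int(f.read().strip())
        if pid <= 0:
            return False
        os.kill(pid, 0)  # signal 0 only checks existence, nothing is delivered
    except PermissionError:
        return True      # the process exists but belongs to another user
    except (OSError, ValueError):
        return False     # no pid file, unparsable contents, or no such process
    return True


def pet_while_alive(dev, pid_path, interval=10.0, sleep=time.sleep):
    """Write to the watchdog device only while the monitored process lives.

    dev is an already opened, writable binary file object (hypothetically
    /dev/watchdog).  When the process disappears, the function returns
    without writing again, so the hardware timer is left to expire.
    """
    while process_alive(pid_path):
        dev.write(b"\0")  # any write resets the hardware timer
        dev.flush()
        sleep(interval)
```

Hypothetical usage, run as root, with an assumed pid file location:
dev = open("/dev/watchdog", "wb", buffering=0) followed by
pet_while_alive(dev, "/var/run/heartbeat.pid").  The device is deliberately
left open afterwards, since some drivers disarm the timer on close.  Note
that this only catches a daemon that has died, not one that is hung.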

I am also aware that I can set up watchdog so if the load average goes above
a certain level, a reboot will occur.  But is there another, more efficient
way to reboot the system if a process is killed off unexpectedly or if the
process is hung?  I assume the load average would naturally go up in that
case, so perhaps the load average threshold alone would do just fine.
Or maybe I've had too much coffee and I've been thinking
too fast and deep about this, and have made it more complicated than it
already is...  %-|

Thanks for your help!