The objective is to achieve sub-minute monitoring of services such as httpd and exim/dovecot, so that I can run a script to notify/SMS myself when one of the machines fails to respond. Right now I'm just running a cron script every few minutes to ping the servers, but I discovered that a server can respond to pings while its services are dead to the world.
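To illustrate the difference between pinging a machine and checking its services, here is a minimal sketch of a service-level check that tries a TCP connection to each service port. The port map (80 for httpd, 25 for exim, 143 for dovecot IMAP), the function names, and the timeout are assumptions for illustration only, not part of any existing setup.

```python
import socket

# Hypothetical service-to-port map; adjust to whatever the servers actually run.
SERVICES = {"httpd": 80, "exim": 25, "dovecot": 143}


def check_tcp(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def failed_services(host, services=SERVICES, timeout=5.0):
    """Return the sorted names of services whose port did not accept a connection."""
    return sorted(
        name
        for name, port in services.items()
        if not check_tcp(host, port, timeout)
    )


# Example (hypothetical host name):
#   down = failed_services("server1.example.com")
#   if down: send a notification listing the names in `down`
```

A successful connect only shows that something is listening on the port. A hung daemon may still accept connections, so reading the SMTP/IMAP greeting or issuing an HTTP request would be a stricter test. Since cron cannot schedule more often than once a minute, sub-minute checks would need a long-running loop that sleeps between passes.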
I don't need recovery/HA/takeover, simply because these are not really clusters and I usually have to fix them with a reboot. So far, the only times the services have failed were when the machine was basically unresponsive to remote connections, due to a DDoS or some kind of loop caused by user script/email problems.

So I was wondering whether heartbeat/pacemaker would be suitable for monitoring services across a WAN. I know heartbeat seems to be deprecated in favour of Corosync. However, Corosync doesn't appear to be suitable: based on the sample conf file and the documentation, Corosync/Totem seems to work only in a local network context, because it needs to broadcast to find peers. I found a roadmap dated last year about adding WAN capability, so I assume it's still not possible. But please correct me if I'm mistaken.

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
