On Fri, Oct 31, 2008 at 00:22, Andreas Mock <[EMAIL PROTECTED]> wrote: >> -----Ursprüngliche Nachricht----- >> Von: "David Pinkerton H" <[EMAIL PROTECTED]> >> Gesendet: 30.10.08 22:55:22 >> An: "'[email protected]'" <[email protected]> >> Betreff: [Linux-HA] Heartbeat Cluster Monitoring > > Hi David, > > >> Ideally if a resource is stopped I would like the monitoring system to >> confirm it restarts on a different node, if not page out. I do not want to >> be paged if the cluster successfully fails over (ie. working as designed) > > the most simple IMHO: Make a reference 'crm_mon -r -1' with the state of a > working cluster. > Then do a crm_mon regularly and compare the output to your reference output. > You can extract only these lines you're interested in (e.g. don't compare the > timestamps). > If something changed send alert.
We'll be looking at making this less clunky in future releases (ie. having a daemon that watches cib changes looking for failures) Nice script for making do with what we have though :-) _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
