We am using Nagios for network monitoring and for Pacemaker, we are using 
check_crm to provide status for nagios. This works very well if the failure is 
still seen by Pacemaker at the time nagios polls. If the failure condidtion 
causes a switchover, but the recovery is before the nest Nagios poll, Nagios 
does not report any issue, because there isn't one at the time of the poll, 
even if the transition of the resource changed to another node.



Is there a way to know that the transistion occured without scraping the 
messages log in a set interval and looking for only new events that state a 
transition or "Members Joined"/"Members Left". It appears that failcount only 
increments on a failure, but not on a transition imposed by someone manually 
puttintg  a node into standby.



I am just trying to figure out a way to trigger an alarm for Nagios any time 
resources get moved between nodes, so the cause can be investigated.



Any ideas?



Thanks,

Keith


_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to