Hi Group, I have a peculiar problem with my two node cluster that occurs infrequently.
I have an OCF resource agent AGENT1. By default, AGENT1 runs on my primary node. My cib.xml has "migration-threshold" set to "3" to allow upto three failures of the resource on my primary node. To test this, I deliberately stopped the AGENT1 resource on node A. 'crm_mon --failcount' and 'AGENT1 monitor' both detect the resource as stopped (exit code 7). But, 'crm_mon -V -1' incorrectly shows the resource as started and this status is not getting updated. How to check why 'crm_mon -V -1' is not getting updated? Because, it incorrectly detects the resource as "started", the resource is not restarted on primary node. Thanks, Mahesh _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
