On Wed, Oct 7, 2009 at 6:26 AM, MAHESH, SIDDACHETTY M (SIDDACHETTY M) <[email protected]> wrote: > Hi Group, > > I have a peculiar problem with my two node cluster that occurs infrequently. > > I have an OCF resource agent AGENT1. By default, AGENT1 runs on my primary > node. My cib.xml has "migration-threshold" set to "3" to allow upto three > failures of the resource on my primary node. > > To test this, I deliberately stopped the AGENT1 resource on node A. 'crm_mon > --failcount' and 'AGENT1 monitor' both detect the resource as stopped (exit > code 7). But, 'crm_mon -V -1' incorrectly shows the resource as started and > this status is not getting updated. > > How to check why 'crm_mon -V -1' is not getting updated? Because, it > incorrectly detects the resource as "started", the resource is not restarted > on primary node.
Can you attach the output from cibadmin -Ql please? > > Thanks, > Mahesh > > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
