Hi Group,

  I have a peculiar problem with my two node cluster that occurs infrequently.

  I have an OCF resource agent AGENT1. By default, AGENT1 runs on my primary 
node. My cib.xml has "migration-threshold" set to "3" to allow upto three 
failures of the resource on my primary node. 

  To test this, I deliberately stopped the AGENT1 resource on node A. 'crm_mon 
--failcount' and 'AGENT1 monitor' both detect the resource as stopped (exit 
code 7). But, 'crm_mon -V -1' incorrectly shows the resource as started and 
this status is not getting updated.

  How to check why 'crm_mon -V -1' is not getting updated? Because, it 
incorrectly detects the resource as "started", the resource is not restarted on 
primary node.

  Thanks,
  Mahesh


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to