Re: [Linux-HA] Failover on monitor failure.

Dominik Klein Thu, 13 Nov 2008 00:35:04 -0800

Alex Balashov wrote:

Greetings,
I am using a custom OCF RA and Heartbeat v2 + CRM/CIB for monitoring acustom service at the application level in an active-passive binarycluster.
When the service is detected as failing on the first node, the resourcemanager tries to restart the service. I've set effective service andfailure stickiness to almost zero so if it fails to start, it will failover all the resources to the secondary node.
What I want to know is whether it's possible to fail the service overimmediately the moment a single monitor procedure fails, no questionsasked, without any attempts to restart. If so, what cluster propertysets should I set and how?


Set default-resource-failure-stickiness to -infinity.

cibadmin -U -o crm_config -X '<cluster_property_setid="cib-bootstrap-options"><nvpair id="someid"name="default-resource-failure-stickiness"value="-infinity"/></cluster_property_set>'


should do.

Whichever monitor operation fails will render the resource unrunnable onthe node it failed on and the cluster will choose another node and startthe resource there.

In order to ever be able to run that resource on this node again, youhave to reset the particular failcount.

If you used pacemaker 1.0 you would not have to deal withfailure-stickiness anymore, but could use the very nice new"migration-threshold" feature. Set this to 1 and after 1 failure, theresource will failover, regardless of its score.


Regards
Dominik
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Failover on monitor failure.

Reply via email to