Dominik,
Dominik Klein wrote:
Alex Balashov wrote:
Greetings,
I am using a custom OCF RA and Heartbeat v2 + CRM/CIB for monitoring a
custom service at the application level in an active-passive binary
cluster.
When the service is detected as failing on the first node, the
resource manager tries to restart the service. I've set effective
service and failure stickiness to almost zero so if it fails to start,
it will fail over all the resources to the secondary node.
What I want to know is whether it's possible to fail the service over
immediately the moment a single monitor procedure fails, no questions
asked, without any attempts to restart. If so, what cluster property
sets should I set and how?
Set default-resource-failure-stickiness to -infinity.
cibadmin -U -o crm_config -X '<cluster_property_set
id="cib-bootstrap-options"><nvpair id="someid"
name="default-resource-failure-stickiness"
value="-infinity"/></cluster_property_set>'
should do.
Whichever monitor operation fails will render the resource unrunnable on
the node it failed on and the cluster will choose another node and start
the resource there.
Thank you. However, I tried this and it did not work; the resource
thrashes between monitor & start on the same node in perpetuity.
I am guessing this has to do with the score assigned the respective
nodes. I am not assigning a default score, just setting a preferred
node. My cib.xml is more or less converted straight from an haresources
file using the haresources2cib.py script:
<constraints>
<rsc_location id="rsc_location_group_1" rsc="group_1">
<rule id="prefered_location_group_1" score="100">
<expression attribute="#uname"
id="prefered_location_group_1_expr" operation="eq" value="ipc-1"/>
</rule>
</rsc_location>
</constraints>
I am not sure what default score that assigns the secondary node, nor
how to check it. I read the failover calculation formula in the FAQ but
am not entirely sure how to influence it to do what I want at this
point. You have to excuse me, I'm very new to this. :/
--
Alex Balashov
Evariste Systems
Web : http://www.evaristesys.com/
Tel : (+1) (678) 954-0670
Direct : (+1) (678) 954-0671
Mobile : (+1) (706) 338-8599
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems