Dominik,

Dominik Klein wrote:

Alex Balashov wrote:
Greetings,

I am using a custom OCF RA and Heartbeat v2 + CRM/CIB for monitoring a custom service at the application level in an active-passive binary cluster.

When the service is detected as failing on the first node, the resource manager tries to restart the service. I've set effective service and failure stickiness to almost zero so if it fails to start, it will fail over all the resources to the secondary node.

What I want to know is whether it's possible to fail the service over immediately the moment a single monitor procedure fails, no questions asked, without any attempts to restart. If so, what cluster property sets should I set and how?

Set default-resource-failure-stickiness to -infinity.

cibadmin -U -o crm_config -X '<cluster_property_set id="cib-bootstrap-options"><nvpair id="someid" name="default-resource-failure-stickiness" value="-infinity"/></cluster_property_set>'

should do.

Whichever monitor operation fails will render the resource unrunnable on the node it failed on and the cluster will choose another node and start the resource there.

Thank you. However, I tried this and it did not work; the resource thrashes between monitor & start on the same node in perpetuity.

I am guessing this has to do with the score assigned the respective nodes. I am not assigning a default score, just setting a preferred node. My cib.xml is more or less converted straight from an haresources file using the haresources2cib.py script:

     <constraints>
       <rsc_location id="rsc_location_group_1" rsc="group_1">
         <rule id="prefered_location_group_1" score="100">
<expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="ipc-1"/>
         </rule>
       </rsc_location>
     </constraints>

I am not sure what default score that assigns the secondary node, nor how to check it. I read the failover calculation formula in the FAQ but am not entirely sure how to influence it to do what I want at this point. You have to excuse me, I'm very new to this. :/


--
Alex Balashov
Evariste Systems
Web    : http://www.evaristesys.com/
Tel    : (+1) (678) 954-0670
Direct : (+1) (678) 954-0671
Mobile : (+1) (706) 338-8599
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to