Hello,
I am new to HA clustering and I am trying to fail over a service between two
nodes. I am using Heartbeat 2.0.8 on Red Hat Linux (64-bit).
The monitor operation on the resource does not trigger a failover when it
encounters an error. The script keeps running on the same node and does not
fail over to the other node.
My requirement is that the service (rsc_tt) should fail over to the other
node after the first failure of the monitor operation. The failover
behavior needs to be the same on both nodes.
Please let me know which configuration settings I need to change in order to
achieve this.
My ha.cf is as below:
#logfacility local7
logfile /var/log/ha-log
debugfile /var/log/ha-debug
debug 1
use_logd off
udpport 694
keepalive 1 # 1 second
deadtime 4
initdead 10
bcast eth0
#serial /dev/ttyS0 #if you use serial
#baud 19200 #if you use serial
node node1 node2
crm yes
auto_failback yes
#apiauth mgmtd uid=root
respawn root /usr/lib64/heartbeat/mgmtd -t
#respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s
My cib.xml is as below.
<cib admin_epoch="0" have_quorum="true" ignore_dtd="false" num_peers="1"
     cib_feature_revision="1.3" generated="true" epoch="7" num_updates="107"
     cib-last-written="Fri Jul 6 19:25:21 2007" ccm_transition="3"
     dc_uuid="5426e37c-9469-40a3-813c-eebeb0b7c6a0">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <attributes>
          <nvpair id="symmetric_cluster" name="symmetric_cluster" value="true"/>
          <nvpair id="no_quorum_policy" name="no_quorum_policy" value="stop"/>
        </attributes>
      </cluster_property_set>
    </crm_config>
    <nodes>
      <node id="1c3fdfbd-ee55-47e3-a8c2-52f34a5c5553" uname="node2" type="normal"/>
      <node id="5426e37c-9469-40a3-813c-eebeb0b7c6a0" uname="node1" type="normal"/>
    </nodes>
    <resources>
      <group id="group_org" collocated="true" ordered="true">
        <primitive class="ocf" id="IPaddr_1" provider="heartbeat" type="IPaddr">
          <operations>
            <op id="1" interval="1s" name="monitor" timeout="2s"/>
          </operations>
          <instance_attributes id="i1">
            <attributes>
              <nvpair id="id1" name="ip" value="ipnumber"/>
              <nvpair id="mask1" name="netmask" value="24"/>
              <nvpair id="nic1" name="nic" value="eth0"/>
            </attributes>
          </instance_attributes>
        </primitive>
        <primitive id="rsc_tt" class="heartbeat" type="ttmgr.sh" provider="heartbeat">
          <instance_attributes id="rsc_tt_instance_attrs">
            <attributes/>
          </instance_attributes>
          <operations>
            <op id="2b832860-331c-492e-84c9-9138eb95de70" name="start"
                description="begin opr" timeout="5" start_delay="0" disabled="false"
                role="Started" prereq="nothing" on_fail="stop"/>
            <op id="98cc0b94-6598-4860-a8df-3789194226fc"
                name="monitor" description="monitor resour" interval="2s" timeout="2s"
                start_delay="1" disabled="false" role="Started" prereq="nothing"
                on_fail="stop"/>
          </operations>
        </primitive>
      </group>
    </resources>
    <constraints>
      <rsc_location id="rsc_location_group_org" rsc="group_org">
        <rule id="prefered_location_group_org" score="100">
          <expression id="e1" attribute="#uname" operation="eq" value="node1"/>
        </rule>
      </rsc_location>
    </constraints>
  </configuration>
</cib>
Thanks
Chetan