Hi Andrew!!
Thanks a lot for the tip. I thought we did not have any update to
heartbeat from Novell, but looking at Yast online update (
https://you.novell.com/update/x86_64/update/SUSE-SLES/10/rpm/src/) I could
find the heartbeat 2.0.7 available for download.
After updating it and setting the following configuration:
ha.cf:
ping 10.226.0.100
respawn root /usr/lib64/heartbeat/pingd -m 200 -d 5s
apiauth ping gid=root uid=root
cib.xml:
<constraints>
<rsc_location id="place_db2pr1" rsc="group_db2pr1">
<rule id="prefered_place_db2pr1" score="100">
<expression attribute="#uname"
id="745c2e82-e6cb-4c56-9611-797a2533a47d" operation="eq"
value="s0580crmdb2pr1"/>
</rule>
<rule id="prefered_place_db2pr1_connected"
score_attribute="pingd">
<expression id="5042183e-b115-474e-9596-51d1581c0032"
attribute="pingd" operation="defined"/>
</rule>
</rsc_location>
</constraints>
The problem was solved! =0)
Thanks a lot and best regards!
Fabio Martins
[EMAIL PROTECTED] wrote on 24/05/2007 03:53:41:
> On 5/23/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> > Hi Everyone!
> >
> > I've send an e-mail before talking about a problem with DB2 resource.
Now
> > I'm sending this e-mail because I have problems with pingd.
> >
> > I'm configuring a cluster over SLES 10 on xSeries Servers (x86_64)
running
> > heartbeat 2.0.5 to manage DRBD, IP, Filesystems and DB2 resources.
>
> please apply the sles10 updates to get the latest heartbeat version.
>
> >
> > The node s0580crmdb2pr1 is the active node and the node s0580crmdb2pr2
is
> > the passive node (where no resources are running).
> >
> > The /etc/ha.d/ha.cf is configured as follows:
> >
> >
> > debugfile /var/log/ha-debug
> > logfile /var/log/ha-log
> > logfacility local0
> > keepalive 2
> > deadtime 30
> > warntime 10
> > initdead 120
> > udpport 694
> > bcast eth1
> > auto_failback on
> > watchdog /dev/watchdog
> > ping 10.226.0.100
> > respawn root /usr/lib64/heartbeat/pingd -m 200 -d 5s
> > apiauth ping gid=root uid=root
> > crm yes
> > node s0580crmdb2pr1 s0580crmdb2pr2
> >
> > Here the IP address 10.226.0.100 is the gateway for both servers. My
> > cib.xml have a place rule configured for pingd as follows:
> >
> > <constraints>
> > <rsc_location id="place_db2pr1" rsc="group_db2pr1">
> > <rule id="prefered_place_db2pr1" score="100">
> > <expression attribute="#uname"
> > id="745c2e82-e6cb-4c56-9611-797a2533a47d" operation="eq"
> > value="s0580crmdb2pr1"/>
> > </rule>
> > <rule id="prefered_place_db2pr1_connected"
> > score_attribute="pingd">
> > <expression id="5042183e-b115-474e-9596-51d1581c0032"
> > attribute="pingd" operation="defined"/>
> > </rule>
> > </rsc_location>
> > </constraints>
> > </configuration>
> > </cib>
> >
> > But everytime I start my heartbeat, I'm getting the following error
> > messages:
> >
> >
> > attrd[6143]: 2007/05/23_16:13:05 info: attrd_timer_callback:attrd.c
> > Sending flush op to all hosts for: pingd
> > attrd[6143]: 2007/05/23_16:13:05 info: attrd_ha_callback:attrd.c Sent
> > update 6: pingd=0
> > tengine[9698]: 2007/05/23_16:14:14 info: te_crm_command:actions.c
Skipping
> > wait for 21
> > heartbeat[6114]: 2007/05/23_16:14:31 info: killing
> > /usr/lib64/heartbeat/pingd -m 200 -d 5s process group 6138 with signal
15
> > pingd[6138]: 2007/05/23_16:14:31 info: send_update:pingd.c 0 active
ping
> > nodes
> > pingd[6138]: 2007/05/23_16:14:31 ERROR: crm_send_ipc_message:ipc.c IPC
> > Channel to 6143 is not connected
> > pingd[6138]: 2007/05/23_16:14:31 WARN: #========= IPC[outbound]
message
> > start ==========#
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG: Dumping message with 6
fields
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[0] : [t=attrd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[1] : [src=pingd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[2] : [task=update]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[3] : [attr_name=pingd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[4] : [attr_value=0]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[5] : [attr_dampening=5s]
> > pingd[6138]: 2007/05/23_16:14:31 ERROR: send_update:pingd.c Could not
send
> > update
> > heartbeat[30416]: 2007/05/23_16:16:45 info: glib: ping heartbeat
started.
> > heartbeat[30416]: 2007/05/23_16:16:46 info: Status update for node
> > 10.226.0.100: status ping
> > heartbeat[30416]: 2007/05/23_16:18:46 info: Starting child client
> > "/usr/lib64/heartbeat/pingd -m 200 -d 5s" (0,0)
> > heartbeat[30580]: 2007/05/23_16:18:46 info: Starting
> > "/usr/lib64/heartbeat/pingd -m 200 -d 5s" as uid 0 gid 0 (pid 30580)
> > cib[30582]: 2007/05/23_16:18:46 info: readCibXmlFile: [on-disk] <rule
> > id="prefered_place_db2pr1_connected" score_attribute="pingd">
> > cib[30582]: 2007/05/23_16:18:46 info: readCibXmlFile: [on-disk]
> > <expression id="5042183e-b115-474e-9596-51d1581c0032"
attribute="pingd"
> > operation="defined"/>
> > heartbeat[30416]: 2007/05/23_16:19:01 WARN: Client [pingd] pid 30580
> > failed authorization [client failed authorization]
> > heartbeat[30416]: 2007/05/23_16:19:01 ERROR:
api_process_registration_msg:
> > cannot add client(pingd)
> > pengine[30673]: 2007/05/23_16:19:55 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:55 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:58 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:58 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:20:00 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:20:00 WARN:
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> >
> > And my crm_verify -LV shows errors related to pingd too:
> >
> > s0580crmdb2pr1:~ # crm_verify -LV
> > crm_verify[30641]: 2007/05/23_16:19:22 WARN:
> > generate_location_rule:unpack.c nod
> > e s0580crmdb2pr2 did not have a value for pingd
> > crm_verify[30641]: 2007/05/23_16:19:22 WARN:
> > generate_location_rule:unpack.c nod
> > e s0580crmdb2pr1 did not have a value for pingd
> >
> > I'd like to ask you some help over this problem.
> >
> > Best Regards,
> > Fabio Martins
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems