Re: [Linux-HA] pingd problems

fabiomm Thu, 24 May 2007 10:21:18 -0700

Hi Andrew!!

Thanks a lot for the tip. I thought we did not have any update to 
heartbeat from Novell, but looking at Yast online update (
https://you.novell.com/update/x86_64/update/SUSE-SLES/10/rpm/src/) I could 
find the heartbeat 2.0.7 available for download.


After updating it and setting the following configuration:

ha.cf:

ping 10.226.0.100
respawn root /usr/lib64/heartbeat/pingd -m 200 -d 5s
apiauth ping gid=root uid=root

cib.xml:

     <constraints>
       <rsc_location id="place_db2pr1" rsc="group_db2pr1">
         <rule id="prefered_place_db2pr1" score="100">
           <expression attribute="#uname" 
id="745c2e82-e6cb-4c56-9611-797a2533a47d" operation="eq" 
value="s0580crmdb2pr1"/>
         </rule>
         <rule id="prefered_place_db2pr1_connected" 
score_attribute="pingd">
           <expression id="5042183e-b115-474e-9596-51d1581c0032" 
attribute="pingd" operation="defined"/>
         </rule>
       </rsc_location>
     </constraints>

The problem was solved! =0)

Thanks a lot and best regards!
Fabio Martins

[EMAIL PROTECTED] wrote on 24/05/2007 03:53:41:

> On 5/23/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> > Hi Everyone!
> >
> > I've send an e-mail before talking about a problem with DB2 resource. 
Now
> > I'm sending this e-mail because I have problems with pingd.
> >
> > I'm configuring a cluster over SLES 10 on xSeries Servers (x86_64) 
running
> > heartbeat 2.0.5 to manage DRBD, IP, Filesystems and DB2 resources.
> 
> please apply the sles10 updates to get the latest heartbeat version.
> 
> >
> > The node s0580crmdb2pr1 is the active node and the node s0580crmdb2pr2 
is
> > the passive node (where no resources are running).
> >
> > The /etc/ha.d/ha.cf is configured as follows:
> >
> >
> > debugfile /var/log/ha-debug
> > logfile /var/log/ha-log
> > logfacility     local0
> > keepalive 2
> > deadtime 30
> > warntime 10
> > initdead 120
> > udpport 694
> > bcast   eth1
> > auto_failback on
> > watchdog /dev/watchdog
> > ping 10.226.0.100
> > respawn root /usr/lib64/heartbeat/pingd -m 200 -d 5s
> > apiauth ping gid=root uid=root
> > crm yes
> > node s0580crmdb2pr1 s0580crmdb2pr2
> >
> > Here the IP address 10.226.0.100 is the gateway for both servers. My
> > cib.xml have a place rule configured for pingd as follows:
> >
> >      <constraints>
> >        <rsc_location id="place_db2pr1" rsc="group_db2pr1">
> >          <rule id="prefered_place_db2pr1" score="100">
> >            <expression attribute="#uname"
> > id="745c2e82-e6cb-4c56-9611-797a2533a47d" operation="eq"
> > value="s0580crmdb2pr1"/>
> >          </rule>
> >          <rule id="prefered_place_db2pr1_connected"
> > score_attribute="pingd">
> >            <expression id="5042183e-b115-474e-9596-51d1581c0032"
> > attribute="pingd" operation="defined"/>
> >          </rule>
> >        </rsc_location>
> >      </constraints>
> >    </configuration>
> >  </cib>
> >
> > But everytime I start my heartbeat, I'm getting the following error
> > messages:
> >
> >
> > attrd[6143]: 2007/05/23_16:13:05 info: attrd_timer_callback:attrd.c
> > Sending flush op to all hosts for: pingd
> > attrd[6143]: 2007/05/23_16:13:05 info: attrd_ha_callback:attrd.c Sent
> > update 6: pingd=0
> > tengine[9698]: 2007/05/23_16:14:14 info: te_crm_command:actions.c 
Skipping
> > wait for 21
> > heartbeat[6114]: 2007/05/23_16:14:31 info: killing
> > /usr/lib64/heartbeat/pingd -m 200 -d 5s process group 6138 with signal 
15
> > pingd[6138]: 2007/05/23_16:14:31 info: send_update:pingd.c 0 active 
ping
> > nodes
> > pingd[6138]: 2007/05/23_16:14:31 ERROR: crm_send_ipc_message:ipc.c IPC
> > Channel to 6143 is not connected
> > pingd[6138]: 2007/05/23_16:14:31 WARN: #========= IPC[outbound] 
message
> > start ==========#
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG: Dumping message with 6 
fields
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[0] : [t=attrd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[1] : [src=pingd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[2] : [task=update]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[3] : [attr_name=pingd]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[4] : [attr_value=0]
> > pingd[6138]: 2007/05/23_16:14:31 WARN: MSG[5] : [attr_dampening=5s]
> > pingd[6138]: 2007/05/23_16:14:31 ERROR: send_update:pingd.c Could not 
send
> > update
> > heartbeat[30416]: 2007/05/23_16:16:45 info: glib: ping heartbeat 
started.
> > heartbeat[30416]: 2007/05/23_16:16:46 info: Status update for node
> > 10.226.0.100: status ping
> > heartbeat[30416]: 2007/05/23_16:18:46 info: Starting child client
> > "/usr/lib64/heartbeat/pingd -m 200 -d 5s" (0,0)
> > heartbeat[30580]: 2007/05/23_16:18:46 info: Starting
> > "/usr/lib64/heartbeat/pingd -m 200 -d 5s" as uid 0  gid 0 (pid 30580)
> > cib[30582]: 2007/05/23_16:18:46 info: readCibXmlFile: [on-disk] <rule
> > id="prefered_place_db2pr1_connected" score_attribute="pingd">
> > cib[30582]: 2007/05/23_16:18:46 info: readCibXmlFile: [on-disk]
> > <expression id="5042183e-b115-474e-9596-51d1581c0032" 
attribute="pingd"
> > operation="defined"/>
> > heartbeat[30416]: 2007/05/23_16:19:01 WARN: Client [pingd] pid 30580
> > failed authorization [client failed authorization]
> > heartbeat[30416]: 2007/05/23_16:19:01 ERROR: 
api_process_registration_msg:
> > cannot add client(pingd)
> > pengine[30673]: 2007/05/23_16:19:55 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:55 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:58 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:19:58 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:20:00 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr2 did not have a value for pingd
> > pengine[30673]: 2007/05/23_16:20:00 WARN: 
generate_location_rule:unpack.c
> > node s0580crmdb2pr1 did not have a value for pingd
> >
> > And my crm_verify -LV shows errors related to pingd too:
> >
> > s0580crmdb2pr1:~ # crm_verify -LV
> > crm_verify[30641]: 2007/05/23_16:19:22 WARN:
> > generate_location_rule:unpack.c nod
> > e s0580crmdb2pr2 did not have a value for pingd
> > crm_verify[30641]: 2007/05/23_16:19:22 WARN:
> > generate_location_rule:unpack.c nod
> > e s0580crmdb2pr1 did not have a value for pingd
> >
> > I'd like to ask you some help over this problem.
> >
> > Best Regards,
> > Fabio Martins
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] pingd problems

Reply via email to