Hi,

 

I have a 2 node HB 2.1.3 cluster running on CentOS 5.  I just upgraded
the passive node to CentOS 5.4, but the heartbeat packages did not
change:

 

heartbeat-stonith-2.1.3-3.el5.centos

heartbeat-2.1.3-3.el5.centos

heartbeat-pils-2.1.3-3.el5.centos

 

Now, when I try to start HB on the node, it reports that it is starting,
but the daemons never actually start:

 

r...@fc-fmcln02:~$ service heartbeat start

logd is already running

Starting High-Availability services:

2010/05/04_18:02:53 INFO:  Resource is stopped

2010/05/04_18:02:53 INFO:  Resource is stopped

                                                           [  OK  ]

r...@fc-fmcln02:~$ ps aux | grep heartbeat

root      6117  0.0  0.0   3916   696 pts/0    S+   18:02   0:00 grep
heartbeat

 

Logs say:

 

May  4 18:02:53 fc-fmcln02 heartbeat: [6112]: info: Version 2 support:
false

May  4 18:02:53 fc-fmcln02 heartbeat: [6112]: WARN: logd is enabled but
logfile/debugfile is still configured in ha.cf

May  4 18:02:53 fc-fmcln02 heartbeat: [6112]: info:
**************************

May  4 18:02:53 fc-fmcln02 heartbeat: [6112]: info: Configuration
validated. Starting heartbeat 2.1.3

May  4 18:02:53 fc-fmcln02 heartbeat: [6113]: info: heartbeat: version
2.1.3

May  4 18:02:53 fc-fmcln02 heartbeat: [6113]: info: Heartbeat
generation: 1208455483

 

Running /usr/lib/heartbeat/heartbeat -d 1000 shows:

 

 

heartbeat[6122]: 2010/05/04_18:04:00 ERROR: Cannot shmget for process
status: Invalid argument

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(keepalive,1)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(deadtime,10)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(warntime,5)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(initdead,120)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(udpport,694)

heartbeat: udpport setting must precede media statementsheartbeat[6122]:
2010/05/04_18:04:00 debug: add_option(bcast,eth1)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(auto_failback,off)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(node,fc-fmcln01.corp.follett.com)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(node,fc-fmcln02.corp.follett.com)

heartbeat[6122]: 2010/05/04_18:04:00 info: respawn directive: hacluster
/usr/lib/heartbeat/ipfail

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(use_logd,yes)

heartbeat[6122]: 2010/05/04_18:04:00 info: Enabling logging daemon

heartbeat[6122]: 2010/05/04_18:04:00 info: logfile and debug file are
those specified in logd config file (default /etc/logd.cf)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(logfile,/var/log/hb.log)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(debugfile,/var/log/heartbeat-debug.log)

heartbeat[6122]: 2010/05/04_18:04:00 debug: uid=hacluster, gid=<null>

heartbeat[6122]: 2010/05/04_18:04:00 debug: uid=hacluster, gid=<null>

heartbeat[6122]: 2010/05/04_18:04:00 debug: uid=<null>, gid=haclient

heartbeat[6122]: 2010/05/04_18:04:00 debug: uid=root, gid=<null>

heartbeat[6122]: 2010/05/04_18:04:00 debug: uid=<null>, gid=haclient

heartbeat[6122]: 2010/05/04_18:04:00 debug: Beginning authentication
parsing

heartbeat[6122]: 2010/05/04_18:04:00 debug: 16 max authentication
methods

heartbeat[6122]: 2010/05/04_18:04:00 debug: Keyfile opened

heartbeat[6122]: 2010/05/04_18:04:00 debug: Keyfile perms OK

heartbeat[6122]: 2010/05/04_18:04:00 debug: 16 max authentication
methods

heartbeat[6122]: 2010/05/04_18:04:00 debug: Found authentication method
[sha1]

heartbeat[6122]: 2010/05/04_18:04:00 info: AUTH: i=1: key = 0x8c52d78,
auth=0x5c6228, authname=sha1

heartbeat[6122]: 2010/05/04_18:04:00 debug: Outbound signing method is 1

heartbeat[6122]: 2010/05/04_18:04:00 debug: Authentication parsing
complete [1]

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(cluster,linux-ha)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(hopfudge,1)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(baud,19200)

heartbeat: baudrate setting must precede media
statementsheartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(hbgenmethod,file)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(realtime,true)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(msgfmt,classic)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(conn_logd_time,60)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(log_badpack,true)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(syslogmsgfmt,false)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(coredumps,true)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(crm,false)

heartbeat[6122]: 2010/05/04_18:04:00 info: Version 2 support: false

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(autojoin,none)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(uuidfrom,file)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(compression,zlib)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(compression_threshold,2)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(traditional_compression,no)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(max_rexmit_delay,250)

heartbeat[6122]: 2010/05/04_18:04:00 debug: Setting max_rexmit_delay to
250 ms

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(record_config_changes,on)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(record_pengine_inputs,on)

heartbeat[6122]: 2010/05/04_18:04:00 debug:
add_option(enable_config_writes,on)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(memreserve,6500)

heartbeat[6122]: 2010/05/04_18:04:00 debug: add_option(debug,1)

heartbeat[6122]: 2010/05/04_18:04:00 WARN: logd is enabled but
logfile/debugfile is still configured in ha.cf

heartbeat[6122]: 2010/05/04_18:04:00 info: **************************

heartbeat[6122]: 2010/05/04_18:04:00 info: Configuration validated.
Starting heartbeat 2.1.3

heartbeat[6122]: 2010/05/04_18:04:00 debug: HA configuration OK.
Heartbeat starting.

 

My /etc/ha.d/ha.cf has not changed either.  It is the same on node1:

 

keepalive 1                    # Interval between heartbeat (HB)
packets.

deadtime 10                   # How quickly HB determines a dead node.

warntime 5                    # Time HB will issue a late HB.

initdead 120                  # Time delay needed by HB to report a dead
node.

udpport 694                  # UDP port HB uses to communicate between
nodes.

bcast eth1                    # Which interface to use for HB packets.

auto_failback off         # Auto promotion of primary node upon return
to cluster.

node    fc-fmcln01.corp.blah.com # Node name must be same as uname -r.

node    fc-fmcln02.corp.blah.com # Node name must be same as uname -r.

 

respawn hacluster /usr/lib/heartbeat/ipfail

# Specifies which programs to run at startup

 

use_logd yes                  # Use system logging.

logfile /var/log/hb.log    # Heartbeat logfile.

debugfile /var/log/heartbeat-debug.log # Debugging logfile.

 

Can someone help me determine why HB dies immediately after I start it?
How can I further troubleshoot this issue?

 

Thanks!

 

Josh

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to