OK, I got it:
heartbeat[13432]: 2008/11/03_06:53:57 info: No pkts missing from
lexoncom.com!
heartbeat[13432]: 2008/11/03_06:53:57 info: Other node completed standby
takeover of foreign resources.
heartbeat[13903]: 2008/11/03_06:55:36 ERROR: nice_failback flag is
obsolete.. Use auto_failback {on, off, legacy} instead.
heartbeat[13903]: 2008/11/03_06:55:36 ERROR: 'nice_failback yes' has been
changed to 'auto_failback off'
heartbeat[13903]: 2008/11/03_06:55:36 ERROR: See documentation for details.

I removed the nice_failback flag from ha.cf.
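
For anyone hitting the same messages, a quick check like the one below can flag the obsolete directive before heartbeat is restarted. This is only a minimal sketch and not part of heartbeat: the helper name find_obsolete and the OBSOLETE table are made up for illustration, and nice_failback is the only entry, taken from the ERROR lines above.

```python
# Hypothetical helper, not shipped with heartbeat.
# The only directive listed is the one heartbeat 2.1.3 complained about in the log.
OBSOLETE = {"nice_failback": "auto_failback {on, off, legacy}"}


def find_obsolete(ha_cf_text):
    """Return (line_number, directive, suggestion) for each obsolete directive."""
    hits = []
    for lineno, raw in enumerate(ha_cf_text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if not line:
            continue
        directive = line.split()[0]
        if directive in OBSOLETE:
            hits.append((lineno, directive, OBSOLETE[directive]))
    return hits


# A shortened copy of the ha.cf from this thread.
SAMPLE = """\
keepalive 2
deadtime 30
auto_failback yes
nice_failback yes
ping 192.168.1.254
"""

if __name__ == "__main__":
    for lineno, directive, hint in find_obsolete(SAMPLE):
        print(f"line {lineno}: '{directive}' is obsolete, use {hint} instead")
```

In practice the text would be read from /etc/ha.d/ha.cf (the usual location, assumed here) instead of the SAMPLE string.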

thx

>>>>
>>>>
>>>>


I have the config below:

ha.cf
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 30
warntime 10
initdead 30
bcast   eth0 eth1
node    lexoncom.com
node    voip.lexoncom.com
auto_failback yes
nice_failback yes
ping 192.168.1.254
respawn hacluster /usr/lib/heartbeat/ipfail

haresources
voip.lexoncom.com 192.168.1.251 asterisk
lexoncom.com drbddisk::r1 Filesystem::/dev/drbd1::/shared1::ext3 192.168.1.241 drbdlinks httpd dovecot sendmail
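
In haresources the first field of each line names the node that normally owns that resource group, so with the file above asterisk belongs to voip.lexoncom.com and the DRBD/mail/web group belongs to lexoncom.com. A rough sketch that prints this mapping is below. The function name parse_haresources is made up for illustration, and the parser only handles the simple cases (comments, blank lines, backslash continuations), so treat it as an assumption-laden helper and not as heartbeat's own parser.

```python
def parse_haresources(text):
    """Map each preferred node to the list of resources in its group(s)."""
    logical_lines = []
    pending = ""
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].rstrip()  # strip comments and trailing spaces
        if line.endswith("\\"):
            # A trailing backslash continues the group on the next line.
            pending += line[:-1] + " "
            continue
        pending += line
        if pending.strip():
            logical_lines.append(pending.split())
        pending = ""
    if pending.strip():
        logical_lines.append(pending.split())

    groups = {}
    for fields in logical_lines:
        node, resources = fields[0], fields[1:]
        groups.setdefault(node, []).extend(resources)
    return groups


# The haresources file from this thread, with the second group on one line.
SAMPLE = """\
voip.lexoncom.com 192.168.1.251 asterisk
lexoncom.com drbddisk::r1 Filesystem::/dev/drbd1::/shared1::ext3 192.168.1.241 drbdlinks httpd dovecot sendmail
"""

if __name__ == "__main__":
    for node, resources in parse_haresources(SAMPLE).items():
        print(f"{node}: {' '.join(resources)}")
```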

Below is the log from when lexoncom.com is started after having been shut
down, while voip.lexoncom.com was running.
According to the above config, lexoncom.com should take over the httpd,
dovecot and sendmail resources and leave asterisk running on the other
node.
This does not happen. The services are still running on voip.lexoncom.com.
Why does this active/active config not work? I used this setup before and
it was fine. It worked on Fedora 4.
I switched to CentOS and had no luck.
Kernel:
2.6.18-92.1.10.el5
heartbeat-2.1.3-3.el5.centos


thx
Bart

log below:

lexoncom:


logd[23493]: 2008/11/03_11:50:05 info: logd started with /etc/logd.cf.
logd[23494]: 2008/11/03_11:50:05 info: G_main_add_SignalHandler: Added
signal handler for signal 15
logd[23493]: 2008/11/03_11:50:05 info: G_main_add_SignalHandler: Added
signal handler for signal 15
heartbeat[23587]: 2008/11/03_11:50:05 ERROR: nice_failback flag is
obsolete.. Use auto_failback {on, off, legacy} instead.
heartbeat[23587]: 2008/11/03_11:50:05 ERROR: 'nice_failback yes' has been
changed to 'auto_failback off'
heartbeat[23587]: 2008/11/03_11:50:05 ERROR: See documentation for details.
heartbeat[23587]: 2008/11/03_11:50:05 info: Version 2 support: false
heartbeat[23587]: 2008/11/03_11:50:05 WARN: Logging daemon is
disabled --enabling logging daemon is recommended
heartbeat[23587]: 2008/11/03_11:50:05 info: **************************
heartbeat[23587]: 2008/11/03_11:50:05 info: Configuration validated.
Starting heartbeat 2.1.3
heartbeat[23588]: 2008/11/03_11:50:05 info: heartbeat: version 2.1.3
heartbeat[23588]: 2008/11/03_11:50:05 info: Heartbeat generation: 1224508099
heartbeat[23588]: 2008/11/03_11:50:05 info: glib: UDP Broadcast heartbeat
started on port 694 (694) interface eth0
heartbeat[23588]: 2008/11/03_11:50:05 info: glib: UDP Broadcast heartbeat
closed on port 694 interface eth0 - Status: 1
heartbeat[23588]: 2008/11/03_11:50:05 info: glib: UDP Broadcast heartbeat
started on port 694 (694) interface eth1
heartbeat[23588]: 2008/11/03_11:50:05 info: glib: UDP Broadcast heartbeat
closed on port 694 interface eth1 - Status: 1
heartbeat[23588]: 2008/11/03_11:50:05 info: glib: ping heartbeat started.
heartbeat[23588]: 2008/11/03_11:50:05 info: G_main_add_TriggerHandler:
Added signal manual handler
heartbeat[23588]: 2008/11/03_11:50:05 info: G_main_add_TriggerHandler:
Added signal manual handler
heartbeat[23588]: 2008/11/03_11:50:05 info: G_main_add_SignalHandler:
Added signal handler for signal 17
heartbeat[23588]: 2008/11/03_11:50:05 info: Local status now set to: 'up'
heartbeat[23588]: 2008/11/03_11:50:07 info: Link lexoncom.com:eth0 up.
heartbeat[23588]: 2008/11/03_11:50:07 info: Link lexoncom.com:eth1 up.
heartbeat[23588]: 2008/11/03_11:50:07 info: Link
192.168.1.254:192.168.1.254 up.
heartbeat[23588]: 2008/11/03_11:50:07 info: Status update for node
192.168.1.254: status ping
heartbeat[23588]: 2008/11/03_11:50:07 info: Link voip.lexoncom.com:eth1 up.
heartbeat[23588]: 2008/11/03_11:50:07 info: Status update for node
voip.lexoncom.com: status active
harc[23599]:    2008/11/03_11:50:07 info: Running /etc/ha.d/rc.d/status
status
heartbeat[23588]: 2008/11/03_11:50:07 info: Comm_now_up(): updating status
to active
heartbeat[23588]: 2008/11/03_11:50:07 info: Local status now set to: 'active'
heartbeat[23588]: 2008/11/03_11:50:07 info: Starting child client
"/usr/lib/heartbeat/ipfail" (498,496)
heartbeat[23616]: 2008/11/03_11:50:07 info: Starting
"/usr/lib/heartbeat/ipfail" as uid 498  gid 496 (pid 23616)
heartbeat[23588]: 2008/11/03_11:50:08 info: remote resource transition
completed.
heartbeat[23588]: 2008/11/03_11:50:08 info: remote resource transition
completed.
heartbeat[23588]: 2008/11/03_11:50:08 info: Local Resource acquisition
completed. (none)
heartbeat[23588]: 2008/11/03_11:50:08 info: Initial resource acquisition
complete (T_RESOURCES(them))
ipfail[23616]: 2008/11/03_11:50:14 info: Ping node count is balanced.
ipfail[23616]: 2008/11/03_11:50:15 info: Giving up foreign resources
(auto_failback).
ipfail[23616]: 2008/11/03_11:50:15 info: Delayed giveup in 4 seconds.
ipfail[23616]: 2008/11/03_11:50:18 info: giveup() called (timeout worked)
heartbeat[23588]: 2008/11/03_11:50:19 info: lexoncom.com wants to go
standby [foreign]
heartbeat[23588]: 2008/11/03_11:50:20 info: standby: voip.lexoncom.com can
take our foreign resources
heartbeat[23619]: 2008/11/03_11:50:20 info: give up foreign HA resources
(standby).
ResourceManager[23632]: 2008/11/03_11:50:20 info: Releasing resource
group: voip.lexoncom.com 192.168.1.251 asterisk
ResourceManager[23632]: 2008/11/03_11:50:20 info: Running
/etc/init.d/asterisk  stop
ResourceManager[23632]: 2008/11/03_11:50:20 info: Running
/etc/ha.d/resource.d/IPaddr 192.168.1.251 stop
IPaddr[23704]:  2008/11/03_11:50:20 INFO:  Success
heartbeat[23619]: 2008/11/03_11:50:20 info: foreign HA resource release
completed (standby).
heartbeat[23588]: 2008/11/03_11:50:20 info: Local standby process
completed [foreign].
heartbeat[23588]: 2008/11/03_11:50:21 WARN: 1 lost packet(s) for
[voip.lexoncom.com] [97:99]
heartbeat[23588]: 2008/11/03_11:50:21 info: remote resource transition
completed.
heartbeat[23588]: 2008/11/03_11:50:21 info: No pkts missing from
voip.lexoncom.com!
heartbeat[23588]: 2008/11/03_11:50:21 info: Other node completed standby
takeover of foreign resources.
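
The two ERROR lines about nice_failback are easy to miss among the info lines in a log like the one above. A small filter along these lines can pull out only the ERROR and WARN entries. It is a sketch under assumptions: the regular expression is inferred from the lines pasted in this thread, not from any documented heartbeat log format, and the names parse_log and problems are invented for illustration. Wrapped continuation lines are folded back into the entry they belong to.

```python
import re

# Pattern inferred from the log lines in this thread (an assumption, not a spec):
#   program[pid]: YYYY/MM/DD_HH:MM:SS level: message
ENTRY = re.compile(
    r"^(?P<prog>[\w.-]+)\[(?P<pid>\d+)\]:\s+"
    r"(?P<ts>\d{4}/\d{2}/\d{2}_\d{2}:\d{2}:\d{2})\s+"
    r"(?P<level>[A-Za-z]+):\s+(?P<msg>.*)$"
)


def parse_log(text):
    """Split log text into entries, joining wrapped lines onto the previous entry."""
    entries = []
    for raw in text.splitlines():
        match = ENTRY.match(raw)
        if match:
            entries.append({
                "prog": match["prog"],
                "ts": match["ts"],
                "level": match["level"].upper(),
                "msg": match["msg"].strip(),
            })
        elif entries and raw.strip():
            entries[-1]["msg"] += " " + raw.strip()
    return entries


def problems(entries):
    """Keep only the entries logged at ERROR or WARN level."""
    return [e for e in entries if e["level"] in ("ERROR", "WARN")]


# A few lines copied from the lexoncom.com log in this thread.
SAMPLE = """\
heartbeat[23587]: 2008/11/03_11:50:05 ERROR: nice_failback flag is
obsolete.. Use auto_failback {on, off, legacy} instead.
heartbeat[23587]: 2008/11/03_11:50:05 ERROR: 'nice_failback yes' has been
changed to 'auto_failback off'
heartbeat[23587]: 2008/11/03_11:50:05 info: Configuration validated.
Starting heartbeat 2.1.3
IPaddr[23704]:  2008/11/03_11:50:20 INFO:  Success
"""

if __name__ == "__main__":
    for entry in problems(parse_log(SAMPLE)):
        print(f"{entry['ts']} {entry['level']}: {entry['msg']}")
```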



voip.lexoncom.com:

heartbeat[10576]: 2008/11/03_06:49:07 info: Heartbeat restart on node
lexoncom.com
heartbeat[10576]: 2008/11/03_06:49:07 info: Link lexoncom.com:eth0 up.
heartbeat[10576]: 2008/11/03_06:49:07 info: Status update for node
lexoncom.com: status init
heartbeat[10576]: 2008/11/03_06:49:07 info: Status update for node
lexoncom.com: status up
ipfail[10604]: 2008/11/03_06:49:07 info: Link Status update: Link
lexoncom.com/eth0 now has status up
ipfail[10604]: 2008/11/03_06:49:07 info: Status update: Node lexoncom.com
now has status init
ipfail[10604]: 2008/11/03_06:49:07 info: Status update: Node lexoncom.com
now has status up
harc[12326]:    2008/11/03_06:49:07 info: Running /etc/ha.d/rc.d/status
status
harc[12342]:    2008/11/03_06:49:07 info: Running /etc/ha.d/rc.d/status
status
heartbeat[10576]: 2008/11/03_06:49:08 info: Status update for node
lexoncom.com: status active
ipfail[10604]: 2008/11/03_06:49:08 info: Status update: Node lexoncom.com
now has status active
harc[12358]:    2008/11/03_06:49:08 info: Running /etc/ha.d/rc.d/status
status
heartbeat[10576]: 2008/11/03_06:49:08 info: remote resource transition
completed.
ipfail[10604]: 2008/11/03_06:49:09 info: Asking other side for ping node
count.
ipfail[10604]: 2008/11/03_06:49:15 info: No giveup timer to abort.
heartbeat[10576]: 2008/11/03_06:49:19 info: lexoncom.com wants to go
standby [foreign]
heartbeat[10576]: 2008/11/03_06:49:20 info: standby: acquire [foreign]
resources from lexoncom.com
heartbeat[12374]: 2008/11/03_06:49:20 info: acquire local HA resources
(standby).
ResourceManager[12387]: 2008/11/03_06:49:21 info: Acquiring resource
group: voip.lexoncom.com 192.168.1.251 asterisk
IPaddr[12414]:  2008/11/03_06:49:21 INFO:  Running OK
heartbeat[12374]: 2008/11/03_06:49:21 info: local HA resource acquisition
completed (standby).
heartbeat[10576]: 2008/11/03_06:49:21 info: Standby resource acquisition
done [foreign].
heartbeat[10576]: 2008/11/03_06:49:21 info: remote resource transition
completed.




_______________________________________________
Linux-HA mailing list
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems