On Oct 13, 2008, at 8:36 AM, Brian Klauss wrote:

When I disconnect the heartbeat from one of the servers, whether through the shutdown of the NIC or by physically removing the cable, the other node
senses this as a failure and immediately takes over both floating IP
addresses. Strangely the node where the heartbeat was removed from also acquires both floating IP addresses and therefore causes much grief on the network. I'm at a loss here and this is causing me a lot of headache. Any
ideas?

I'm guessing you have multiple NICs and that you're creating a split brain situation. Your ping nodes wont help if both machines can see them but not each other.



Here is node 1's ha.cf:

#debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 30
warntime 10
initdead 60
udpport 694
ucast eth2 10.0.0.2
auto_failback on
ping_group group1 10.220.3.50 10.220.3.55
apiauth client-name gid=haclient uid=hacluster
apiauth ipfail gid=haclient uid=hacluster
max_rexmit_delay 10000
hbgenmethod time
node node1.server.com
node node2.server.com

Here is node 2's ha.cf:

#debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 30
warntime 10
initdead 60
udpport 694
ucast eth2 10.0.0.1
auto_failback on
ping_group group1 10.220.3.50 10.220.3.55
apiauth client-name gid=haclient uid=hacluster
apiauth ipfail gid=haclient uid=hacluster
max_rexmit_delay 10000
hbgenmethod time
node node1.server.com
node node2.server.com

The IP address for the heartbeat on node1 is 10.0.0.1 and the heartbeat on node2 is 10.0.0.2. The ping_group are servers within the infrastructure
that are extrememly reliable.  Still, I am at a complete loss.

Any help will be greatly appreciated.

Best regards,
Brian Klauss
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to