[Linux-HA] heartbeat failover not working on hard drive error

Coach-X Thu, 27 Mar 2008 13:24:27 -0700

We have a simple two-node cluster that shares an IP address and one
resource (exim).  The nodes are connected by a serial link.  Failover
works fine if we power down the master or take it offline, but if the
master experiences a drive error that makes the resource unavailable,
the failover never happens.


This has happened several times.  Nothing shows up in either log file,
and a hard reboot brings the master back online.  Is this caused by the
serial link still being active?  Is there a way to have this type of
issue cause the slave to become active?

heartbeat: 1.2.5

ha.cf:
debugfile      /var/log/ha-debug
logfile        /var/log/ha-log
keepalive      10
deadtime       30
initdead       60
baud           19200
serial         /dev/ttyS0
auto_failback  on
node           mailONE
node           mail
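
To illustrate the serial-link question, here is a minimal sketch of the timing model implied by the keepalive and deadtime values in the ha.cf above. It assumes that a peer is declared dead only after no heartbeat packet has arrived for longer than deadtime seconds, and that a drive error does not by itself stop the heartbeat process from sending. The function names and the simulation are hypothetical and are not heartbeat's actual code.

```python
# Hypothetical model of dead-node detection; NOT heartbeat's implementation.
KEEPALIVE = 10  # seconds between heartbeat packets (keepalive in ha.cf)
DEADTIME = 30   # seconds of silence before the peer is declared dead (deadtime in ha.cf)


def peer_declared_dead(last_heartbeat_at, now, deadtime=DEADTIME):
    """Return True once the peer has been silent for longer than deadtime."""
    return (now - last_heartbeat_at) > deadtime


def simulate(master_sends_until, duration, keepalive=KEEPALIVE, deadtime=DEADTIME):
    """Return the first second at which the slave would take over, or None.

    master_sends_until is the last second at which the master still
    emits heartbeat packets over the serial link.
    """
    last_seen = 0
    for now in range(duration + 1):
        # The master sends one packet every `keepalive` seconds while it can.
        if now % keepalive == 0 and now <= master_sends_until:
            last_seen = now
        if peer_declared_dead(last_seen, now, deadtime):
            return now
    return None


# Master powered off after t=20: packets stop, so the slave takes over.
power_off = simulate(master_sends_until=20, duration=120)

# Drive error, but the heartbeat process keeps sending (assumption):
# the slave never sees a long enough silence, so no failover occurs.
disk_error = simulate(master_sends_until=120, duration=120)

print("takeover after power-off:", power_off)
print("takeover after drive error:", disk_error)
```

Under these assumptions the slave takes over only in the power-off case. If that matches what heartbeat does, the resource's health would have to be checked separately from the link itself.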
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
