We have a simple two node cluster that share an ip and one resource (exim). Connection is by serial link. The system works fine if we power down the master or take it offline, but if the master experiences a drive error, making the resource unavailable the failover never happens.
This has happened several times. Nothing shows up in either log file, and a hard reboot brings the master back online. Is this caused by the serial link still being active? Is there a way to have this type of issue cause the slave to become active? heartbeat: 1.2.5 ha.cf: debugfile /var/log/ha-debug logfile /var/log/ha-log keepalive 10 deadtime 30 initdead 60 baud 19200 serial /dev/ttyS0 auto_failback on node mailONE node mail _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
