Hello, I need some help with a strange problem which I just can't seem to figure out on my own.
In my setup I've got two load-balancers (running heartbeat 2.1.3-6lenny4 on Debian Lenny) connected with a null-modem cable. Most of the time this works like a charm, however once in a while (about once a month) suddenly both heartbeat instances loose their serial connection and end up in a split-brain situation. In the ha-log, this logline appears: /heartbeat[21791]: 2009/11/22_12:42:57 WARN: glib: TTY write timeout on [/dev/ttyS0] (no connection or bad cable? [see documentation]) heartbeat[21791]: 2009/11/22_12:42:57 info: glib: See http://linux-ha.org/FAQ#TTYtimeout for details / Now, I've read that FAQ entry and I'm positive (verified with a multimeter) that my serial-cable is wired correctly. The only way to get out of the split-brain situation and let the heartbeat software communicate with each other again is to reboot one, wait, and reboot the other. After that all is well, until it happens again about a month later. Does anyone know what could be causing this problem? Any help is appreciated, Fili _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
