I spoke to a developer regarding this and he recommended moving to 2.x.
I'm still looking for further details. Anyone have any additional information?
One of the systems in question is Linux 2.6.9, heartbeat
1.2.3.cvs.20050927, and glibc 2.3.4.
I discovered a thread [1] describing a very similar issue.
Was this ever corrected in the 1.x branch? I'd prefer not to
restandardize production on 2.x yet. I reviewed the changelogs for both 1.x
and 2.x, where I could not locate any reference to this bug being corrected.
In my case, the magical number seems to be 447 days. I recently had it
occur again with another heartbeat instance where it hit that mark. I'm
concerned about it recurring at a higher uptime too. Perhaps 447 days is not
the only trigger.
I would greatly appreciate any assistance that could be lent.
Some log output:
heartbeat: 2008/11/11_10:46:37 info: These are nothing to worry about.
heartbeat: 2008/11/11_20:26:19 WARN: node fw-02a: is dead
heartbeat: 2008/11/11_20:26:19 ERROR: No local heartbeat. Forcing restart.
heartbeat: 2008/11/11_20:26:19 info: Heartbeat shutdown in progress. (3720)
heartbeat: 2008/11/11_20:26:19 WARN: node fw-02b: is dead
heartbeat: 2008/11/11_20:26:19 info: Link fw-02b:/dev/ttyS0 dead.
heartbeat: 2008/11/11_20:26:19 info: Link fw-02b:eth1 dead.
heartbeat: 2008/11/11_20:26:19 WARN: Late heartbeat: Node fw-02a: interval 41270
ms
heartbeat: 2008/11/11_20:26:19 info: Giving up all HA resources.
heartbeat: 2008/11/11_20:26:19 WARN: Cluster node fw-02b returning after partiti
on.
heartbeat: 2008/11/11_20:26:19 WARN: Deadtime value may be too small.
heartbeat: 2008/11/11_20:26:19 info: See documentation for information on tuning
deadtime.
heartbeat: 2008/11/11_20:26:19 info: Link fw-02b:eth1 up.
heartbeat: 2008/11/11_20:26:19 WARN: Late heartbeat: Node fw-02b: interval 41650
ms
heartbeat: 2008/11/11_20:26:19 info: Status update for node fw-02b: status activ
e
Best regards,
Warner.
[1] http://www.mail-archive.com/[EMAIL PROTECTED]/msg01449.html
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems