Markus Marquardt wrote:
Hello,
i've set up a two node cluster on FC4 with latest updates. When
restarting the active node i get some strange effects when it's coming
up again. In the ha-log of the rebooted node there are some messages
heartbeat[2023]: 2006/04/06_15:11:31 WARN: Gmain_timeout_dispatch
heartbeat[2023]: 2006/04/06_15:11:31 WARN: Late heartbeat: Node mynode1:
heartbeat[2023]: 2006/04/06_15:12:12 info: time_longclock: clock_t
wrapped around (uptime).
(see logs)
Later on heartbeat restarts two times with a split brain situation and
then is everything fine again.
So what could be wrong here? I can't see that the machine is under heavy
load when heartbeat is coming up. It's a dual xeon 3.0 ghz dell server.
Any suggestions?
This is totally weird...
The clock_t wrapping around is a REALLY wrong... Looking at the logs in
detail it doesn't seem to be happening to more than once to a given
process. If the machine is running an HZ value of 1000, then this
shouldn't happen unless the machine has been up for 49 days, or 497 days
with the more normal value of 100 for HZ.
What kernel version are you running here?
--
Alan Robertson <[EMAIL PROTECTED]>
"Openness is the foundation and preservative of friendship... Let me
claim from you at all times your undisguised opinions." - William
Wilberforce
_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/