Markus Marquardt wrote:
Hello,

i've set up a two node cluster on FC4 with latest updates. When restarting the active node i get some strange effects when it's coming up again. In the ha-log of the rebooted node there are some messages

heartbeat[2023]: 2006/04/06_15:11:31 WARN: Gmain_timeout_dispatch
heartbeat[2023]: 2006/04/06_15:11:31 WARN: Late heartbeat: Node mynode1:
heartbeat[2023]: 2006/04/06_15:12:12 info: time_longclock: clock_t wrapped around (uptime).
(see logs)

Later on heartbeat restarts two times with a split brain situation and then is everything fine again.

So what could be wrong here? I can't see that the machine is under heavy load when heartbeat is coming up. It's a dual xeon 3.0 ghz dell server.

Any suggestions?

This is totally weird...

The clock_t wrapping around is a REALLY wrong... Looking at the logs in detail it doesn't seem to be happening to more than once to a given process. If the machine is running an HZ value of 1000, then this shouldn't happen unless the machine has been up for 49 days, or 497 days with the more normal value of 100 for HZ.

What kernel version are you running here?




--
    Alan Robertson <[EMAIL PROTECTED]>

"Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions." - William Wilberforce
_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/

Reply via email to