Today, James Braid wrote forth saying...

> Yeah, I noticed there was some weirdness going on there. Something
> else I noticed which I am a bit worried about is that even if the
> solaris box is the only box on a multicast channel, and I check it
> with gstat, it thinks its dead after ~1min. I noticed that the
> REPORTED timestamp never increased....i.e it was like it was grabbing
> and reporting the stats once and not doing anything after that...is
> this related to the above problem or something completely different?
> Note that the gmond and gstat were both running on the same machine.

That is the same problem. Heartbeat messages are sent every 15 seconds, so if a machine doesn't receive a heartbeat from a host for 60 seconds (4 missed heartbeats), it assumes that host is down. If you use the latest CVS source, you should see that the problem is no longer there. Let me know otherwise.

Thanks so much for the feedback.
-matt
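
For reference, the timeout rule described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the actual gmond code: the names HEARTBEAT_INTERVAL, MISSED_LIMIT, and is_down are hypothetical, and treating exactly 60 seconds of silence as "still alive" is an assumption.

```python
# Illustrative sketch only; names are hypothetical, not taken from gmond.
HEARTBEAT_INTERVAL = 15   # seconds between heartbeat messages (from the message above)
MISSED_LIMIT = 4          # missed heartbeats tolerated before declaring a host down
TIMEOUT = HEARTBEAT_INTERVAL * MISSED_LIMIT  # 60 seconds


def is_down(last_heartbeat, now):
    """Return True if more than TIMEOUT seconds have passed since the last heartbeat.

    Both arguments are timestamps in seconds. The strict '>' comparison is an
    assumption; the message above does not say how the boundary is handled.
    """
    return (now - last_heartbeat) > TIMEOUT


if __name__ == "__main__":
    last = 1000                      # time the last heartbeat was recorded
    print(is_down(last, last + 45))  # three intervals of silence: still considered up
    print(is_down(last, last + 75))  # more than four missed heartbeats: considered down
```

If the recorded heartbeat timestamp never advances, as with the REPORTED value in the gstat output described above, `now - last_heartbeat` keeps growing and the host gets flagged as down about a minute later, even though the daemon is still running.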
