Hello all, I'm experiencing some odd problems with Ganglia (3.1.1, under CentOS 5). Sometimes, gmond stops collecting data from remote hosts. I'll look at the graphs from gmetad, and find nothing but empty space. Telnet'ing to gmond on the local host will show nothing but an empty <HOST> element for the system in question:
<HOST NAME="somehost.example.com" IP="a.b.c.d" REPORTED="1225382936" TN="7" TMAX="20" DMAX="0" LOCATION="unspecified" GMOND_STARTED="1225306452"> </HOST> What's odd is that: (a) The REPORTED attribute shows a current timestamp. The REPORTED value, above, was within five seconds of the current time. (b) A packet trace shows the remote host sending the data. I've verified that the incoming packets do contain metric data. Gmond merrily ignores the data. It seems that restarting the *local* gmond does not correct the problem, but restarting the *remote* (sending) gmond does make things start working again. Has anyone encountered this behavior before? -- Lars Kellogg-Stedman <[EMAIL PROTECTED]> ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Ganglia-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/ganglia-general

