On Wed, Jun 25, 2008 at 11:25 AM, Bernard Li <[EMAIL PROTECTED]> wrote:
> Hi Kirk:
>
> On Wed, Jun 25, 2008 at 11:13 AM, Kirk McDonald
> <[EMAIL PROTECTED]> wrote:
>
>> I have a gmetad which probes a number of gmonds, and each gmond has a
>> number of hosts associated with it. When I scrape the XML from each of
>> the gmonds probed by gmetad myself, the TN value for each host looks
>> good (they average well under 10 seconds). However, when I scrape the
>> XML from gmetad, the TN values for each host are much higher, enough
>> so that it begins marking many of the hosts as down. I was wondering
>> what could cause this to happen.
>
> Can you let us know what version of Ganglia you are running and
> roughly how many hosts you are monitoring?
>
> While I do notice the TN values of my hosts from gmetad are quite high
> (as you mentioned), none of my hosts are actually marked down. So the
> issue may be unrelated.
>
> Cheers,
>
> Bernard
>

I am running Ganglia 3.0.7. Let's say I have a large number of hosts.

I've been spending a number of days now trying to get a handle on the
precise problem. Ganglia marks hosts down when their TN value creeps
over 80 seconds. (Four times their TMAX value, which is hardcoded to
20.) The average TN of the hosts in my gmetad's XML is over 100, and so
many of them are marked down.

-Kirk

_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general
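
For anyone who wants to check the same thing against their own XML dump, the rule described above (a host counts as down once TN exceeds four times TMAX) could be sketched roughly as follows. This is only an illustration, not Ganglia's own code: the sample XML, host names, and values are invented, the function names are hypothetical, and the strict "greater than" comparison is an assumption based on the "creeps over 80 seconds" wording.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample shaped like a gmetad XML dump.
# Host names, addresses, and TN values are invented for illustration.
SAMPLE_XML = """<GANGLIA_XML VERSION="3.0.7" SOURCE="gmetad">
<GRID NAME="example" AUTHORITY="http://example.invalid/" LOCALTIME="1214418300">
<CLUSTER NAME="example-cluster" LOCALTIME="1214418300" OWNER="" LATLONG="" URL="">
<HOST NAME="node1" IP="10.0.0.1" REPORTED="1214418295" TN="5" TMAX="20" DMAX="0"/>
<HOST NAME="node2" IP="10.0.0.2" REPORTED="1214418190" TN="110" TMAX="20" DMAX="0"/>
<HOST NAME="node3" IP="10.0.0.3" REPORTED="1214418220" TN="80" TMAX="20" DMAX="0"/>
</CLUSTER>
</GRID>
</GANGLIA_XML>"""

# Multiplier from the message above: down when TN > 4 * TMAX.
DOWN_FACTOR = 4


def host_is_down(tn, tmax, factor=DOWN_FACTOR):
    """Return True when TN exceeds factor * TMAX (strict comparison is assumed)."""
    return tn > factor * tmax


def summarize(xml_text):
    """Return a list of (host name, TN, is_down) tuples for every HOST element."""
    root = ET.fromstring(xml_text)
    rows = []
    for host in root.iter("HOST"):
        tn = int(host.get("TN"))
        tmax = int(host.get("TMAX"))
        rows.append((host.get("NAME"), tn, host_is_down(tn, tmax)))
    return rows


def average_tn(rows):
    """Mean TN across the hosts returned by summarize()."""
    return sum(tn for _, tn, _ in rows) / len(rows)


if __name__ == "__main__":
    rows = summarize(SAMPLE_XML)
    for name, tn, down in rows:
        print(name, tn, "down" if down else "up")
    print("average TN:", average_tn(rows))
```

Running the same summary over the XML scraped from each gmond and over the XML scraped from gmetad would show, host by host, where the TN values diverge.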

