Re: [Ganglia-general] How to take a box out of a cluster?

Steven Wagner Fri, 06 Dec 2002 13:43:31 -0800

2.5.1 should support the concept of DMAX for individual metrics. I thinkthat extends to hosts, as well. Basically it's metric aging - if a metrichasn't been transmitted for X seconds, take it off the list. It's designedexactly for this type of thing - getting rid of hosts that have been dead along time. It doesn't delete RRDs, though (I think ... Fed can overrule meon that one).

Funny thing, in my experience you only need to kill the monitoring coresthat your metadaemon polls.


Hope that helps...

Phil Radden wrote:

I wondered if anybody could help with an annoying situation I keep runningin to! Due to a couple of teething problems setting this all up, I keepgetting boxes coming up into the wrong multicast group. Unfortunately,once all the other boxes in that cluster have noticed the imposter, itseems remarkably difficult to persuade them all to forget about him again.
The only fix I've found to work is to stop _all_ the gmonds in theaffected cluster simultaneously, delete the appropriate rrds, and thenstart all the gmonds up again. I'm really hoping there's a quicker way!
Thanks in advance
Phil
PS Many thanks to all of you who've sent comments on my disk-throughputproblems - storing the rrdbs on a loopback mount is appearing to do
   the job for me nicely.



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Re: [Ganglia-general] How to take a box out of a cluster?

Reply via email to