2.5.1 should support the concept of DMAX for individual metrics. I think that extends to hosts, as well. Basically it's metric aging - if a metric hasn't been transmitted for X seconds, take it off the list. It's designed exactly for this type of thing - getting rid of hosts that have been dead a long time. It doesn't delete RRDs, though (I think ... Fed can overrule me on that one).

Funny thing, in my experience you only need to kill the monitoring cores that your metadaemon polls.

Hope that helps...

Phil Radden wrote:
I wondered if anybody could help with an annoying situation I keep running in to! Due to a couple of teething problems setting this all up, I keep getting boxes coming up into the wrong multicast group. Unfortunately, once all the other boxes in that cluster have noticed the imposter, it seems remarkably difficult to persuade them all to forget about him again.

The only fix I've found to work is to stop _all_ the gmonds in the affected cluster simultaneously, delete the appropriate rrds, and then start all the gmonds up again. I'm really hoping there's a quicker way!

Thanks in advance
Phil

PS Many thanks to all of you who've sent comments on my disk-throughput problems - storing the rrdbs on a loopback mount is appearing to do
   the job for me nicely.



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general



Reply via email to