Hi Bernard, Thank you so much for your quick response and pointing to the right place:
3.1 collectors will request a gmond to resend its metric description information if needed and if using multicast, if you are using unicast there is no way to do that yet and so if you restart your collector will be left with partial or no data from the cluster that is being collected through it untill all gmond in that cluster are restarted. To workaround this problem if using unicast setup send_metadata_interval to a reasonable value so that all gmond resent their metadata periodically to the collector in case it gets lost. And it looks to have fixed the problem by setting send_metadata_interval=60. Is 60 a reasonable value to use? Or should I use a smaller or larger one? I have almost all other settings at their default values. Thank you so much for your help again! Guosheng -----Original Message----- From: Bernard Li [mailto:[email protected]] Sent: Monday, September 28, 2009 5:01 PM To: Li, Guosheng Cc: [email protected] Subject: Re: [Ganglia-general] Ganglia stopped reporting data to multiple clusters when using unicast to report same host data to multiple head hosts Hi Guosheng: Please have a look at the release notes: http://sourceforge.net/apps/trac/ganglia/wiki/ganglia_release_notes Search specifically for "send_metadata_interval" and see if that helps. Cheers, Bernard On Mon, Sep 28, 2009 at 2:48 PM, Li, Guosheng <[email protected]> wrote: > I am using unicast to send data of the same host to multiple hosts on > different clutsers/groups so that that same host can be viewable from > different clusters. Here is part of my gmond.conf: > > udp_send_channel { > > host = host1 > > port = 8649 > > } > > udp_send_channel { > > host = host2 > > port = 8650 > > } > > I have host1 defined as head host of cluster1 and host2 as head host of > cluster2. This works, i.e. I can see host data from both cluster1 and > cluster2, however, only for some short time, then Ganglia reported data only > to one cluster head node and stopped reporting to others. If I restart gmond > daemon on the host, it will start reporting to multiple clusters again, but > will stop again after some short time. As you can see, I am using different > port number for different hosts for udp_send_channel. When using the same > port number 8649 for different hosts, I got the same problem. > > What is the cause of the problem and how can I resolve? Any hints are very > much appreciated! > > Guosheng > > ------------------------------------------------------------------------------ > Come build with us! The BlackBerry® Developer Conference in SF, CA > is the only developer event you need to attend this year. Jumpstart your > developing skills, take BlackBerry mobile applications to market and stay > ahead of the curve. Join us from November 9-12, 2009. Register now! > http://p.sf.net/sfu/devconf > _______________________________________________ > Ganglia-general mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/ganglia-general > > ------------------------------------------------------------------------------ Come build with us! The BlackBerry® Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9-12, 2009. Register now! http://p.sf.net/sfu/devconf _______________________________________________ Ganglia-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/ganglia-general

