[Ganglia-general] Ganglia 3.1.2 -- Modules fail to load intermittently, Hosts disappear from the web interface, XML errors that can't be found

2009-05-14 Thread Adam Tygart
Hello everyone, I have been having a heck of a time diagnosing this problem. I recently updated to ganglia-3.1.2 from 3.0.7. Since then I have been plagued with (what looked like) data errors; misreported swap usage was the easiest to see. This seems to be caused by some reporting modules

Re: [Ganglia-general] Ganglia 3.1.2 -- Modules fail to load intermittently, Hosts disappear from the web interface, XML errors that can't be found

2009-05-14 Thread Adam Tygart
All of the XML is sent within the intranet. In fact, with this latest test, all of the XML is being passed through one switch. This is a 1Gbps switch with the switch itself being able to push 96Gbps split across all ports. The network is currently pushing 1MBps, so I don't think the network is maxed

Re: [Ganglia-general] Ganglia 3.1.2 -- Modules fail to load intermittently, Hosts disappear from the web interface, XML errors that can't be found

2009-05-14 Thread Richard Edward Horner
Well, I'd say the next step is to find out if it's a problem creating the data or reading/parsing it. I'd run some tests with telnet to see if the data is coming in fine or if there really are missing brackets. Rich(ard) On Thu, May 14, 2009 at 5:18 PM, Adam Tygart adam.tyg...@gmail.com wrote:
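
The telnet test suggested above can also be scripted. What follows is only a rough sketch, not something posted in the thread: it assumes the daemon dumps its XML and then closes the connection, and that it listens on gmond's default TCP port 8649 (adjust if your gmond.conf differs). The function names `fetch_xml` and `check_well_formed` are made up for illustration.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host, port=8649, timeout=10.0):
    """Read everything the daemon sends until it closes the connection.

    Port 8649 is assumed here (gmond's default); change it if needed.
    """
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            chunk = sock.recv(65536)
            if not chunk:  # empty read: the peer closed the socket
                break
            chunks.append(chunk)
    return b"".join(chunks)


def check_well_formed(data):
    """Return (True, None) if data parses as XML, else (False, message).

    The parser's message includes the line and column of the first
    problem, which helps locate a missing bracket in the raw dump.
    """
    try:
        ET.fromstring(data)
    except ET.ParseError as exc:
        return False, str(exc)
    return True, None
```

Calling `check_well_formed(fetch_xml("localhost"))` repeatedly on the affected host should show whether the XML is already broken when it leaves the daemon (a problem creating the data) or only after it is collected (a problem reading or parsing it).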

Re: [Ganglia-general] ganglia on apple problems

2009-05-14 Thread Ryan Evans
We have been running ganglia on our apple clusters for years. I have the gmond.conf set to broadcast to the head node without issue (192.xxx.xxx.xxx). I had to play with it a bunch to get it working though. We are running ganglia 3.1.2. This post caught my eye because I had one client that was

[Ganglia-general] gmetad/rrdhelpers.c sends lots of RRD_update messages in /var/log/messages

2009-05-14 Thread Michael Will
I have a cluster setup with /etc/gmetad.conf configured to pull XML data out of two daemons for the same cluster. Both have their own subset of data (one does system performance and the other does ipmi metrics) and all seems to work well except that sometimes gmetad seems to pull the data

Re: [Ganglia-general] gmetad/rrdhelpers.c sends lots of RRD_update messages in /var/log/messages

2009-05-14 Thread Rick Cobb
I see this behavior all the time. Sometimes it's even a feature, though most of the time it's a bug. The easiest way to cause it is to have gmonds for two different data_source lines using the same cluster name in their gmond.conf - which I think is what you're doing. That makes two different
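
The condition described above, two sources reporting the same cluster name, can be checked from saved XML dumps. This is a hedged sketch rather than anything from the thread: it assumes each source emits Ganglia XML whose CLUSTER elements carry a NAME attribute, and the function names and source labels are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def cluster_names(xml_data):
    """Return the set of CLUSTER NAME values found in one XML dump."""
    root = ET.fromstring(xml_data)
    return {c.get("NAME") for c in root.iter("CLUSTER")}


def shared_cluster_names(dumps):
    """Find cluster names reported by more than one source.

    dumps maps a label of your choosing (one per data_source line)
    to the XML bytes captured from that source.
    """
    seen = defaultdict(list)
    for label, data in dumps.items():
        for name in cluster_names(data):
            seen[name].append(label)
    # Keep only the names that appear under two or more labels.
    return {name: sorted(labels)
            for name, labels in seen.items() if len(labels) > 1}
```

An empty result means no two captured sources share a cluster name; any entry in the result lists the sources that collide on that name.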

Re: [Ganglia-general] gmetad/rrdhelpers.c sends lots of RRD_update messages in /var/log/messages

2009-05-14 Thread Michael Will
Rick C was asking what the exact scenario was in terms of gmond/gmetad: I don't use gmond, I just have gmetad query two local daemons that respond in lieu of all cluster nodes since the data is already available on the headnode. And as you mentioned, they do use the same clustername so they