Re: [Ganglia-general] A Java Virtual Machine probe

2006-03-29 Thread José Miguel Pereira Tavares
Hi Ben! On Tuesday, 28 March 2006 21:38, Ben Hartshorne wrote: Does it statically or dynamically link against those binaries? In other words, do I only need the ganglia src on the machine on which I compile JVMProbe, or will I need it to run? It's static except for glibc.

Re: [Ganglia-general] A Java Virtual Machine probe

2006-03-29 Thread José Miguel Pereira Tavares
On Wednesday, 29 March 2006 00:53, Ben Hartshorne wrote: can this probe get things like stats on the garbage collector (avg time spent in GC, etc.) Unfortunately no. :( I did some research on how that would be possible and I come up with two options: - using JVMPI

[Ganglia-general] Upgrade to apr-0.9.7

2006-03-29 Thread Martin Knoblauch
Hi, everyone monitoring ganglia-cvs will by now have seen that I have upgraded the apr sources within the ganglia CVS tree to version 0.9.7. This was done to fix some reported problems with the old version. So, if you are using CVS sources to build ganglia, please do a cvs update -Pd or

[Ganglia-general] gmetad not updating RRD's/hosts that are proper in gmond XML

2006-03-29 Thread Eli Stair
My installation started having an issue yesterday afternoon that I have yet to explain or remedy. One cluster that I have unicasting, has started losing hosts... the directory entries on disk never get created for newly deployed hosts, and gmond reports receiving messages for the host (and

[Ganglia-general] gmond unreliable on one cluster, must be constantly restarted

2006-03-29 Thread Steven A. DuChene
I have been struggling with a gmond process on one cluster here that after some indeterminate period of time marks everything else in the cluster as down so instead of having a clustersize (as indicqated from the ganglia python command line client) of 135, the clustersize is 1. I have on the

[Ganglia-general] Re: gmetad not updating RRD's/hosts that are proper in gmond XML

2006-03-29 Thread Eli Stair
The only issue I can find at all with this config is that the new hosts have been deployed by someone with two PTR records, both the proper one pointing to the A hostname, as well as all having an improper PTR - linux.FQDN. Is there a potential that gmetad is doing a lookup of both the

[Ganglia-general] How do I know that cluster nodes are down?

2006-03-29 Thread 황영철
Hi all I'm runing ganglia on 60 nodes cluseters!! I found a problem about node states. When downing the nodes, Ganglia Webmonitoring indecated the situation to me. But, When I connected front node and ran telnet localhost 20651, It displaied node information despite down state How

Re: [Ganglia-general] gmond unreliable on one cluster, must be constantly restarted

2006-03-29 Thread Martin Knoblauch
Steven, do you see anything in the /var/log/messages of the gmetad host? Do you insert any custom metrics via gmetric or other means? Cheers Martin --- Steven A. DuChene [EMAIL PROTECTED] wrote: I have been struggling with a gmond process on one cluster here that after some indeterminate

Re: [Ganglia-general] Re: gmetad not updating RRD's/hosts that are proper in gmond XML

2006-03-29 Thread Martin Knoblauch
Eli, yup. That could definitely cause problems. Do you see anything in the /var/log/messages of the gmetad host? Hmm. You may have to restart *all* gmonds, as well as the gmetad. This is something that I usually do when my ganglia setup was hosed somehow. Definitely the case for multicast

RE: [Ganglia-general] Re: gmetad not updating RRD's/hosts that are proper in gmond XML

2006-03-29 Thread Eli Stair
Martin, et al: I'm getting ...illegal attempt to update using time 1143703242 when last update time is 1143703242 (minimum one second step)... messages for the improper 'linux.' hosts. I was assuming that gmetad was sorting/indexing the data from those sources by the FQDN which was the same