Bob,

I would definitely recommend this series of actions:
1. stop (kill) gmond on all nodes
2. restart the gmetad process
3. start gmond on all nodes

This will ensure that none of nodes or the gmetad collect any old data from old config files and that they are using the new config files.

Second make sure that you have two different gmond.confs, one for each cluster. In your email, you show that gmetad is connecting to two different ports on the same machine (localhost which you say is linux). From my understanding, you should be connecting to the head node (whatever its name is) for the solaris cluster and not localhost.

Try those suggestions,
Ian

bob flynn wrote:

Hi, I am attempting to setup ganglia for the first time. Having read the documentation and looked at a couple of examples. I still have a few queries. Let me explain what I am attempting , what I have configured and what I see.

What I am attempting.

To configure two clusters.

What I have configured.

1. web front end server, with the following installed;

apache
ganglia-gmetad-3.0.0-1
ganglia-web-3.0.0-1

The gmetad conf file /etc/gmetad.conf contains the following entries;

data_source "linux cluster" 10 localhost:8649
data_source "solaris cluster" 10 localhost:8650
gridname "Leop LSF"

everything else remains as default.

2. client machine. ( Linux box )

ganglia-gmond-3.0.0-1

The gmond conf file /etc/gmond.conf contains the following entries;

cluster {
 name = "linux cluster"
}

What I am seeing is;

a number of machines in "unspecified" cluster. How do I blow these away. I understand that if I either restart all daemons, or else enter

globals {
   host_dmax = 3600
 }

in the gmond.conf it should take care of this. The thing is some nodes appear on both lists, ie in linux cluster and unspecified.

The other thing I am seeing for the hosts visible in the "linux cluster" cluster is that they are showing as down. Yet when I run a

telnet localhost 8649

I see the xml output. I figure I should be able to see the aggragate data on the head node, ie the one with gmetad and the php web interface installed. How do I debug this ? I have not gone down the road of installing the solaris binaries until I have this sorted. Any help appreciated.

Tks,

-Bob



Reply via email to