Continued progress with this.

Running /gstat --all/ on any of the boxes with gmond installed, I can see the stats. However, I cannot see the clusters in the web interface.


node> gstat --all
CLUSTER INFORMATION
      Name: linux cluster
     Hosts: 11
Gexec Hosts: 0
Dead Hosts: 1
 Localtime: Tue Mar  1 16:58:28 2005

Can someone confirm that gmond does NOT have to be running on the box with the web interface, and that this box only needs gmetad and the PHP interface?

Tks

-Bob ( getting somewhat frustrated )

Ian Cunningham wrote:

Bob,

I would definitely recommend this series of actions:
1. stop (kill) gmond on all nodes
2. restart the gmetad process
3. start gmond on all nodes

This will ensure that neither the nodes nor gmetad pick up stale data from old config files, and that they are all using the new config files.

Second, make sure that you have two different gmond.confs, one for each cluster. In your email, you show that gmetad is connecting to two different ports on the same machine (localhost, which you say is Linux). From my understanding, you should be connecting to the head node (whatever its name is) for the solaris cluster and not localhost.
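
To see at a glance which host and port gmetad polls for each cluster, the data_source lines can be parsed with a few lines of Python. This is only a rough sketch: the function name parse_data_sources is made up for illustration, and the defaults (port 8649, 15 second polling interval) are what I understand the stock gmetad.conf comments to describe.

```python
import shlex

DEFAULT_PORT = 8649      # assumed default gmond TCP port
DEFAULT_INTERVAL = 15    # assumed default gmetad polling interval, in seconds


def parse_data_sources(conf_text):
    """Return a list of (cluster_name, interval, [(host, port), ...])."""
    sources = []
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()      # drop comments and whitespace
        if not line.startswith("data_source"):
            continue
        fields = shlex.split(line)[1:]           # keeps "linux cluster" as one field
        name, rest = fields[0], fields[1:]
        interval = DEFAULT_INTERVAL
        if rest and rest[0].isdigit():           # optional polling interval
            interval = int(rest[0])
            rest = rest[1:]
        endpoints = []
        for item in rest:
            host, _, port = item.partition(":")
            endpoints.append((host, int(port) if port else DEFAULT_PORT))
        sources.append((name, interval, endpoints))
    return sources


# The two entries from Bob's gmetad.conf.
conf = '''
data_source "linux cluster" 10 localhost:8649
data_source "solaris cluster" 10 localhost:8650
'''
for name, interval, endpoints in parse_data_sources(conf):
    print(name, interval, endpoints)
```

With the configuration quoted in this thread, both clusters resolve to localhost, which is the point above: unless a gmond for the solaris cluster really listens on localhost:8650, that entry should name the solaris head node instead.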

Try those suggestions,
Ian

bob flynn wrote:

Hi, I am attempting to set up ganglia for the first time. I have read the documentation and looked at a couple of examples, but I still have a few queries. Let me explain what I am attempting, what I have configured, and what I see.

What I am attempting.

To configure two clusters.

What I have configured.

1. Web front end server, with the following installed:

apache
ganglia-gmetad-3.0.0-1
ganglia-web-3.0.0-1

The gmetad conf file /etc/gmetad.conf contains the following entries:

data_source "linux cluster" 10 localhost:8649
data_source "solaris cluster" 10 localhost:8650
gridname "Leop LSF"

everything else remains as default.

2. Client machine (Linux box), with the following installed:

ganglia-gmond-3.0.0-1

The gmond conf file /etc/gmond.conf contains the following entries:

cluster {
 name = "linux cluster"
}

What I am seeing is:

A number of machines in the "unspecified" cluster. How do I blow these away? I understand that if I either restart all daemons, or else enter

globals {
   host_dmax = 3600
 }

in gmond.conf, it should take care of this. The thing is, some nodes appear on both lists, i.e. in both "linux cluster" and "unspecified".
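
One way to pin down which nodes show up under more than one cluster is to take the XML that gmetad serves and group host names by cluster. The sketch below is an illustration only: the function names are invented, the sample XML is hand-written rather than real output, and it assumes the usual CLUSTER and HOST elements with NAME attributes. I believe gmetad serves this XML on its xml_port (8651 by default), but check your gmetad.conf.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def clusters_by_host(xml_text):
    """Map each host name to the sorted list of clusters that report it."""
    root = ET.fromstring(xml_text)
    seen = defaultdict(set)
    for cluster in root.iter("CLUSTER"):
        for host in cluster.iter("HOST"):
            seen[host.get("NAME")].add(cluster.get("NAME"))
    return {name: sorted(clusters) for name, clusters in seen.items()}


def duplicated_hosts(xml_text):
    """Keep only the hosts that appear under more than one cluster."""
    return {h: c for h, c in clusters_by_host(xml_text).items() if len(c) > 1}


# Hand-written sample (hypothetical host names), not real gmetad output.
SAMPLE = """<GANGLIA_XML VERSION="3.0.0" SOURCE="gmetad">
<GRID NAME="Leop LSF">
<CLUSTER NAME="linux cluster"><HOST NAME="node1"/><HOST NAME="node2"/></CLUSTER>
<CLUSTER NAME="unspecified"><HOST NAME="node2"/></CLUSTER>
</GRID>
</GANGLIA_XML>"""

print(duplicated_hosts(SAMPLE))
```

A host listed under both names is a hint that it was heard from under the old cluster name before the new gmond.conf took effect, so the stale entry lingers until it expires or the daemons are restarted.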

The other thing I am seeing for the hosts visible in the "linux cluster" cluster is that they are showing as down. Yet when I run a

telnet localhost 8649

I see the XML output. I figure I should be able to see the aggregate data on the head node, i.e. the one with gmetad and the PHP web interface installed. How do I debug this? I have not gone down the road of installing the solaris binaries until I have this sorted. Any help appreciated.
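
Rather than reading the telnet output by eye, the same XML can be pulled over a socket and boiled down to one line per host. This is a rough sketch, not a supported tool: fetch_xml and summarize are made-up names, and the "looks down" test (TN greater than four times TMAX) is my assumption about how the frontend decides a host is down, so treat it as a hint only.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host="localhost", port=8649, timeout=5.0):
    """Read the XML dump that gmond writes before it closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:          # gmond closed the connection: dump complete
                break
            chunks.append(data)
    return b"".join(chunks)


def summarize(xml_bytes, down_factor=4):
    """Return (cluster, host, tn, tmax, looks_down) for every HOST element."""
    rows = []
    root = ET.fromstring(xml_bytes)
    for cluster in root.iter("CLUSTER"):
        for host in cluster.iter("HOST"):
            tn = int(host.get("TN", 0))      # seconds since the host was last heard
            tmax = int(host.get("TMAX", 0))  # expected maximum gap between reports
            looks_down = tmax > 0 and tn > down_factor * tmax  # assumed rule
            rows.append((cluster.get("NAME"), host.get("NAME"), tn, tmax, looks_down))
    return rows


# On the box where gmond answers on port 8649:
#     for row in summarize(fetch_xml("localhost", 8649)):
#         print(row)
```

If the cluster name printed here is not the one in the matching data_source line, or TN keeps growing for hosts that are actually up, that narrows down where the "down" status comes from.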

Tks,

-Bob





