Continued progress with this.
Running /gstat --all/ on any of the boxes with gmond installed, I can
see the stats. However, I cannot see the clusters in the web interface.
node> gstat --all
CLUSTER INFORMATION
Name: linux cluster
Hosts: 11
Gexec Hosts: 0
Dead Hosts: 1
Localtime: Tue Mar 1 16:58:28 2005
Can someone confirm that gmond does NOT have to be running on the box
with the web interface, and that this box only needs gmetad and the
PHP interface?
Tks
-Bob ( getting somewhat frustrated )
Ian Cunningham wrote:
Bob,
I would definitely recommend this series of actions:
1. stop (kill) gmond on all nodes
2. restart the gmetad process
3. start gmond on all nodes
This will ensure that neither the nodes nor gmetad hold on to any old data
from old config files, and that they are all using the new config files.
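The ordering above can be sketched as a small Python helper that only builds the command list, so it can be reviewed before anything is run. This is a rough sketch under assumptions: the node names, the use of ssh, and the init script paths /etc/init.d/gmond and /etc/init.d/gmetad are hypothetical and will differ between installs.

```python
from typing import List


def restart_plan(nodes: List[str], gmetad_host: str,
                 gmond_ctl: str = "/etc/init.d/gmond",      # assumed path
                 gmetad_ctl: str = "/etc/init.d/gmetad"     # assumed path
                 ) -> List[List[str]]:
    """Build ssh commands in the order: stop every gmond,
    restart gmetad, then start every gmond again."""
    plan = [["ssh", node, gmond_ctl, "stop"] for node in nodes]
    plan.append(["ssh", gmetad_host, gmetad_ctl, "restart"])
    plan += [["ssh", node, gmond_ctl, "start"] for node in nodes]
    return plan


if __name__ == "__main__":
    # Hypothetical host names; print the plan instead of executing it.
    for cmd in restart_plan(["node01", "node02"], "webhost"):
        print(" ".join(cmd))
```

Each entry is an argument list, so it could be handed to subprocess.run once the plan looks right.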
Second, make sure that you have two different gmond.confs, one for each
cluster. In your email, you show that gmetad is connecting to two
different ports on the same machine (localhost, which you say is
linux). From my understanding, you should be connecting to the head
node (whatever its name is) for the solaris cluster and not localhost.
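To check which machines gmetad is actually told to poll, a short Python sketch like the following can list the data_source entries in a gmetad.conf and flag the clusters that point only at localhost. The helper names are made up for illustration, and the parsing assumes the usual data_source "name" [interval] host[:port] ... layout, with port 8649 used when none is given.

```python
import shlex
from typing import Dict, List, Tuple

LOCAL_NAMES = {"localhost", "127.0.0.1"}


def parse_data_sources(conf_text: str) -> Dict[str, List[Tuple[str, int]]]:
    """Map each data_source name to its list of (host, port) endpoints."""
    sources: Dict[str, List[Tuple[str, int]]] = {}
    for line in conf_text.splitlines():
        line = line.strip()
        if not line.startswith("data_source"):
            continue
        tokens = shlex.split(line, comments=True)
        if len(tokens) < 2:
            continue
        name, rest = tokens[1], tokens[2:]
        # An all-digit first token is taken to be the polling interval.
        if rest and rest[0].isdigit():
            rest = rest[1:]
        endpoints = []
        for item in rest:
            host, _, port = item.partition(":")
            endpoints.append((host, int(port) if port else 8649))
        sources[name] = endpoints
    return sources


def local_only(sources: Dict[str, List[Tuple[str, int]]]) -> List[str]:
    """Names of clusters whose every endpoint is the local machine."""
    return [name for name, eps in sources.items()
            if eps and all(host in LOCAL_NAMES for host, _ in eps)]


if __name__ == "__main__":
    with open("/etc/gmetad.conf") as fh:
        found = parse_data_sources(fh.read())
    for name in local_only(found):
        print("only polls localhost:", name)
```

With the two data_source lines quoted in the original message, both clusters would be reported as polling localhost only.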
Try those suggestions,
Ian
bob flynn wrote:
Hi, I am attempting to set up ganglia for the first time. I have read
the documentation and looked at a couple of examples, but I still have
a few queries. Let me explain what I am attempting, what I have
configured and what I see.
What I am attempting.
To configure two clusters.
What I have configured.
1. web front end server, with the following installed;
apache
ganglia-gmetad-3.0.0-1
ganglia-web-3.0.0-1
The gmetad conf file /etc/gmetad.conf contains the following entries;
data_source "linux cluster" 10 localhost:8649
data_source "solaris cluster" 10 localhost:8650
gridname "Leop LSF"
everything else remains as default.
2. client machine. ( Linux box )
ganglia-gmond-3.0.0-1
The gmond conf file /etc/gmond.conf contains the following entries;
cluster {
name = "linux cluster"
}
What I am seeing is:
a number of machines in the "unspecified" cluster. How do I blow these
away? I understand that if I either restart all daemons, or else enter
globals {
host_dmax = 3600
}
in the gmond.conf, it should take care of this. The thing is, some
nodes appear on both lists, i.e. in both linux cluster and unspecified.
The other thing I am seeing for the hosts visible in the "linux
cluster" cluster is that they are showing as down. Yet when I run a
telnet localhost 8649
I see the XML output. I figure I should be able to see the aggregate
data on the head node, i.e. the one with gmetad and the PHP web
interface installed. How do I debug this? I have not gone down the
road of installing the solaris binaries until I have this sorted. Any
help appreciated.
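One way to debug this is to read the same XML that telnet shows and summarize which hosts are reported under which cluster name. The Python sketch below is only an illustration under assumptions: it expects CLUSTER elements with a NAME attribute containing HOST elements with a NAME attribute, and the function names are invented here. Pointing it at gmetad's XML port (8651 by default, as far as I know) instead of a gmond port should show every cluster gmetad has collected.

```python
import socket
import xml.etree.ElementTree as ET
from typing import Dict, List


def fetch_xml(host: str, port: int = 8649, timeout: float = 5.0) -> bytes:
    """Read everything the daemon sends until it closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks)


def hosts_by_cluster(xml_data) -> Dict[str, List[str]]:
    """Map each cluster name to the host names listed under it."""
    root = ET.fromstring(xml_data)
    result: Dict[str, List[str]] = {}
    for cluster in root.iter("CLUSTER"):
        name = cluster.get("NAME", "")
        hosts = [h.get("NAME", "") for h in cluster.iter("HOST")]
        result.setdefault(name, []).extend(hosts)
    return result


def duplicated_hosts(clusters: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Hosts that show up under more than one cluster name."""
    seen: Dict[str, set] = {}
    for cluster_name, hosts in clusters.items():
        for host in hosts:
            seen.setdefault(host, set()).add(cluster_name)
    return {h: sorted(names) for h, names in seen.items() if len(names) > 1}


if __name__ == "__main__":
    clusters = hosts_by_cluster(fetch_xml("localhost", 8649))
    for name, hosts in clusters.items():
        print(name, len(hosts), "hosts")
    print("in more than one cluster:", duplicated_hosts(clusters))
```

If the connection to localhost is refused on the web server, that would suggest nothing is listening there, which would fit gmetad having no source to poll at that address.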
Tks,
-Bob