Dear All,
Am I an idiot? Where am I going wrong?
I would like to use Ganglia to monitor various machines at work, grouped
together into various clusters, just as other people have done.
For simplicity, I have three machines: I want two of them in one cluster and
the third in a separate cluster.
So I have installed the Linux gmond rpm on each machine. I then edited the
/etc/gmond.conf file on each one and put in this entry:
For machine A - name "Systems Machines"
For machine B - name "Systems Machines"
For machine C - name "Tower Machines"
Every other entry in this file is left at its default. I then restarted the
gmond daemon on all three machines.
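To double-check which cluster name each gmond is actually announcing after the
restart, something like the Python sketch below could be run from any machine.
It is only a sketch under assumptions: that gmond dumps its XML state to any
client connecting on TCP port 8649 (the port used in the data_source lines in
this message) and that the dump contains CLUSTER and HOST elements carrying a
NAME attribute. The function names fetch_gmond_xml and clusters_in_xml are
made up for illustration and are not part of Ganglia.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_gmond_xml(host, port=8649, timeout=5.0):
    """Read the XML dump gmond is assumed to send to a connecting client."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:  # peer closed the connection: dump is complete
                break
            chunks.append(data)
    return b"".join(chunks)


def clusters_in_xml(xml_bytes):
    """Map each CLUSTER NAME to the sorted list of HOST NAMEs inside it."""
    root = ET.fromstring(xml_bytes)
    return {
        cluster.get("NAME"): sorted(h.get("NAME") for h in cluster.iter("HOST"))
        for cluster in root.iter("CLUSTER")
    }


# Hypothetical usage, with an address taken from this message:
#   print(clusters_in_xml(fetch_gmond_xml("172.16.11.136")))
```

If every gmond reports all three hosts under its own cluster name, that would
suggest the grouping is being lost at the gmond level rather than in gmetad.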
On machine A I have also installed the gmetad rpm. I have edited its
gmetad.conf file and put in the following line:
data_source "System Machines" 127.0.0.1:8649 172.16.11.136:8649
(172.16.11.136 being the IP address of machine B)
Then I restarted gmetad.
Running gstat -a, I get this output:
CLUSTER INFORMATION
       Name: System Machines
      Hosts: 2
Gexec Hosts: 0
 Dead Hosts: 0
  Localtime: Wed Jul 30 17:16:50 2003

CLUSTER HOSTS
Hostname                     LOAD                       CPU              Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System,  Idle]

wingnut.mpc.local
    1 (    2/  180) [  0.05,  0.06,  0.03] [   1.2,   0.0,   0.5,  98.4] OFF
bromley.mpc.local
    0 (    0/    0) [  0.00,  0.00,  0.00] [   0.0,   0.0,   0.0,   0.0] OFF
Then on machine C I also installed the gmetad rpm, the web frontend, the rrd
package, and Apache.
I have also edited the gmetad.conf file on machine C and put in these lines:
data_source "Tower Machines" localhost
data_source "Systems Machines" localhost
gridname "MPC"
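To make explicit how I read those lines, here is a small Python sketch that
parses data_source entries and reports any address listed under more than one
source. It is a rough illustration, not Ganglia's own parser: the helper names
parse_data_sources and shared_addresses are hypothetical, and I am assuming
the form data_source "name" [polling interval] address ... with the interval
being an optional bare integer.

```python
import shlex


def parse_data_sources(conf_text):
    """Return {source name: [addresses]} for each data_source line."""
    sources = {}
    for line in conf_text.splitlines():
        # shlex keeps the quoted cluster name together and drops '#' comments.
        tokens = shlex.split(line, comments=True)
        if len(tokens) < 3 or tokens[0] != "data_source":
            continue
        name, rest = tokens[1], tokens[2:]
        # Assumption: an optional polling interval may precede the addresses.
        if rest[0].isdigit():
            rest = rest[1:]
        sources[name] = rest
    return sources


def shared_addresses(sources):
    """Return the addresses that appear under more than one data source."""
    seen = {}
    for name, addresses in sources.items():
        for address in addresses:
            seen.setdefault(address, []).append(name)
    return {addr: names for addr, names in seen.items() if len(names) > 1}
```

Fed the three lines above, this reports localhost as shared by both "Tower
Machines" and "Systems Machines", i.e. both sources are read from the same
gmond on machine C.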
Running gstat -a on machine C, I get this information:
CLUSTER INFORMATION
       Name: Tower Machines
      Hosts: 3
Gexec Hosts: 0
 Dead Hosts: 0
  Localtime: Wed Jul 30 17:26:25 2003

CLUSTER HOSTS
Hostname                     LOAD                       CPU              Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System,  Idle]

wingnut.mpc.local
    0 (    0/    0) [  0.00,  0.00,  0.00] [   0.7,   0.0,   0.6,  99.2] OFF
askja.mpc.local
    1 (    1/  190) [  1.04,  1.11,  0.71] [   5.1,   0.0,   2.5,  92.5] OFF
bromley.mpc.local
    0 (    0/    0) [  0.06,  0.00,  0.00] [   5.1,   0.0,   0.8,  94.0] OFF
When I fire up the web browser and point it at machineC/gangliaFrontend, I see
all three machines under Tower Machines > -- Choose a node.
Not what I was hoping for.
If I change the following line in machine C's gmetad.conf file from
data_source "Systems Machines" localhost
to
data_source "Systems Machines" 172.16.200.29
(172.16.200.29 being the IP address of machine A, which is also running
gmetad) and then restart gmetad, nothing changes.
Where am I going wrong?
Thanks for any help offered.
Regards,
Dave.