Hello !
Im trying to monitorize a LVS cluster using ganglia.
Each machine in the cluster are connected to different networks.
Master Load balancer 1
- intranet network ( <- default route)
- data network
- management network
Slave Load balancer 2
- internet (only active if Master is down!)
- data network (only active if Master is down!)
- management network
Real Server 1 (Master Load balancer data network is default route)
- data network
- management network
Real Server 2 (Master Load balancer data network is default route)
- data network
- management network
Monitoring Box
- management network
- intranet network. ( <- default route)
I have installed gmond, auth, gexec in all machines, and also necessary
tools to do web monitoring through Monitoring Box, rrdtool,
ganglia-php-rrdtool...
By default i see that gmond has used the intranet network as the network
to do monitoring, (i suppose due to this machines have the default route
set to the intranet network)in the graphics (i see uptadated graphics,
php works...)
- Monitoring Box
- Load Balancer 1
But i want gmond _to use the management network_ in order to see the
other machines,so I edit an /etc/gmond.conf and use
mcast_channel 10.10.20.1
where 10.10.20.1 is the IP of the Monitoring Box into management
network.
it did not works, so I add a default route to host in each computer
route add -host 10.10.20.1 eth1
where eth1 was the interface connected to the management network, but
instead of all it only show computers connected to the intranet.
There is a simple hub between all the machines, no strange routers...
Where i can specify gmond daemon to look for boxes in management network
but not in our intranet?
If I use gstat in the Monitoring Box I recevie the 2 intranet ip of the
machines:
CLUSTER INFORMATION
Name: unspecified
Hosts: 2
Gexec Hosts: 2
Dead Hosts: 0
Localtime: Tue May 14 16:38:49 2002
CLUSTER HOSTS
Hostname LOAD CPU
Gexec
CPUs (Procs/Total) [ 1, 5, 15min] [ User, Nice, System, Idle]
192.168.1.207
1 ( 0/ 162) [ 0.00, 0.00, 0.00] [ 0.5, 0.0, 0.1, 98.9]
ON
192.168.1.208
1 ( 0/ 133) [ 0.18, 0.24, 0.18] [ 37.6, 0.4, 0.3, 61.6]
ON
The default route in this machine is a fw located in 192.168.1.0
network.
i also set in /etc/profile this lines:
export GEXEC_GMOND_SVRS="gpphost1 gpphost2 director1 director2 doctor"
export GEXEC_SVRS="gpphost1 gpphost2 director1 director2 doctor"
export GEXEC_SVRS="gpphost1 gpphost2 director1 director2 doctor"
where "gpphost1 gpphost2 director1 director2 doctor" is the name of the
machines listed above.
How can I specify the management network to do monitoring?
It could be a routing problem?
Thanks a lot in advance.
--
--------------------------------------------------------
Carlos J. Ramos Genasys II Spain, S.A.U.
Administrador de Sistemas Ventura de la Vega 5
Tel:+34-91-3649100 ext 170 28014 Madrid
Fax:+34-91-3649119 Spain
Email: [EMAIL PROTECTED]
--------------------------------------------------------