Hello !

Im trying to monitorize a LVS cluster using ganglia.

Each machine in the cluster are connected to different networks.

        Master Load balancer 1
        - intranet network ( <- default route)
        - data network
        - management network

        Slave Load balancer 2
        - internet (only active if Master is down!)
        - data network (only active if Master is down!)
        - management network


        Real Server 1 (Master Load balancer data network is default     route)
        - data network
        - management network

        Real Server 2 (Master Load balancer data network is default     route)
        - data network 
        - management network


        Monitoring Box
        - management network
        - intranet network. ( <- default route)


I have installed gmond, auth, gexec in all machines, and also necessary
tools to do web monitoring through Monitoring Box, rrdtool,
ganglia-php-rrdtool...

By default i see that gmond has used the intranet network as the network
to do monitoring, (i suppose due to this machines have the default route
set to the intranet network)in the graphics (i see uptadated graphics,
php works...)

        - Monitoring Box
        - Load Balancer 1


But i want gmond _to use the management network_ in order to see the
other machines,so I edit an /etc/gmond.conf and use 

mcast_channel 10.10.20.1

where 10.10.20.1 is the IP of the Monitoring Box into management
network.

it did not works, so I add a default route to host in each computer 

route add -host 10.10.20.1 eth1

where eth1 was the interface connected to the management network, but
instead of all it only show computers connected to the intranet.

There is a simple hub between all the machines, no strange routers...

Where i can specify gmond daemon to look for boxes in management network
but not in our intranet?

If I use gstat in the Monitoring Box I recevie the 2 intranet ip of the
machines:

CLUSTER INFORMATION
       Name: unspecified
      Hosts: 2
Gexec Hosts: 2
 Dead Hosts: 0
  Localtime: Tue May 14 16:38:49 2002

CLUSTER HOSTS
Hostname                     LOAD                       CPU             
Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System, Idle]

192.168.1.207
    1 (    0/  162) [  0.00,  0.00,  0.00] [   0.5,   0.0,   0.1,  98.9]
ON
192.168.1.208
    1 (    0/  133) [  0.18,  0.24,  0.18] [  37.6,   0.4,   0.3,  61.6]
ON

The default route in this machine is a fw located in 192.168.1.0
network. 

i also set in  /etc/profile this lines:

export GEXEC_GMOND_SVRS="gpphost1 gpphost2 director1 director2 doctor"
export GEXEC_SVRS="gpphost1 gpphost2 director1 director2 doctor"
export GEXEC_SVRS="gpphost1 gpphost2 director1 director2 doctor"

where "gpphost1 gpphost2 director1 director2 doctor" is the name of the
machines listed above.

How can I specify the management network to do monitoring?
It could be a routing problem?

Thanks a lot in advance.

-- 
--------------------------------------------------------
Carlos J. Ramos                 Genasys II Spain, S.A.U.
Administrador de Sistemas       Ventura de la Vega 5
Tel:+34-91-3649100 ext 170      28014 Madrid
Fax:+34-91-3649119              Spain
Email:  [EMAIL PROTECTED]
--------------------------------------------------------


Reply via email to