Try running the monitoring cores in debug mode (in the foreground) to see if they're receiving multicast packets from other hosts. You may need to increase your mcast_ttl value.
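
If the debug output is hard to interpret, an independent check is to join the multicast group by hand and watch for packets. What follows is only a minimal Python 3 sketch, not part of Ganglia. It assumes the channel 239.2.11.71 and port 8649 from the gmond.conf quoted below, and the helper names (membership_request, open_listener, listen) are made up for illustration.

```python
import socket

GROUP = "239.2.11.71"  # assumed: mcast_channel from the quoted gmond.conf
PORT = 8649            # assumed: default mcast_port from the quoted gmond.conf


def membership_request(group, iface_ip="0.0.0.0"):
    """Build an ip_mreq: the group address followed by the local interface address.

    "0.0.0.0" lets the kernel pick the interface from its routing table.
    """
    return socket.inet_aton(group) + socket.inet_aton(iface_ip)


def open_listener(group=GROUP, port=PORT, iface_ip="0.0.0.0"):
    """Open a UDP socket bound to the port and joined to the multicast group."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    membership_request(group, iface_ip))
    return sock


def listen(sock, count=10):
    """Print the sender and size of the next `count` datagrams."""
    for _ in range(count):
        data, (addr, _port) = sock.recvfrom(65535)
        print(f"{len(data)} bytes from {addr}")
```

Calling listen(open_listener()) on one node while gmond runs on the others should print a line per packet if multicast is getting through. The bind may fail if the local gmond already holds the port; in that case stop it first.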

Remember that all monitoring cores must use the same multicast address and port, otherwise they won't hear one another.
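
One way to confirm which group a node has actually joined is to decode the Group column of the /proc/net/igmp output quoted below. The sketch here assumes a little-endian host (i686, as in the quoted uname output), where the kernel prints the address with its bytes reversed; the function name decode_igmp_group is made up for illustration.

```python
import socket


def decode_igmp_group(hex_group):
    """Convert a Group field from /proc/net/igmp to dotted-quad notation.

    Assumes a little-endian host, where the hex digits show the address
    bytes in reverse order.
    """
    raw = bytes.fromhex(hex_group)
    return socket.inet_ntoa(raw[::-1])


# The two group values that appear in the quoted output.
for group in ("470B02EF", "010000E0"):
    print(group, "->", decode_igmp_group(group))
# 470B02EF -> 239.2.11.71
# 010000E0 -> 224.0.0.1
```

So eth0 on pcfarm has joined 239.2.11.71, which matches the mcast_channel in the attached gmond.conf; 224.0.0.1 is the all-hosts group that every multicast-capable interface joins.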

And finally, I have no idea what the level of support for 2.2.x kernels is with the monitoring core. Matt? :)

Andrei E. Chevel wrote:
Hi everybody,

I started to install the Ganglia core package on RH Linux 6.1
---------
Linux pcfarm.pnpi.spb.ru 2.2.18smp #3 SMP Tue Feb 12 19:15:04 MSD 2002 i686 unknown
--------

The installation itself went fine (tar, ./configure, make,
make install, ...).

I started three gmond daemons on different machines in our cluster with
negligible changes in /etc/gmond.conf (attached).

Also
pcfarm:shevel:Wed Aug 28::17:08:56:$> cat /proc/net/igmp
Idx Device : Count Querier Group Users Timer Reporter
1       lo        :     0      V2
                                010000E0     1 0:FC6E99C1               0
2       eth0      :     2      V2
                                470B02EF     1 0:FFFEA541               0
                                010000E0     1 0:FC6E99C1               0
3       eth1      :     1      V2
                                010000E0     1 0:FC6E99C1               0
-------------------------


Also
pcfarm:shevel:Wed Aug 28::17:09:46:$> gstat
CLUSTER INFORMATION
       Name: PNPI HEP Farm
      Hosts: 2
Gexec Hosts: 1
 Dead Hosts: 1
  Localtime: Wed Aug 28 17:10:27 2002

CLUSTER HOSTS
Hostname                     LOAD                       CPU              Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System, Idle]

pcfarm-01.pnpi.spb.ru
    2 (    0/  225) [  0.80,  0.67,  0.75] [   1.1,   0.0,   1.2,  97.7] ON
pcfarm:shevel:Wed Aug 28::17:10:27:$>
--------------------------------

Approximately the same picture on all machines:

-----------------------
pcfarm:shevel:Wed Aug 28::17:10:27:$> rsh pcfarm-02 gstat
CLUSTER INFORMATION
       Name: PNPI HEP Farm
      Hosts: 1
Gexec Hosts: 1
 Dead Hosts: 2
  Localtime: Wed Aug 28 17:13:05 2002

CLUSTER HOSTS
Hostname                     LOAD                       CPU              Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System, Idle]

pcfarm-02.pnpi.spb.ru
    2 (    0/   34) [  0.00,  0.00,  0.00] [   0.0,   0.0,   0.0,  99.9
-----------------------------


----------------------------------------------------------------
pcfarm:shevel:Wed Aug 28::17:11:48:$> rsh pcfarm-03 gstat
CLUSTER INFORMATION
       Name: PNPI HEP Farm
      Hosts: 2
Gexec Hosts: 1
 Dead Hosts: 1
  Localtime: Wed Aug 28 17:12:10 2002

CLUSTER HOSTS
Hostname                     LOAD                       CPU              Gexec
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System, Idle]

pcfarm-03.pnpi.spb.ru
    0 (    1/   38) [  1.00,  1.00,  1.00] [   0.0, 100.0,   0.0,   0.0] ON
pcfarm:shevel:Wed Aug 28::17:12:31:$>
---------------------------------------------------------------


With 'ntop' on pcfarm I see that the multicast packets are coming mainly from
pcfarm itself (about 150 packets, versus only two from pcfarm-02).

As I said, the picture is essentially the same on every other host.

Any ideas are welcome.

Thanks.




------------------------------------------------------------------------



# This is the configuration file for the Ganglia Monitor Daemon (gmond)
 # Documentation can be found at http://ganglia.sourceforge.net/docs/
 #
 # To change a value from its default simply uncomment the line
 # and alter the value
 #####################
 #
 # The name of the cluster this node is a part of
 # default: "unspecified"
 # name  "My Cluster"
name "PNPI HEP Farm"
 #
 # The multicast channel for gmond to send/receive data on.
 # NOTE: Must be in the multicast range from 224.0.0.0-239.255.255.255
 # default: 239.2.11.71
 # mcast_channel 239.2.11.71
mcast_channel 239.2.11.71
 #
 # The multicast port for gmond to send/receive data on
 # default: 8649
 # mcast_port    8649
 #
 # The multicast interface for gmond to send/receive data on
 # default: the kernel decides based on routing configuration
 # mcast_if  eth1
mcast_if eth0
 #
 # The multicast Time-To-Live (TTL) for outgoing messages
 # default: 1
 # mcast_ttl 1
mcast_ttl 3
 #
 # The number of threads listening to multicast traffic
 # default: 2
 # mcast_threads 2
 #
 # Which port should gmond listen for XML requests on
 # default: 8649
 # xml_port     8649
 #
 # The number of threads answering XML requests
 # default: 2
 # xml_threads   2
 #
 # Hosts ASIDE from "127.0.0.1"/localhost and those multicasting
 # on the same multicast channel which you will share your XML
 # data with. Multiple hosts are allowed on multiple lines.
 # default: none
 # trusted_hosts 1.1.1.1 1.1.1.2 1.1.1.3 \
 # 2.3.2.3 3.4.3.4 5.6.5.6
trusted_hosts 192.168.1.2 192.168.1.1 192.168.1.3 192.168.1.4 192.168.1.254 \
              212.58.198.197
 #
 # The number of nodes in your cluster.  This value is used in the
 # creation of the cluster hash.
 # default: 1024
 # num_nodes  1024
num_nodes 8
 #
 # The number of custom metrics this gmond will be storing.  This
 # value is used in the creation of the host custom_metrics hash.
 # default: 16
 # num_custom_metrics 16
num_custom_metrics 30
 #
 # Run gmond in "mute" mode.  Gmond will only listen to the multicast
 # channel but will not send any data on the channel.
 # default: off
 # mute on
mute off
 #
 # Run gmond in "deaf" mode.  Gmond will only send data on the multicast
 # channel but will not listen/store any data from the channel.
 # default: off
 # deaf on
deaf off
 #
 # Run gmond in "debug" mode.  Gmond will not background.  Debug messages
 # are sent to stdout.  Value from 0-100.  The higher the number the more
 # detailed debugging information will be sent.
 # default: 0
 # debug_level 10
 #
 # If you don't want gmond to setuid, set this to "on"
 # default: off
 # no_setuid  on
 #
 # Which user should gmond run as?
 # default: nobody
 # setuid nobody
 #
 # If you do not want this host to appear in the gexec host list, set
 # this value to "on"
 # default: off
 # no_gexec   on
 #
 # If you want any host which connects to the gmond XML to receive
 # data, then set this value to "on"
 # default: off
 # all_trusted on
# End of  the configuration file for the Ganglia Monitor Daemon (gmond)



