I am new to Ganglia (and IP Mulitcast) BUT I have searched around for an
answer and cant find one. 

Defintion of issue: Each host with gmond only sees itself and the
central host with gmetad does not receive or display information about
any other nodes, except for itself.

I have around 100 Linux servers on the same subnet (10/100 ethernet).
They are not a GRID or cluster, simply very similar hosts performing
similar functions. There are multiple switches between hosts (different
models and manufacturers) but no routers.

I have gmetad running on a central server (Fedora Core 2 with kernel
2.6.5) with Apache/PHP/RRDTOOl setup and working fine. The web interface
shows the stats of the local nide perfectly. However, I have started up
gmond on other servers (mostly Fedora Core 1 with 2.6.1) but the node
information does not show up in the web interface on the central
server. 

I have checked the following:

- gmond is running on all hosts and listening on 8649 
- I can telnet to 8649 on all hosts from gmetad server and receive XML
- gstat --all on each host only shows itself, even though all have the
same cluster name

I suspect that since I know nothing about IP Multicast, that this is not
working. So I guess my question is, where do I begin looking? Should I
start with the kernels on the machines or the configs of the switches
(only some of them are Cisco and have any sort of management interface)?
Any info would be great.

Thanks
Warwick Poole

Reply via email to