On Fri, Dec 16, 2011 at 5:48 PM, Seth Graham <set...@fnal.gov> wrote:
>
> On Dec 16, 2011, at 10:28 AM, Maciek Lasyk wrote:
>
>> I've been trying to make a basic ganglia configuration: one gmetad
>> getting data from 2 clusters (11 sources and 1 source) via unicast.
>> Unfortunately with attached configuration I see only the first host
>> from data_source
>
>
> It appears you're using the same port for both data_source lines, which is 
> why you're having issues. Ganglia uses the port number to differentiate 
> between clusters.


I've done some testing...

1. I've removed second cluster from gmetad.conf - just to be sure
2. I've checked telnet connectivity from gmetad server to gmond nodes
via interfaces from gmetad,conf - everyone is working fine dumping XML
3. I've set setuid to "off" - just to exclude "nobody's" influence
4. I've removed "Grid" setting (gridname=...)
5. I've changed xml_port to 8151 - other then default (but it does not
matter, because it's for another gmetads...)
6. I've set 3 bogus hosts to the data_source at the very beggining and
that is what i got after starting gmetad:

=========================================================
[root@some_host etc]# /usr/local/ganglia/sbin/gmetad -d 10
Going to run as user root
Sources are ...
Source: [SR1, step 15] has 10 sources
        192.168.0.36
        192.168.0.177
        192.168.0.144
        192.168.0.6
        192.168.0.7
        192.168.0.10
        192.168.0.9
        192.168.0.3
        193.25.1.121
        192.168.0.8
xml listening on port 8151
interactive xml listening on port 8652
cleanup thread has been started
Data thread 1262647616 is monitoring [SR1] data source
        192.168.0.36
        192.168.0.177
        192.168.0.144
        192.168.0.6
        192.168.0.7
        192.168.0.10
        192.168.0.9
        192.168.0.3
        193.25.1.121
        192.168.0.8
data_thread() for [SR1] failed to contact node 192.168.0.36
data_thread() for [SR1] failed to contact node 192.168.0.177
data_thread() for [SR1] failed to contact node 192.168.0.144
[SR1] is a 2.5 or later data stream

hash_create size = 1024
hash->size is 1031
hash_create size = 50
hash->size is 53
hash_create size = 50
hash->size is 53
Updating host some.working-host.com, metric load_one
====================================================



So... there is connectivity, gmonds are working fine and gmetad is
trying gmonds hosts from left to right, and when it hits a working one
- it gets data from only this one - and so on and so on..

------------------------------------------------------------------------------
Learn Windows Azure Live!  Tuesday, Dec 13, 2011
Microsoft is holding a special Learn Windows Azure training event for 
developers. It will provide a great way to learn Windows Azure and what it 
provides. You can attend the event by watching it streamed LIVE online.  
Learn more at http://p.sf.net/sfu/ms-windowsazure
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to