Re: [Ganglia-general] error building in FreeBSD like system: don't know how to make all

2014-01-07 Thread Nicholas Satterly
] Key fingerprint = 3EE9 550D D9D8 DB65 58C2 B58D CE78 EC6C 1EE3 8BD9 uid Nicholas Satterly (Debian Key) nfsatte...@gmail.com sub 4096R/23804EE9 2013-01-06 [expires: 2018-01-06] -- Rapidly troubleshoot

Re: [Ganglia-general] Grid of Grids Broken Again in 3.6.0? Is this a different problem?

2013-09-24 Thread Nicholas Satterly
uid Nicholas Satterly (Debian Key) nfsatte...@gmail.com sub 4096R/23804EE9 2013-01-06 [expires: 2018-01-06] -- October Webinars: Code for Performance Free Intel webinars can help you accelerate

Re: [Ganglia-general] IP not hostname show up for Web front end

2013-08-22 Thread Nicholas Satterly
Just to clarify this... you can certainly fix this by using override_hostname however there is no reason why it shouldn't work without it as long as the gmond that is being polled by the gmetad server can reverse lookup the IP addresses of all other servers with gmonds. Please check that reverse

Re: [Ganglia-general] Mysterious appearance of lvs (10.10.0.1) in ganlia page

2013-02-08 Thread Nicholas Satterly
Are you using override hostname or override IP? If so there was a patch that fixed hostname corruption a few months ago which could be related. --Nick. On 7 Feb 2013, at 21:43, Laurence Marks l-ma...@northwestern.edu wrote: A phantom node (spoof ?) calling itself lvs has recently appeared in

Re: [Ganglia-general] Can't use sFlow and Ganglia

2013-02-05 Thread Nicholas Satterly
D9D8 DB65 58C2 B58D CE78 EC6C 1EE3 8BD9 uid Nicholas Satterly (Debian Key) nfsatte...@gmail.com sub 4096R/23804EE9 2013-01-06 [expires: 2018-01-06] -- Free Next-Gen Firewall Hardware Offer Buy your

Re: [Ganglia-general] Can't use sFlow and Ganglia

2013-02-05 Thread Nicholas Satterly
was missing, but when I add it and restart ganglia, I get an error saying that module sFlow doesn't exist... I think I'm not running an enough recent version of ganglia, I'm using 3.5.0 Thanks in advance for your help. CyD Mardi 05/02/2013 à 13:21 Nicholas Satterly a écrit: Hi, Not sure if you

Re: [Ganglia-general] solaris 10 client, help.

2012-12-09 Thread Nicholas Satterly
Is the gmond.conf file any different for the Solaris host? --Nick. On Friday, December 7, 2012, Derek Smith wrote: Hi! I need some help. I installed this solaris 10 client using pkgadd and that went OK. Currenlty I have 4 AIX clients and 4 x86 RHel client reporting successfully to one

Re: [Ganglia-general] tcp error

2012-11-29 Thread Nicholas Satterly
(0x2b4119ae3000, 2223808) = 0 munmap(0x2b41198c4000, 2223840) = 0 munmap(0x2b41196a4000, 2226144) = 0 exit_group(1) = ? Regards Langton On 28/11/2012 22:56, Nicholas Satterly wrote: The TCP accept channel port is commented out in your config

Re: [Ganglia-general] tcp error

2012-11-29 Thread Nicholas Satterly
On 29/11/2012 13:58, Nicholas Satterly wrote: Something else is already bound to TCP port 8649... sin_addr=inet_addr(0.0.0.0)}, 16) = -1 EADDRINUSE (Address already in use) You can run lsof -i tcp:8649 as root to see what it is. It's most probably another gmond. --Nick. On Thu, Nov

Re: [Ganglia-general] tcp error

2012-11-29 Thread Nicholas Satterly
...@eclipseholdings.co.zawrote: I have commented out the tcp_accept_channel and it seems its running fine. what are the impacts running ganglia without the tcp channel On 29/11/2012 15:11, Nicholas Satterly wrote: It's not a bug. You can't have more than one gmond listening on the same TCP port. So

Re: [Ganglia-general] Gmond ideal

2012-11-22 Thread Nicholas Satterly
Run gmetad in debug mode and post the output... gmetad -d 2 On 21 Nov 2012, at 07:46, Langton langt...@eclipseholdings.co.za wrote: I have set a ganglia server to monitor a cluster of 390 nodes in unicast mode. Gmond goes ideal and nothing is shown on the graphs. The gmetad server is

Re: [Ganglia-general] Gmond send_metadata_interval

2012-11-15 Thread Nicholas Satterly
Hi The time_threshold setting for some metric collection groups is very high for metrics that don't change often (or never change). For example, the collection group for cpu_num, cpu_speed, mem_total etc has a time threshold of 1200 seconds or 20 minutes. The metadata interval configuration

Re: [Ganglia-general] .pyc file not getting made for python module

2012-10-26 Thread Nicholas Satterly
Hi Deep, Please send through the pyconf file so we can verify it looks OK. --Nick. On Fri, Oct 26, 2012 at 7:19 AM, deep desai deepdesai.n...@gmail.comwrote: Hi all, I have made a python module for getting metrics for rabbitmq. I followed following steps: 1) Put this rabbitmq.py file

Re: [Ganglia-general] .pyc file not getting made for python module

2012-10-26 Thread Nicholas Satterly
Modify your rabbitmq module to use metric names without dots, hashes and slashes and see if that makes a difference -- I think it will. 1. metric { 2. name = rmq_messages_ready.Queue#/ 3. } --Nick. On Fri, Oct 26, 2012 at 9:50 AM, deep desai deepdesai.n...@gmail.comwrote: hi

Re: [Ganglia-general] help :: making grids of gmetads

2012-10-24 Thread Nicholas Satterly
Hi Adrian, To create a grid of grids type hierarchy it depends on how the gmetad are configured, not the gmonds. To get a gemtad to pull metric data from another gmetad append the port number for the gmetad (normally 8651) to the data source. Using your example the gmetad3.conf would look like

Re: [Ganglia-general] help :: making grids of gmetads (and question about monitoring topology)

2012-10-24 Thread Nicholas Satterly
Yes, a gmetad or gmond can be polled by any number of different gmetad's in any combination or hierarchy that makes sense to you. --Nick. On Tue, Oct 23, 2012 at 9:57 AM, Adrian Sevcenco adrian.sevce...@cern.chwrote: On 10/22/2012 02:18 PM, Adrian Sevcenco wrote: Hi! I am a little bit lost

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Nicholas Satterly
Please send thru your gmetad.conf file so we can see how things are configured on the server side. * --Nick. * Be sure to anonymise any sensitive info. On 23 Oct 2012, at 19:21, Potter,Mark L mlpot...@mdanderson.org wrote: I am using what I think to be a fairly standard gmond.conf: globals

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Nicholas Satterly
://github.com/ganglia/monitor-core/pull/53 On Tue, Oct 23, 2012 at 7:36 PM, Potter,Mark L mlpot...@mdanderson.orgwrote: data_source MDACC 60 cnode340:8649 Everything else is default at this point. http://pastebin.com/UAQYxcX3 is a full copy. From: Nicholas

Re: [Ganglia-general] [Ganglia-developers] Adding Holt-Winters databases to existing rrd causes __SummaryInfo__ metric to fail to render on graphs

2012-10-23 Thread Nicholas Satterly
Hi Aaron, What is the output of rrdtool info cron.webServiceRequestCounter.Counter.rrd? --Nick. On Tue, Oct 23, 2012 at 9:44 PM, Aaron Nichols anich...@trumped.org wrote: Bumping this thread - I updated to rrdtool 1.4.7 rebuilt ganglia against the new version of rrdtool and it didn't appear

Re: [Ganglia-general] gmond python module, dynamic number of descriptors

2012-10-22 Thread Nicholas Satterly
Just to make it clear, you define statically your descriptors in the metric_init function, and regularly, you check for new metrics and call gmetric accordingly ? No, using gmetric within a python module means you do not need to use metric_init() to initialise any metrics at all. However, we

Re: [Ganglia-general] gmond python module, dynamic number of descriptors

2012-10-21 Thread Nicholas Satterly
Hi Jean-Baptiste, We had a similar requirement for dynamically adding new metrics. The only way we could make this work is to call gmetric from within the python module. A bit of a hack, I know, but it also gives you the ability to set the dmax option which means old metrics will expire and be

[Ganglia-general] override hostname with EC2 instance id

2012-10-12 Thread Nicholas Satterly
Hi, With the addition of the override_hostname configuration option for gmond I was wondering if anyone else had considered (or is already) using in EC2 the instance Id as the hostname instead of either the private or public DNS name. In dynamic cloud environments we (at the Guardian) seem to be

Re: [Ganglia-general] Ganglia no longer collecting GRID Data on port 8651

2012-10-10 Thread Nicholas Satterly
Hi Douglas, Yes, there was a change between 3.1.7 and 3.4.0 that broke grid of grids. A patch was submitted some time ago but hasn't been rolled out into a proper release yet [1]. Either rollback your gmetad's to 3.1.7 or build from the latest github repo. Regards, Nick [1]

Re: [Ganglia-general] Ganglia not collecting any metrics

2012-10-09 Thread Nicholas Satterly
hi, ensure iptables is configured to allow tcp and udp on port 8649, if it is running. run gmond in debug mode using -d 2 and put on pastebin regards nick On Tuesday, October 9, 2012, Potter,Mark L wrote: I've got one more idea/concept that I ran into this morning that MIGHT be your

Re: [Ganglia-general] Impact of gmond polling on data collection

2012-09-19 Thread Nicholas Satterly
, Nicholas Satterly nfsatte...@gmail.com wrote: Hi Chris, I've discovered there are two contributing factors to problems like this. 1. the number of metrics being sent (possibly in short bursts) can overflow the UDP receive buffer. 2. the time it takes to process metrics in the UDP receive

Re: [Ganglia-general] Impact of gmond polling on data collection

2012-09-19 Thread Nicholas Satterly
On Wed, Sep 19, 2012 at 8:45 AM, Nicholas Satterly nfsatte...@gmail.com wrote: Hi Peter, Thanks for the feedback. I've added a thread mutex to the hosts hash table as you suggested and will send a pull request in the next day or so. Regards, Nick On Mon, Sep 17, 2012 at 8:25

Re: [Ganglia-general] Impact of gmond polling on data collection

2012-09-17 Thread Nicholas Satterly
Hi Chris, I've discovered there are two contributing factors to problems like this. 1. the number of metrics being sent (possibly in short bursts) can overflow the UDP receive buffer. 2. the time it takes to process metrics in the UDP receive buffer causes TCP connections from the gmetad's to

[Ganglia-general] gmond Error 1 sending the modular data

2012-07-25 Thread Nicholas Satterly
Hi, I get a lot of these in /var/log/messages for different metrics... Jul 25 20:51:18 myhost06 /usr/sbin/gmond[13746]: Error 1 sending the modular data for mem_free Does anyone know what the root cause is, what might be effected and if there's a fix? Thanks, Nick

Re: [Ganglia-general] grid of grids (ganglia-3.3.1)

2012-04-26 Thread Nicholas Satterly
Hi, I've submitted a pull request to fix the gmetad problem mentioned below. See https://github.com/ganglia/monitor-core/pull/35 It's nothing fancy -- it just re-instates what was there before a patch broke it. Regards, Nick On Wed, Mar 28, 2012 at 4:42 PM, Arnau Bria listsar...@gmail.com

[Ganglia-general] Ganglia in EC2

2011-12-09 Thread Nicholas Satterly
Hi, Anyone willing to share best practise for deploying Ganglia in EC2? Specifically, I'm struggling with how to generate the gmetad data sources configuration when the list of instances is constantly changing. Also, does anyone bother using multiple gmetad's to produce a Grid of Grids type

[Ganglia-general] Do Ganglia python modules run with root privileges?

2011-08-31 Thread Nicholas Satterly
Hi, I've written some python modules that need to read some directories under the /proc filesystem that are read-only by root. I found on the Ganglia wiki mention of the python modules running as root even if the gmond runs as a non-root user ... * all metric collection routines run as root