Re: [Ganglia-general] Ganglia Web top-level project + versioning

2010-11-05 Thread Brad Nicholes
On 11/4/2010 at 6:21 PM, in message aanlkti=oxs0t1fbsf9no5og6phqxcbjxuscm1w9kt...@mail.gmail.com, Bernard Li bern...@vanhpc.org wrote: Hi Brad: [I've changed the subject line to be more reflective of the current discussions] On Thu, Nov 4, 2010 at 8:50 AM, Brad Nicholes bnicho

Re: [Ganglia-general] Welcoming newest members to the Ganglia Team!

2010-11-04 Thread Brad Nicholes
As part of the Ganglia development team, I just wanted to add my welcome to all of the new committers as well. It is always great to see so many community members wanting to pitch in and help move the project forward. Brad On 11/3/2010 at 11:48 PM, in message

Re: [Ganglia-general] [Ganglia-developers] IRC chat on Ganglia Web Frontend re-write 10/13/2010 (Wed) 9-10am PDT

2010-11-04 Thread Brad Nicholes
I'm not sure that we need to physically split the web frontend from the backend as far as the Ganglia project goes. IMO, why not just follow the pattern that we already have in SVN under trunk. Right now we have trunk/monitor-core which includes everything. Could we just create a new

Re: [Ganglia-general] Help: Using Ganglia with KVM/QEMU/libvirt ? Any Python DSOs out there?

2010-10-21 Thread Brad Nicholes
On 10/20/2010 at 8:22 PM, in message aanlkti=ezmjzo4s6sqyp4m7bdhposhxp2oufagz_z...@mail.gmail.com, Lukas Lundell lukaslund...@gmail.com wrote: Looking to use Ganglia to monitor a virtual linux environment (kvm/qemu). I haven't seen any plugins or Python DSOs for something like libvirt so that

Re: [Ganglia-general] How is the multicpu module to be used?

2010-06-16 Thread Brad Nicholes
On 6/16/2010 at 2:10 PM, in message 20100616201041.ga9...@transpect.com, Whit Blauvelt w...@transpect.com wrote: Hi, I've compiled ganglia-3.1.7 on CentOS 5.5. The main thing I'm trying to monitor on our cluster is load on individual CPU cores. It looks like the included multicpu module

Re: [Ganglia-general] sending float from python module ends up as integer (and very large)

2010-05-07 Thread Brad Nicholes
On 5/6/2010 at 5:57 PM, in message l2qd4c731da1005061657xf03acf27x1f1b19b4a7909...@mail.gmail.com, Bernard Li bern...@vanhpc.org wrote: Hi David: On Thu, May 6, 2010 at 3:39 PM, David Birdsong david.birds...@gmail.com wrote: i've since just convereted 1.xx seconds to milliseconds and now

Re: [Ganglia-general] sending float from python module ends up as integer (and very large)

2010-05-07 Thread Brad Nicholes
On 5/7/2010 at 12:48 PM, in message r2wd4c731da1005071148t4107614fj661b0e3b5a27a...@mail.gmail.com, Bernard Li bern...@vanhpc.org wrote: Hi Brad: On Fri, May 7, 2010 at 7:37 AM, Brad Nicholes bnicho...@novell.com wrote: The primary place in the code where the value type and format come

Re: [Ganglia-general] Problem to include Plugin

2010-04-27 Thread Brad Nicholes
On 4/27/2010 at 3:20 AM, in message 1272360012.4619.9.ca...@station3.hq, Patrick Datko patrick.da...@ymc.ch wrote: Hey People, i'm using Ganglia 3.1.2, installed with aptitude, to observe my cluster and it works without any problem. I wanted to integrate a metric which control the traffic of

Re: [Ganglia-general] Problem with custom metrics

2010-04-12 Thread Brad Nicholes
Actually Bernard is the guru here. thanks Bernard :) On 4/12/2010 at 12:20 PM, in message c7e8dca8.7895%hugo.hernan...@nih.gov, Hernandez, Hugo (NIH/NIAID) [C] hugo.hernan...@nih.gov wrote: Brad, Those changes did the trick. Thanks a lot! Now, I can explore my new metrics to be added.

Re: [Ganglia-general] [Ganglia-developers] Ganglia 3.1.7 ready for testing

2010-03-02 Thread Brad Nicholes
On 3/2/2010 at 4:23 AM, in message 4b8cf534.7090...@pocock.com.au, Daniel Pocock dan...@pocock.com.au wrote: Thanks to those who provided feedback - any objections to making 3.1.7 generally available? I would like to make it GA within the next 1-2 days now. +1 Michael Perzl wrote: I

Re: [Ganglia-general] gmetad and RDD problem

2010-02-10 Thread Brad Nicholes
On 2/10/2010 at 1:36 AM, in message 70933b58740d5049a7ab96254a66683301a0e...@yaca.intra.cea.fr, GOGUEY-MUETHON Nicolas OSIATIS nicolas.goguey-muet...@cea.fr wrote: Hello , I have lot of log with error like this: Feb 10 09:29:52 SERVEUR /usr/sbin/gmetad[22332]: RRD_update

Re: [Ganglia-general] [Ganglia-developers] Ganglia 3.1.5 beta ready for final testing

2009-12-02 Thread Brad Nicholes
On 12/2/2009 at 7:21 AM, in message 4b1677e4.8000...@pocock.com.au, Daniel Pocock dan...@pocock.com.au wrote: I would like gmond to return a non-zero return code if it fails to initialise, e.g. if it is unable to bind or if it is unable to resolve a hostname mentioned in gmond.conf

Re: [Ganglia-general] [Ganglia-developers] 3.1.4 to go GA?

2009-11-20 Thread Brad Nicholes
On 11/20/2009 at 8:07 AM, in message 4b06b0af.1050...@pocock.com.au, Daniel Pocock dan...@pocock.com.au wrote: Brad Nicholes wrote: I've been running it on a very small set of machines. It all looks good to me. No complaints from anyone... is that sufficient to go live? I'm

Re: [Ganglia-general] [Ganglia-developers] 3.1.4 to go GA?

2009-11-18 Thread Brad Nicholes
I've been running it on a very small set of machines. It all looks good to me. Brad On 11/18/2009 at 9:42 AM, in message d4c731da0911180842x74ecc2c3p2f440e9c521d7...@mail.gmail.com, Bernard Li bern...@vanhpc.org wrote: I haven't had a chance to test it out yet -- has anybody else been able

Re: [Ganglia-general] Ganglia cannot find a data source.

2009-11-17 Thread Brad Nicholes
On 11/17/2009 at 10:04 AM, in message b1eec58d0911170904r2f2613ads9244341a82b85...@mail.gmail.com, Ryan Robertson 89esp...@gmail.com wrote: I too have been bangin my head on this for a few weeks. After much googling i cannot seem to find the answer, so i hope someone (developer maybe) can

Re: [Ganglia-general] Ganglia cannot find a data source.

2009-11-17 Thread Brad Nicholes
Sounds to me like it could be a file permissions problems then. Is your apache server able to access the rrd files and/or port 8652? On 11/17/2009 at 1:00 PM, in message 0016e64c2536e598710478969...@google.com, 89esp...@gmail.com wrote: Ahh yes, i knew there was one other telnet snippet

Re: [Ganglia-general] Ganglia install instructions wiki link broken

2009-11-16 Thread Brad Nicholes
On 11/12/2009 at 8:57 AM, in message 4AFC3066.521 : 172 : 26400, Brad Nicholes wrote: On 11/12/2009 at 6:12 AM, in message f7b2d28a-290a-4142-8f13-6034d55c2...@beforedawnsolutions.com, John Martyniak j...@beforedawnsolutions.com wrote: First off is that the best way to install Ganglia

Re: [Ganglia-general] XML errors: XML_ParseBuffer() error at line 272: not well-formed(invalid token)

2009-11-13 Thread Brad Nicholes
On 11/12/2009 at 8:11 PM, in message b791204d0911121911t5628f609s88f339567d104...@mail.gmail.com, chifeng chif...@gmail.com wrote: Hi folks, I got a XML errors in Ganglia v3.1.2. It looks like this ticket: http://www.mail-archive.com/ganglia-general@lists.sourceforge.net/msg05054.htm l

Re: [Ganglia-general] special metric names in diskusage.pyconf file

2009-10-22 Thread Brad Nicholes
...@mail.gmail.com, Matt mattmora...@gmail.com wrote: Is there any solution to this? It would be really beneficial to work out the metrics we want to publish in the python code rather than supplying them up front in the pyconf file. 2009/7/15 Brad Nicholes bnicho...@novell.com: On 7/14/2009 at 4

Re: [Ganglia-general] Ganglia 3.1.3 beta ready for testing

2009-10-01 Thread Brad Nicholes
On 10/1/2009 at 4:33 PM, in message d4c731da0910011533p2d337d0ajc80ea158d2a7...@mail.gmail.com, Bernard Li bern...@vanhpc.org wrote: So has anybody else given 3.1.3 a test run? I have found some minor issues. It looks like there are new configure options added in regards to setuid and

Re: [Ganglia-general] How to remove a gmetric-added metric?

2009-09-02 Thread Brad Nicholes
No, but you should be able to get the same results by setting host_dmax in the gmond.conf file. Brad On 9/2/2009 at 1:37 AM, in message 68fea9390909020037y2094d15es42bd13da3ea0...@mail.gmail.com, Matt mattmora...@gmail.com wrote: Is there such a thing as dmax in the python interface?

Re: [Ganglia-general] python metric modules

2009-09-02 Thread Brad Nicholes
Why can't you just do the following: # webserver.pyconf modules { module { name = lsof language = python param httpd { value = doesnt-matter } param crawler { value = doesnt-matter } } } collection_group { collect_every = 30 time_threshold = 60

Re: [Ganglia-general] issues with upgrade to 3.1

2009-08-27 Thread Brad Nicholes
As Bernard mentioned, take a look at the upgrade release notes Please see the section Upgrading from 3.0 in the 3.1.x release notes: http://sourceforge.net/apps/trac/ganglia/wiki/ganglia_release_notes you can't mix gmond 3.0.x and gmond 3.1.x in the same cluster. All of the gmond nodes

Re: [Ganglia-general] ganglia returns wrong value for python module

2009-08-20 Thread Brad Nicholes
On 8/20/2009 at 9:01 AM, in message 68fea9390908200801m3e1f43ecy2c33e743ccc0d...@mail.gmail.com, Matt mattmora...@gmail.com wrote: Hi all, I'm getting inconsistent results when gmond is running my python module # gmond --version gmond 3.1.2 # ./lsof.py (0, '2699') 2699 2700 1000

Re: [Ganglia-general] special metric names in diskusage.pyconf file

2009-07-14 Thread Brad Nicholes
On 7/14/2009 at 4:36 PM, in message 4120cbd6bbd82647b89d6a70694510bed1c...@exchange02.presidio.alexa.com, Guolin Cheng guo...@alexa.com wrote: Hi, Any one knows what the metric name disk_used-metric-name stands for? The stanza is from diskusage.pyconf file, ganglia version 3.1.1/2.

Re: [Ganglia-general] gmond returning XML with large negative TNvalues(ganglia 3.1.2, linux x86_64)

2009-07-13 Thread Brad Nicholes
On 7/13/2009 at 1:06 AM, in message d9c3f61a0907130006q5cdf7d8fg85ed8ea7f7ea3...@mail.gmail.com, Pavel Shevaev pacha.shev...@gmail.com wrote: Hi folks, Looks like gmetad ignores reports from gmond returning records with large negative TN values. gmond started to behave like that after the

Re: [Ganglia-general] No rrd file being created for metrics

2009-06-30 Thread Brad Nicholes
You mentioned the udp_send_channel configuration but did you set up the udp_recv_channel? Gmond has to be able to listen to itself as well as everybody else in order to collect the metrics that will be reported to gmetad. Brad On 6/30/2009 at 12:46 AM, in message

Re: [Ganglia-general] Gmond strange metric TN too large issue ormulticast metric lost?

2009-06-24 Thread Brad Nicholes
The TN value is simple indicating the time offset from the reported timestamp that the metric was last received from the managed node. In other words it is the age of the metric. A large number would indicated that the metric value has not been updated for a long period of time. This might be

Re: [Ganglia-general] metric_cleanup not being called in my pythonmodule

2009-05-27 Thread Brad Nicholes
is fine. I'm not sure where to find a built RPM. I have always just built it myself. You want a debug version of APR. Gmond doesn't use apr-util. Brad On Tue, May 26, 2009 at 8:35 AM, Brad Nicholes bnicho...@novell.com wrote: On 5/24/2009 at 12:43 AM, in message

Re: [Ganglia-general] metric_cleanup not being called in my pythonmodule

2009-05-26 Thread Brad Nicholes
On 5/24/2009 at 12:43 AM, in message dcccdf790905232343y76481e5dw6c1df62bc732c...@mail.gmail.com, David Birdsong david.birds...@gmail.com wrote: I have a python module that spawns a separate thread that collects data off of a pipe. Everything runs fine, but I'm finding that metric_cleanup is

Re: [Ganglia-general] CVE-2009-0241

2009-03-10 Thread Brad Nicholes
On 3/10/2009 at 1:14 PM, in message m3fxhlchqn@unna.nsc.liu.se, Leif Nixon ni...@nsc.liu.se wrote: Linkoping University The issue has been there for a while. See the associated bug report. Also since it is an issue with the interactive port, the attacker would have to have access to

[Ganglia-general] [ANNOUNCEMENT] - Release Ganglia 3.1.2

2009-02-17 Thread Brad Nicholes
The Ganglia Project (http://ganglia.info) is pleased to announce the official release of Ganglia 3.1.2 The official tarball is available for immediate download at: http://sourceforge.net/project/showfiles.php?group_id=43021package_id=35280release_id=661845 For a full description of the bug

Re: [Ganglia-general] [ANNOUNCEMENT] Ganglia 3.1.2 testing tarball...

2009-02-09 Thread Brad Nicholes
that it is the latter. :) If anybody has anything to report (good or bad), please send a quick email to the list. thanks, Brad On 1/30/2009 at 8:18 AM, in message 4982b7ef02ac0003a...@lucius.provo.novell.com, Brad Nicholes bnicho...@novell.com wrote: In an effort to continue improving

Re: [Ganglia-general] ganglia upgrade

2009-01-20 Thread Brad Nicholes
On 1/20/2009 at 10:48 AM, in message 32128a4489900844a3dba3e8273be22e1348965...@in01wxmbx1.internal.synopsys.com, Hardik Shah hardik.s...@synopsys.com wrote: Hi, Does anyone has any information on upgrade on ganglia cluster? I have configured around 200 machines with ganglia 3.0.7 but now I

Re: [Ganglia-general] custom metric's value doesn't update --custom python metric modules on 3.1.1

2008-12-12 Thread Brad Nicholes
I haven't tried to actually run your module yet, but can this be a permissions problem. What user are you running gmond as? Does that user have permissions to run rndc and access named.stats? All modules run by gmond will be run as the same user as gmond. Therefore you have to make sure

Re: [Ganglia-general] Monitor Apache

2008-12-11 Thread Brad Nicholes
On 12/11/2008 at 11:33 AM, in message 49415cf1.1010...@greenberg.org, Ed Greenberg e...@greenberg.org wrote: Michael Henderson wrote: Hello all, Is there a way to monitor apache through ganglia? Thanks, ~Mike I'm interested in seeing what others say but... I rolled my own as

[Ganglia-general] Spoofing functionality in 3.1.x branch...

2008-12-04 Thread Brad Nicholes
For those that are interested in the module based spoofing feature, all of the functionality should be complete and has been backported to the 3.1.x branch. I have also added some spoofing module examples to trunk that can be downloaded from

[Ganglia-general] Testing BETA 3.1.x available...

2008-12-04 Thread Brad Nicholes
There is a new BETA tarball and RPMs on the Ganglia testing site (http://www.ganglia.info/testing/). The following includes a list of enhancements and bug fixes that are currently available in this testing BETA release. * gmond/gmetad: Sync-up the default values for the cluster section

Re: [Ganglia-general] nonresponsive gmond

2008-12-01 Thread Brad Nicholes
On 11/29/2008 at 11:54 AM, in message [EMAIL PROTECTED], Kostas Georgiou [EMAIL PROTECTED] wrote: On Tue, Nov 04, 2008 at 10:02:32AM -0700, Brad Nicholes wrote: On 11/3/2008 at 5:27 PM, in message [EMAIL PROTECTED], Kostas Georgiou [EMAIL PROTECTED] wrote: On Mon, Nov 03, 2008 at 11:46

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-26 Thread Brad Nicholes
On 11/26/2008 at 3:45 AM, in message [EMAIL PROTECTED], Martin Knoblauch [EMAIL PROTECTED] wrote: - Original Message From: Brad Nicholes [EMAIL PROTECTED] To: Ofer Inbar [EMAIL PROTECTED] Cc: ganglia-general@lists.sourceforge.net Sent: Tuesday, November 25, 2008 8:43:08 PM

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-26 Thread Brad Nicholes
On 11/26/2008 at 1:17 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Tue, Nov 25, 2008 at 04:33:05PM -0700, Brad Nicholes wrote: The result was that if the wildcard produced more than 10 included files (which it easily does even in our default

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-25 Thread Brad Nicholes
On 11/25/2008 at 1:08 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Mon, Nov 24, 2008 at 04:55:42PM -0700, Brad Nicholes wrote: On 11/24/2008 at 3:47 PM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: I tried feeding one of my

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-25 Thread Brad Nicholes
On 11/25/2008 at 10:14 AM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: Brad Nicholes [EMAIL PROTECTED] wrote: It needs a temp directory to get around some issues with libconfuse. Libconfuse doesn't actually support wildcard paths or files. A libconfuse include statement

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-25 Thread Brad Nicholes
On 11/25/2008 at 10:14 AM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: Brad Nicholes [EMAIL PROTECTED] wrote: It needs a temp directory to get around some issues with libconfuse. Libconfuse doesn't actually support wildcard paths or files. A libconfuse include statement

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-24 Thread Brad Nicholes
On 11/21/2008 at 9:33 PM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: One of our servers encountered an I/O error that put its root filesystem into read only mode. Both /var and /tmp are on that filesystem, so all logging stopped and most everything stopped. However,

Re: [Ganglia-general] gmetric fails when disk is unwriteable?

2008-11-24 Thread Brad Nicholes
, which is an obvious thing to do but unfortunately I didn't think of it at the time of the failure (it was late at night). However, Brad knows the answer: Brad Nicholes [EMAIL PROTECTED] wrote: Both gmetric and gmond read the same .conf file. If the .conf file has an include() statement

Re: [Ganglia-general] High system load when gmond is running

2008-11-13 Thread Brad Nicholes
On 11/13/2008 at 4:08 PM, in message [EMAIL PROTECTED], [EMAIL PROTECTED] wrote: I looked into it further and it looks like my problem isn't gmond its gmetad. If I just have gmond running without gmetad the system load is normal but as soon as I start gmetad the load starts to go up. I ran

Re: [Ganglia-general] cluster graphing stops entirely after mastergmond restart

2008-11-11 Thread Brad Nicholes
On 11/10/2008 at 6:11 PM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: Brad Nicholes [EMAIL PROTECTED] wrote: The reason why is because with the introduction of the modular metric functionality, metric metadata is now passed between gmonds rather than it being hardcoded

Re: [Ganglia-general] cluster graphing stops entirely after mastergmond restart

2008-11-10 Thread Brad Nicholes
On 11/10/2008 at 3:26 PM, in message [EMAIL PROTECTED], Brad Fino [EMAIL PROTECTED] wrote: If I restart gmond on the master node that a cluster reports to, the entire cluster stops graphing entirely. Some nodes in the cluster start graphing immediately after a node gmond restart, and some do

Re: [Ganglia-general] nonresponsive gmond

2008-11-04 Thread Brad Nicholes
On 11/3/2008 at 5:27 PM, in message [EMAIL PROTECTED], Kostas Georgiou [EMAIL PROTECTED] wrote: On Mon, Nov 03, 2008 at 11:46:52PM +, Kostas Georgiou wrote: On Mon, Nov 03, 2008 at 01:55:22PM -0700, Brad Nicholes wrote: If a timeout is set, then is the resulting XML output still

Re: [Ganglia-general] ganglia vs. top: running processes

2008-10-28 Thread Brad Nicholes
On 10/24/2008 at 7:57 AM, in message [EMAIL PROTECTED], Ofer Inbar [EMAIL PROTECTED] wrote: Recently we noticed something we don't know the explanation for, on a CentOS4 for system running gmond 3.1.0: The Ganglia graph shows a line for running processes that sometimes spikes to 10, 20, or

Re: [Ganglia-general] About python module for gmond (multiple metricsper one single call back? dynamic pyconf?)

2008-10-22 Thread Brad Nicholes
On 10/20/2008 at 11:19 PM, in message [EMAIL PROTECTED], utopia zh [EMAIL PROTECTED] wrote: Hi, I'm recently working on the gmond python mode. I found that for some metrics, it will be beneficial if we can return multiple metric values in the single callback. For example, if we want to

Re: [Ganglia-general] metric name and spoofed metrics

2008-10-06 Thread Brad Nicholes
On 10/3/2008 at 12:23 PM, in message [EMAIL PROTECTED], Martin Hicks [EMAIL PROTECTED] wrote: (sorry if this is a duplicate. I sent it yesterday but I haven't seen it come back yet, nor has it shown up in the mailing list archives on sourceforge) Hi, I backported the spoofing patches

Re: [Ganglia-general] metric name and spoofed metrics

2008-10-06 Thread Brad Nicholes
On 10/6/2008 at 2:23 PM, in message [EMAIL PROTECTED], Martin Hicks [EMAIL PROTECTED] wrote: On Mon, Oct 06, 2008 at 11:11:51AM -0600, Brad Nicholes wrote: On 10/3/2008 at 12:23 PM, in message Am I expected to deal with each SPOOF_HOST when a call_back occurs for a particular metric

Re: [Ganglia-general] mod_python on Solaris not scanning directory

2008-09-25 Thread Brad Nicholes
On 9/25/2008 at 6:08 AM, in message [EMAIL PROTECTED], Gilad Raphaelli [EMAIL PROTECTED] wrote: - Original Message From: Brad Nicholes [EMAIL PROTECTED] To: Lieting Yu [EMAIL PROTECTED]; ganglia-general@lists.sourceforge.net; Gilad Raphaelli [EMAIL PROTECTED] Sent

Re: [Ganglia-general] mod_python on Solaris not scanning directory

2008-09-24 Thread Brad Nicholes
On 9/23/2008 at 7:03 PM, in message [EMAIL PROTECTED], Gilad Raphaelli [EMAIL PROTECTED] wrote: Lieting, I believe I ran into the same issue and cleared it up with this patch to mod_python.c: --- mod_python.c.orig 2008-09-24 10:52:17.0 +1000 +++ mod_python.c

Re: [Ganglia-general] [Ganglia-developers] can't get cpu_num toshow for whole cluster

2008-09-15 Thread Brad Nicholes
On 9/12/2008 at 11:48 AM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi all: On Fri, Sep 12, 2008 at 10:00 AM, Ofer Inbar [EMAIL PROTECTED] wrote: I added a host to an existing cluster, and noticed the total number of CPU cores for the cluster fluctuate, so I tried

[Ganglia-general] [ANNOUNCEMENT] Official release of Ganglia 3.1.1

2008-09-09 Thread Brad Nicholes
The Ganglia Project (http://ganglia.info) is pleased to announce the official release of Ganglia 3.1.1 The official tarball is available for immediate download at: http://sourceforge.net/project/showfiles.php?group_id=43021package_id=35280release_id=625044 For a full description of the bug

Re: [Ganglia-general] gmetad bug when gmond host hangs

2008-09-02 Thread Brad Nicholes
On 9/1/2008 at 3:35 PM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Sat, Aug 30, 2008 at 10:09:02AM -0600, Brad Nicholes wrote: On 8/30/2008 at 12:25 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Fri, Aug

Re: [Ganglia-general] gmetad bug when gmond host hangs

2008-08-30 Thread Brad Nicholes
On 8/30/2008 at 12:25 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Fri, Aug 29, 2008 at 02:40:00PM -0400, Ofer Inbar wrote: Should this have made it into 3.1, or 3.1.1? It doesn't look like it. There is a fix in trunk now with r1738 and unless

[Ganglia-general] [ANNOUNCEMENT] Ganglia 3.1.1 tarball ready for testing...

2008-08-25 Thread Brad Nicholes
In an effort to continue improving the Ganglia software, the Ganglia Project has released an official testing release of Ganglia 3.1.1. The testing tarball is available for immediate download at: http://www.ganglia.info/testing/ The intent of this testing release of Ganglia 3.1.1 is to

Re: [Ganglia-general] Interrupted system call error shown by gmond -d 9

2008-08-15 Thread Brad Nicholes
On 8/15/2008 at 4:11 PM, in message [EMAIL PROTECTED], Sid Stuart [EMAIL PROTECTED] wrote: Hi, Has anyone else seen this error when running gmond in debug mode (gmond -d 9)? loaded module: python_module udp_recv_channel mcast_join=239.2.11.82 mcast_if=NULL port=8649 bind= 239.2.11.82

Re: [Ganglia-general] Patch - no_extra_data

2008-08-14 Thread Brad Nicholes
On Thu, Aug 14, 2008 at 3:33 PM, in message [EMAIL PROTECTED], Doug Nordwall [EMAIL PROTECTED] wrote: Here's a patch for ganglia. it allows the no_extra_data option to be added to the config file. when this is set to yes, it will not send any EXTRA_DATA or EXTRA_ELEMENTS in the xml.

Re: [Ganglia-general] Debugging Gmond Python Metric Module

2008-08-13 Thread Brad Nicholes
On 8/13/2008 at 10:42 AM, in message [EMAIL PROTECTED], Sid Stuart [EMAIL PROTECTED] wrote: After fixing a tabbing bug in your cacheHits() function, everything loaded fine and the callback function was called as it should be. The callback didn't actually work on my system, but that is a

Re: [Ganglia-general] Debugging Gmond Python Metric Module

2008-08-12 Thread Brad Nicholes
On 8/12/2008 at 3:03 PM, in message [EMAIL PROTECTED], Sid Stuart [EMAIL PROTECTED] wrote: Hi, I have written a small Python metric module that contains one metric, CacheHits. When the module is included in the configuration, gmond spits out the following error message, Unable to collect

Re: [Ganglia-general] Debugging Gmond Python Metric Module

2008-08-12 Thread Brad Nicholes
On 8/12/2008 at 3:03 PM, in message [EMAIL PROTECTED], Sid Stuart [EMAIL PROTECTED] wrote: Hi, I have written a small Python metric module that contains one metric, CacheHits. When the module is included in the configuration, gmond spits out the following error message, Unable to collect

Re: [Ganglia-general] [ANNOUNCEMENT] Official release of Ganglia 3.1.0

2008-07-31 Thread Brad Nicholes
that helped get Ganglia 3.1.0 out the door. Brad On 7/30/2008 at 2:42 PM, in message [EMAIL PROTECTED], Brad Nicholes [EMAIL PROTECTED] wrote: The Ganglia Project (http://ganglia.info) is pleased to announce the first official release of Ganglia 3.1.0 The official tarball is available

[Ganglia-general] [ANNOUNCEMENT] Official release of Ganglia 3.1.0

2008-07-30 Thread Brad Nicholes
The Ganglia Project (http://ganglia.info) is pleased to announce the first official release of Ganglia 3.1.0 The official tarball is available for immediate download at: http://sourceforge.net/project/showfiles.php?group_id=43021package_id=35280release_id=616721 Please refer to

Re: [Ganglia-general] [Ganglia-developers] [ANNOUNCEMENT] Ganglia 3.1.0 tarball ready fortesting...

2008-07-29 Thread Brad Nicholes
the 3.1.0 tarball as first official release of the Ganglia 3.1.x series. Comments? Votes? Brad On 7/15/2008 at 1:57 PM, in message [EMAIL PROTECTED], Brad Nicholes [EMAIL PROTECTED] wrote: The Ganglia Project is pleased to announce the first official testing release of Ganglia 3.1.x

Re: [Ganglia-general] [Ganglia-developers] [ANNOUNCEMENT]Ganglia 3.1.0 tarball ready fortesting...

2008-07-29 Thread Brad Nicholes
On 7/29/2008 at 10:06 AM, in message [EMAIL PROTECTED], Marc Van Kerkhoven1 [EMAIL PROTECTED] wrote: Hi Brad, One minor bug would be that the gmetrics link is no longer visible in the host view. Not sure if this is because I have done anything wrong, but it's pretty much a vanilla

Re: [Ganglia-general] [Ganglia-developers] [ANNOUNCEMENT]Ganglia 3.1.0 tarball ready fortesting...

2008-07-29 Thread Brad Nicholes
On 7/29/2008 at 11:18 AM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi Brad: On Tue, Jul 29, 2008 at 9:27 AM, Brad Nicholes [EMAIL PROTECTED] wrote: Thanks for installing and testing the 3.1 testing tarball. Removing the gmetric link from the host view

[Ganglia-general] Current testing cycle for Ganglia 3.1 release...

2008-07-21 Thread Brad Nicholes
Just as a reminder, there is currently a testing release of Ganglia 3.1 available for immediate testing and feedback. This testing release is available at: http://www.ganglia.info/testing/ Please see the previous announcement for more information. http://www.mail-archive.com/[EMAIL

Re: [Ganglia-general] gmetad giving high TN values

2008-06-25 Thread Brad Nicholes
On 6/25/2008 at 12:13 PM, in message [EMAIL PROTECTED], Kirk McDonald [EMAIL PROTECTED] wrote: I have a gmetad which probes a number of gmonds, and each gmond has a number of hosts associated with it. When I scrape the XML from each of the gmonds probed by gmetad myself, the TN value for each

Re: [Ganglia-general] gmetad giving high TN values

2008-06-25 Thread Brad Nicholes
On 6/25/2008 at 1:18 PM, in message [EMAIL PROTECTED], Kirk McDonald [EMAIL PROTECTED] wrote: On Wed, Jun 25, 2008 at 11:48 AM, Brad Nicholes [EMAIL PROTECTED] wrote: On 6/25/2008 at 12:13 PM, in message [EMAIL PROTECTED], Kirk McDonald [EMAIL PROTECTED] wrote: I have a gmetad which probes

Re: [Ganglia-general] Compiling gmond c module

2008-06-24 Thread Brad Nicholes
You will need to figure out where the u_short conflict is coming from. My first guess would be to use gcc rather than g++. Brad On 6/7/2008 at 8:35 PM, in message [EMAIL PROTECTED], Fábio Firmo [EMAIL PROTECTED] wrote: Hi everyone, I'm about to introduce Ganglia in a project to take care

Re: [Ganglia-general] SPOOF option

2008-06-18 Thread Brad Nicholes
On 6/18/2008 at 7:39 AM, in message [EMAIL PROTECTED], LINDA DOBAI [EMAIL PROTECTED] wrote: Hi, These days I was testing the 3.1.0.1399 release of Ganglia , most of all the Python modules feature. I managed to plug several metrics into Ganglia using the new feature and it works very

Re: [Ganglia-general] UUID status?

2008-06-13 Thread Brad Nicholes
On 6/13/2008 at 1:08 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Tue, Jun 10, 2008 at 04:59:57PM -0600, Brad Nicholes wrote: it can be solved using Ganglia 3.1 and the new gmetad-python rewrite. does it mean that you are planning on adding

Re: [Ganglia-general] UUID status?

2008-06-10 Thread Brad Nicholes
On 6/10/2008 at 4:45 PM, in message [EMAIL PROTECTED], Michael Place [EMAIL PROTECTED] wrote: Hi all, The ganglia wish list at http://ganglia.wiki.sourceforge.net/ganglia_wish-list lists the following gmetad todo: * Name RRD directories based on UUID generated by client gmond Can

Re: [Ganglia-general] Ganglia 3.1.0 package

2008-06-10 Thread Brad Nicholes
On 6/10/2008 at 11:17 AM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi Stephen: On Tue, Jun 10, 2008 at 10:07 AM, Big Woobie [EMAIL PROTECTED] wrote: I'm running Redhat Linux and IBM's AIX. AIX I can't help you (I think Ulf is trying to get that working). It

Re: [Ganglia-general] Plugging Metrics in 3.1.0 release

2008-06-06 Thread Brad Nicholes
On 6/6/2008 at 2:28 AM, in message [EMAIL PROTECTED], LINDA DOBAI [EMAIL PROTECTED] wrote: Thank you very much for your responses. As OS I am using Linux RedHat 5 32bits. As Ganglia version, I installed the last version that I found at the following URL:

Re: [Ganglia-general] Plugging Metrics in 3.1.0 release

2008-06-05 Thread Brad Nicholes
On 6/5/2008 at 9:20 AM, in message [EMAIL PROTECTED], LINDA DOBAI [EMAIL PROTECTED] wrote: Dear Ganglia community: I am a beginner in Ganglia. I have just started an internship of four months and my subject is related to Ganglia. I have to test the new highlights of release 3.1.0. I

Re: [Ganglia-general] Plugging Metrics in 3.1.0 release

2008-06-05 Thread Brad Nicholes
On 6/5/2008 at 2:15 PM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: On Thu, Jun 5, 2008 at 12:32 PM, Brad Nicholes [EMAIL PROTECTED] wrote: Which OS are you building for and what were the ./configure options that you used? This will help to determine why mod_python

Re: [Ganglia-general] Plugging Metrics in 3.1.0 release

2008-06-05 Thread Brad Nicholes
On 6/5/2008 at 4:48 PM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi Brad: On Thu, Jun 5, 2008 at 3:23 PM, Brad Nicholes [EMAIL PROTECTED] wrote: Done. Hopefully this will be a good guide for somebody that is getting started with Python modules. We still need

Re: [Ganglia-general] Plugging Metrics in 3.1.0 release

2008-06-05 Thread Brad Nicholes
On 6/5/2008 at 5:30 PM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi Brad: On Thu, Jun 5, 2008 at 4:25 PM, Brad Nicholes [EMAIL PROTECTED] wrote: The modules sections is one of those things that is becoming more important as functionality grows. Initially

Re: [Ganglia-general] gmetad.conf for large number of nodes

2008-05-28 Thread Brad Nicholes
On 5/27/2008 at 8:47 PM, in message [EMAIL PROTECTED], randy [EMAIL PROTECTED] wrote: Brad Nicholes wrote: Is there a reason why you would want to list all 120 nodes in the data_source directive of gmetad? When you list multiple modes in a data_source directive, it does not mean

Re: [Ganglia-general] gmetad.conf for large number of nodes

2008-05-27 Thread Brad Nicholes
On 5/26/2008 at 7:18 AM, in message [EMAIL PROTECTED], randy [EMAIL PROTECTED] wrote: I'm trying to configure ganglia (3.0.7) to monitor 120 nodes. It works fine if I just enter a small number of nodes as data_source in the gmetad.conf file, just like all the documentation shows. But if I

Re: [Ganglia-general] Migrating existing RRD's to a new server;

2008-05-21 Thread Brad Nicholes
This looks like a useful script. Can we add it to the contrib area in the Ganglia repository? Brad On 5/21/2008 at 9:51 AM, in message [EMAIL PROTECTED], Jason A. Smith [EMAIL PROTECTED] wrote: A few years ago I had put a script on ganglia's bugzilla that modifies the rrd files to do a few

Re: [Ganglia-general] strange problem - gmond on headnode reportsdifferent data than sources

2008-05-14 Thread Brad Nicholes
On 5/13/2008 at 11:34 PM, in message [EMAIL PROTECTED], Jeremy LaTrasse [EMAIL PROTECTED] wrote: I changed our configs over to unicast, which as seemingly eliminated most of our problems, except one egregious one, and the log files are still being filled with illegal attempt to update

Re: [Ganglia-general] Ganglia reported wrong OS...

2008-05-12 Thread Brad Nicholes
On 5/12/2008 at 9:09 AM, in message [EMAIL PROTECTED], Tom Pierce [EMAIL PROTECTED] wrote: Dear Ganglia Users, I upgraded some cluster nodes, from 32 bit OS RHEL4 to 64 bit RHEL5.1, but the ganglia node monitor still seems to remember that the old node (with the same name) was x86

[Ganglia-general] Time to produce a 3.1 beta

2008-05-01 Thread Brad Nicholes
The list has quieted down over the last week or so since we release the 3.1 snapshot. This either means that people are busy testing the 3.1 snapshot and haven't had time to respond yet or that things are good and there just isn't much to report. The STATUS file contains one back port

Re: [Ganglia-general] multiple gmetads polling single gmond

2008-04-25 Thread Brad Nicholes
Gmond is single threaded. However Gmetad is not when it produces the XML dump. Would it be possible for you to use the Gmetad port rather than hitting Gmond directly? If you hit the Gmetad interactive port you could request data for any of your individual clusters from your script.

[Ganglia-general] Platform experts needed (was:Re: [Ganglia-developers] Ganglia 3.1.x stable branch has been created...)

2008-04-18 Thread Brad Nicholes
So here is another request to all you platform experts out there. The Ganglia project will be rolling alpha tarballs of the Ganglia 3.1 version. If the tarball does not work on your platform, please fix it and submit a patch back to the project. Ganglia 3.0.x already works on a variety of

Re: [Ganglia-general] Fwd: [Beowulf] Performance metrics reporting

2008-04-11 Thread Brad Nicholes
On 4/11/2008 at 1:53 PM, in message [EMAIL PROTECTED], Witham, Timothy D [EMAIL PROTECTED] wrote: So I'd like to ask the Ganglia community -- do you guys find Ganglia to be a resource hog? No. But once I had a couple hundred gmetad processes on a 2GB server. When the size of active

Re: [Ganglia-general] Fwd: [Beowulf] Performance metrics reporting

2008-04-11 Thread Brad Nicholes
On 4/11/2008 at 4:09 PM, in message [EMAIL PROTECTED], Bernard Li [EMAIL PROTECTED] wrote: Hi Brad: On Fri, Apr 11, 2008 at 3:04 PM, Brad Nicholes [EMAIL PROTECTED] wrote: I agree that the size of the XML could be reduced in most cases, however it would be impractical to define

Re: [Ganglia-general] [Ganglia-developers] Time to create the 3.1.x stable branch...

2008-03-17 Thread Brad Nicholes
On 3/13/2008 at 3:46 PM, in message [EMAIL PROTECTED], Brad Nicholes [EMAIL PROTECTED] wrote: On 3/13/2008 at 2:16 PM, in message [EMAIL PROTECTED], Jesse Becker [EMAIL PROTECTED] wrote: On Thu, Mar 13, 2008 at 3:42 PM, Brad Nicholes [EMAIL PROTECTED] wrote: I think that with the removal

Re: [Ganglia-general] [Ganglia-developers] Time to create the 3.1.x stable branch...

2008-03-15 Thread Brad Nicholes
On 3/14/2008 at 1:35 AM, in message [EMAIL PROTECTED], Carlo Marcelo Arenas Belon [EMAIL PROTECTED] wrote: On Thu, Mar 13, 2008 at 01:42:05PM -0600, Brad Nicholes wrote: I think that with the removal of the srclib directory from the SVN trunk repository, we have completed everything

[Ganglia-general] Time to create the 3.1.x stable branch...

2008-03-13 Thread Brad Nicholes
I think that with the removal of the srclib directory from the SVN trunk repository, we have completed everything that we thought needed to be done before creating the 3.1.x stable branch. The only other thing that I know of is testing to make sure that an older 3.0.x gmetad can consume the

Re: [Ganglia-general] additional info about fsock open error

2008-02-15 Thread Brad Nicholes
On 2/15/2008 at 9:34 AM, in message [EMAIL PROTECTED], Mike Olson [EMAIL PROTECTED] wrote: Just an FYI, I have the ports 8649 to 8652 forwarded on my router to my Apache web server. I have looked at the file on line 283 and I don't know what part of that line is creating the error. The line

Re: [Ganglia-general] new property of a host

2008-01-31 Thread Brad Nicholes
On 1/31/2008 at 4:20 PM, in message [EMAIL PROTECTED], Doug Nordwall [EMAIL PROTECTED] wrote: For reference, here is a current HOST line of XML HOST NAME=mybox.local IP=10.1.1.1 REPORTED=1201820930 TN=7 TMAX=20 DMAX=0 LOCATION=unspecified GMOND_STARTED=1200935314 We're looking at making

Re: [Ganglia-general] scaling_max_freq error

2008-01-28 Thread Brad Nicholes
On 1/25/2008 at 7:21 PM, in message [EMAIL PROTECTED], Jesse Becker [EMAIL PROTECTED] wrote: On Jan 25, 2008 9:06 PM, Bernard Li [EMAIL PROTECTED] wrote: Hi Jesse: On 1/25/08, Jesse Becker [EMAIL PROTECTED] wrote: Interesting. How about introducing a new metric: cpu_speed_current.

  1   2   >