Re: [Ganglia-general] [ Ganglia General ] -- Monitoring severals networks in a single Cluster

2017-03-31 Thread sobolev6
di 29 Mars 2017 14:55:16 Objet: [Ganglia-general] [ Ganglia General ] -- Monitoring severals networks in a single Cluster Hello everybody, I am a brend new guy in Ganglia. I have to monitoring a cluster of 20 nodes plus a master node. This single cluster has three networks. The first is

[Ganglia-general] [ Ganglia General ] -- Monitoring severals networks in a single Cluster

2017-03-29 Thread sobolev6
Hello everybody, I am a brend new guy in Ganglia. I have to monitoring a cluster of 20 nodes plus a master node. This single cluster has three networks. The first is 1 Go, the second 10 Go, and the third is Infiniband. In gmetad, the IP adresses of nodes corresponds to the 1 G.O . In g

[Ganglia-general] Monitoring Linux services

2016-12-16 Thread Peter Phaal
Hi All, For anyone interesting in monitoring Linux services, the latest Host sFlow release can automatically track and monitor services running under systemd: http://blog.sflow.com/2016/12/monitoring-linux-services.html Ganglia already includes support for the sFlow metrics: http://blog.sflow.com

Re: [Ganglia-general] Monitoring CTX switches and memory fragmentation

2015-05-05 Thread Vladimir Vuksan
Indeed it's in 3.7.1 Vladimir On 05/05/2015 11:24 AM, Martin Knoblauch wrote: > is the CTX stuff already in a released version? I may need to tell > the end customer to upgrade. -- One dashboard for servers and applic

Re: [Ganglia-general] Monitoring CTX switches and memory fragmentation

2015-05-05 Thread Martin Knoblauch
Hi Vladimir, is the CTX stuff already in a released version? I may need to tell the end customer to upgrade. Cheers Martin On Tue, May 5, 2015 at 4:12 PM, Vladimir Vuksan wrote: > I have wrote one for memory fragmentation. You can find it here > > > https://github.com/ganglia/gmond_python_mo

Re: [Ganglia-general] Monitoring CTX switches and memory fragmentation

2015-05-05 Thread Vladimir Vuksan
I have wrote one for memory fragmentation. You can find it here https://github.com/ganglia/gmond_python_modules/tree/master/system/mem_fragmentation Context stuff is now in the monitor-core master https://github.com/ganglia/monitor-core/blob/master/gmond/

[Ganglia-general] Monitoring CTX switches and memory fragmentation

2015-05-04 Thread Martin Knoblauch
Hi friends, short question: does Ganglia provide monitor agents for context switches and "memory fragmentation" (e.g. listing contents of /proc/buddyinfo)? I want to avoid double work, should they exist officially? Cheers Martin -- -- Martin K

Re: [Ganglia-general] Monitoring IBM LSF Platform and GPFS

2012-12-12 Thread Waleed Harbi
Paul, That's really interesting, I appreciated your efforts if you can share it. -- Best Wishes, Waleed Harbi Dream | Do | Be On Wed, Dec 12, 2012 at 7:09 PM, Paul Hewlett wrote: > Hi Waleed > > I recently wrote a python LSF module for my last contract. It reported >

Re: [Ganglia-general] Monitoring IBM LSF Platform and GPFS

2012-12-12 Thread Paul Hewlett
Hi Waleed I recently wrote a python LSF module for my last contract. It reported metrics on the jobs submitted to LSF as opposed to monitoring LSF itself (sbatchd,lim,res etc). Is this what you want? If so I could ask if the module could be made available Regards Paul On 11 December 2012

Re: [Ganglia-general] Monitoring IBM LSF Platform and GPFS

2012-12-11 Thread Waleed Harbi
I am looking for performance tuning for GPFS and LSF hosts, even if there are more functionality available that will be great. Both of them they are big product but I am looking for performance functions. -- Best Wishes, Waleed Harbi Dream | Do | Be On Tue, Dec 11, 2012

Re: [Ganglia-general] Monitoring IBM LSF Platform and GPFS

2012-12-11 Thread Vladimir Vuksan
What are you looking to monitor ? Queue sizes ? Vladimir On Tue, 11 Dec 2012, Waleed Harbi wrote: Hello,I am looking for ganglia gmetric to monitoring IBM LSF Platform and GPFS. I hihgily appracited your advice if have any comment. I cannot find it under https://github.com/ganglia/gmetric. -

[Ganglia-general] Monitoring IBM LSF Platform and GPFS

2012-12-11 Thread Waleed Harbi
Hello, I am looking for ganglia gmetric to monitoring IBM LSF Platform and GPFS. I hihgily appracited your advice if have any comment. I cannot find it under https://github.com/ganglia/gmetric. -- Best Wishes, Waleed Harbi Dream | Do | Be ---

Re: [Ganglia-general] Monitoring with Ganglia book from O'Reilly

2012-11-26 Thread Frederiko Costa
I was thinking about asking O'Reilly about it. I would like to get a printed copy. Has anyone already asked? On Mon, Nov 26, 2012 at 3:44 PM, Dave Josephsen wrote: > I don't suppose the contributing authors can get a complimentary print > copy from O'Reilly? apress or pren-hall would totally ho

Re: [Ganglia-general] Monitoring with Ganglia book from O'Reilly

2012-11-26 Thread Dave Josephsen
I don't suppose the contributing authors can get a complimentary print copy from O'Reilly? apress or pren-hall would totally hook us up ;-) - Original Message - > Monitoring with Ganglia book is out from O'Reilly. Sorry for the late > notice but you can get 50% off the Ebook today > > h

[Ganglia-general] Monitoring with Ganglia book from O'Reilly

2012-11-26 Thread Vladimir Vuksan
Monitoring with Ganglia book is out from O'Reilly. Sorry for the late notice but you can get 50% off the Ebook today http://shop.oreilly.com/product/0636920025573.do All the royalties go directly to http://www.scholarshipamerica.org Vladimir ---

Re: [Ganglia-general] Monitoring processes

2012-07-26 Thread Paul Hewlett
-- Message: 5 Date: Wed, 25 Jul 2012 11:30:27 -0500 From: Douglas Wagner Subject: Re: [Ganglia-general] Modifying ganglia. To: ganglia-general@lists.sourceforge.net Message-ID: Content-Type: text/plain; charset="iso-8859-1" On Wed, Jul 25, 2012 at 2:22 AM, ka

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-05-03 Thread Jesse Becker
From: Jesse Becker [haw...@gmail.com] > Sent: Monday, May 02, 2011 3:02 PM > To: Mostafa Ismail > Cc: ganglia-general@lists.sourceforge.net; Bernard Li > Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia > > Try running this: >  qstat -u '*' > > Yes, yo

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-05-02 Thread Mostafa Ismail
0/16/161.62 lx24-amd64 > ~ > ~ > ~ > [root@sge01 tmp]# > > What does it mean? > > Thanks, > Mostafa Ismail > > -Original Message- > From: Jesse Becker [mailto:haw...@gmail.com] > Sent: Tuesday, April 19, 2011 7:17 PM > To: Bernard L

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-05-02 Thread Jesse Becker
    lx24-amd64 > ~ > ~ > ~ > [root@sge01 tmp]# > > What does it mean? > > Thanks, > Mostafa Ismail > > -Original Message- > From: Jesse Becker [mailto:haw...@gmail.com] > Sent: Tuesday, April 19, 2011 7:17 PM > To: Bernard Li > Cc: Mostafa Ismail; gang

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-05-02 Thread Mostafa Ismail
7:17 PM To: Bernard Li Cc: Mostafa Ismail; ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia Yeah, pretty close to the same file. I'll post update both the collector and php file later on. On Tue, Apr 19, 2011 at 13:10, Bernard Li wrote:

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Jesse Becker
t;>> >>> Thanks, >>> Mostafa ismail >>> >>> -Original Message- >>> From: Jesse Becker [mailto:haw...@gmail.com] >>> Sent: Tuesday, April 19, 2011 3:39 PM >>> To: Mostafa Ismail >>> Cc: ganglia-general@lists

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Bernard Li
t; >> Thanks, >> Mostafa ismail >> >> -Original Message- >> From: Jesse Becker [mailto:haw...@gmail.com] >> Sent: Tuesday, April 19, 2011 3:39 PM >> To: Mostafa Ismail >> Cc: ganglia-general@lists.sourceforge.net >> Subject: Re: [Gangli

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Mostafa Ismail
here's any documentation which can I follow, then get back if I have issues Thanks, Mostafa Ismail -Original Message- From: Jesse Becker [mailto:haw...@gmail.com] Sent: Tuesday, April 19, 2011 4:01 PM To: Mostafa Ismail Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Jesse Becker
April 19, 2011 3:39 PM > To: Mostafa Ismail > Cc: ganglia-general@lists.sourceforge.net > Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia > > On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail > wrote: >> Hello, >> >> >> >> Is it possible t

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Mostafa Ismail
[mailto:haw...@gmail.com] Sent: Tuesday, April 19, 2011 3:39 PM To: Mostafa Ismail Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail wrote: > Hello, > > > > Is it possible to moni

Re: [Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Jesse Becker
On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail wrote: > Hello, > > > > Is it possible to monitor the SGE queues (such as all.q) using ganglia? I > did search at “Ganglia-general” forum and I found no match. Yes, it is possible. You need to do two things: 1) collect the metrics from SGE. 2) graph

[Ganglia-general] Monitoring SGE queues using Ganglia

2011-04-19 Thread Mostafa Ismail
Hello, Is it possible to monitor the SGE queues (such as all.q) using ganglia? I did search at "Ganglia-general" forum and I found no match. Your response is highly appreciated. Thanks, Mostafa Ismail -- Benefiting fro

[Ganglia-general] Monitoring switches, "Gmond started" field.

2010-06-02 Thread Miguel A.
Hi I'm monitoring a switch through gmetrics. When I want to view the "Time and String Metrics", appears the "Gmond Started", "Uptime" and others variables with default values. Searching in several files such as (host_view.php,ganglia.php,functions.php ...) I achieved to fill the "Uptime" using "

Re: [Ganglia-general] Monitoring

2009-11-17 Thread chifeng
try this command #gstat --all -i a_hostname_in_cluster Chifeng On Tue, Nov 17, 2009 at 11:02 PM, John Martyniak < j...@beforedawnsolutions.com> wrote: > Ok. > > I just ran a 'gstat --all' > > And only one host comes up, just the localhost. > > So there is something missing. > > any ideas? > > -J

Re: [Ganglia-general] Monitoring

2009-11-17 Thread John Martyniak
Ok. I just ran a 'gstat --all' And only one host comes up, just the localhost. So there is something missing. any ideas? -John On Nov 17, 2009, at 9:22 AM, John Martyniak wrote: > > Hi everyone, > > Ok I got my Ganglia monitor up and working, and it was pulling > results from the localhost

[Ganglia-general] Monitoring

2009-11-17 Thread John Martyniak
Hi everyone, Ok I got my Ganglia monitor up and working, and it was pulling results from the localhost. So I enable the hadoop-metrics.properties and made the appropriate changes so that it pointed at me ganglia box. I made a data_source in the gmetad.conf file, and attached the two test

Re: [Ganglia-general] Monitoring geographically spread applications

2009-07-07 Thread Bernard Li
Hello Nigel: Unfortunately there is currently no way to have different "views" for your monitored resources. So in your example (below), you would probably want to set up a gmetad to aggregate the metrics across ApplicationA (and not by location). You could of course set up as many gmetads aggre

[Ganglia-general] Monitoring geographically spread applications

2009-07-07 Thread nigel . leach
Hi, I’m monitoring what is hopefully a fairly standard compute configuration using Ganglia, and want to take the opportunity of a v3.1 upgrade to rationalise my configuration. I have about ~40,000 cores, in ~10 geographic sites. Currently I also have a bit of a mess of Gmetad’s and WebFrontEnd'

Re: [Ganglia-general] Monitoring NFS share disk usage

2008-10-28 Thread Carlo Marcelo Arenas Belon
On Tue, Oct 28, 2008 at 05:30:25PM +1100, Adam Mitchell wrote: > >#!/bin/bash >VALUE=$(df /home/ | grep /home |awk '{print $3 }') >gmetric --name disk_nfs_used --value $VALUE --type uint32 --units Bytes not relevant for your problem but units here should be "KB" >gmond is running

[Ganglia-general] Monitoring NFS share disk usage

2008-10-27 Thread Adam Mitchell
Hi Everyone, I am new to this list and looking for some help. I have searched the archives for this list and many other corners of the web to no avail. Our user home directories are mounted on the compute nodes via an NFS share on the head node. User data is written to the home directories. W

Re: [Ganglia-general] Monitoring Linux Multipathed Devices

2008-07-11 Thread Craig Simpson
s if you are interested. > > Dan > Sent via BlackBerry by AT&T > > -Original Message- > From: "Craig Simpson" <[EMAIL PROTECTED]> > > Date: Fri, 11 Jul 2008 12:19:39 > To: > Subje

Re: [Ganglia-general] Monitoring Linux Multipathed Devices

2008-07-11 Thread Craig Simpson
Tried mapping asm01 to a raw device, called /dev/raw/asm01, but that doesn't seem to be something I can run iostat against either. I think a real trick for clustered storage is to understand the IO to multipathed devices and graph over time. Trying to gather (and graph my IO multipath aliases (

Re: [Ganglia-general] Monitoring Linux Multipathed Devices

2008-07-11 Thread Ethan Erchinger
Craig Simpson wrote: Does anyone have a method for monitoring Linux Multipathed Devices, created by multipthd and dm? Use udev to create /dev/ names that match your multipath names. On Rhat, a rule in /etc/udev/rules.d and a script in /etc/udev/scripts should be sufficient. http://www.red

[Ganglia-general] Monitoring Linux Multipathed Devices

2008-07-10 Thread Craig Simpson
Does anyone have a method for monitoring Linux Multipathed Devices, created by multipthd and dm? An iostat will just show the DM and not the actual alias. Would like to monitor IO via the Multipath alias name. Example would be: >From /etc/multipath.conf asm01 is created: multipath {

[Ganglia-general] monitoring and notification

2008-07-01 Thread David B. Ritch
Thanks! I'll take a look at GroundWorks. Looks like the consensus is to use Nagios, possibly with some additional products, for event monitoring and notification. David - Sponsored by: SourceForge.net Community Choice Awa

Re: [Ganglia-general] monitoring a HA cluster

2007-10-23 Thread richard grevis
Alex, oh dear, it looks like I answered the wrong question *again*. As I don't have test access to a running ganglia someone else should answer. But part of it may be to - - configure gmetad.conf to poll the failover VIP IP or DNS name, not the physical ones. - Configure each server in the fai

Re: [Ganglia-general] monitoring a HA cluster

2007-10-22 Thread richard grevis
Alex, They are the only 2 members of the cluster? How about this: - The gmond.conf on host A is configured unicast and to send data to the *physical* address (not the VIP) of Host B. Do not configure gmond.conf to send data to itself. The only UDP send channel is to host B - Configure the

[Ganglia-general] monitoring a HA cluster

2007-10-22 Thread alex
Second post today, separate topic... I've got a few machines set up as active/passive clusters running heartbeat/drbd. I am currently monitoring them with ganglia, but I think the information I'm getting leads to a misleading picture. Since both machines are monitored, it looks like I have 8

Re: [Ganglia-general] Monitoring one process

2006-10-15 Thread Vitaly Karasik
- > From: [EMAIL PROTECTED] [mailto:ganglia- > [EMAIL PROTECTED] On Behalf Of João Oliveira > Sent: Friday, October 13, 2006 3:24 PM > To: ganglia-general@lists.sourceforge.net > Subject: [Ganglia-general] Monitoring one process > > Hi all, > > i was reading the docum

Re: [Ganglia-general] Monitoring one process

2006-10-13 Thread Marcelo Veiga Neves
hi, I created this add-on. It allows you to collect metrics of one specific process using Ganglia. http://www-usr.inf.ufsm.br/~veiga/gappmon/ (Portuguese only) []'s -veiga On 10/13/06, João Oliveira <[EMAIL PROTECTED]> wrote: Hi all, i was reading the documentation's FAQ when i read about me

Re: [Ganglia-general] Monitoring one process

2006-10-13 Thread Alex Balk
You may monitor whatever you like through the use of the gmetric command. João Oliveira wrote: > Hi all, > > i was reading the documentation's FAQ when i read about metrics that > Ganglia supports. Well, i read all of them trying to understand each > but i couldn't find the one that interests

[Ganglia-general] Monitoring one process

2006-10-13 Thread João Oliveira
Hi all, i was reading the documentation's FAQ when i read about metrics that Ganglia supports. Well, i read all of them trying to understand each but i couldn't find the one that interests me the most, monitoring processes individually. So, can i collect CPU usage time of one specific process us

Re: [Ganglia-general] monitoring

2006-08-25 Thread Martin Knoblauch
Nagios? Cheers Martin --- Dirk Roessler <[EMAIL PROTECTED]> wrote: > Does someone knows an easy to install and easy to use solution for > monitoring and sending email notifications of down nodes and health > state on a Linux HPC cluster? > > Dirk > > begin:vcard > fn;quoted-printable:Dirk R

Re: [Ganglia-general] monitoring

2006-08-24 Thread Vladimir Vuksan
Dirk Roessler wrote: > Does someone knows an easy to install and easy to use solution for > monitoring and sending email notifications of down nodes and health > state on a Linux HPC cluster? You could use Nagios and Ganglia Python client. Basically you use the Ganglia Python client to get metric v

[Ganglia-general] monitoring

2006-08-23 Thread Dirk Roessler
Does someone knows an easy to install and easy to use solution for monitoring and sending email notifications of down nodes and health state on a Linux HPC cluster? Dirk begin:vcard fn;quoted-printable:Dirk R=C3=B6=C3=9Fler n;quoted-printable:R=C3=B6=C3=9Fler;Dirk org:_University of Potsdam;Dep

Re: [Ganglia-general] Monitoring

2002-10-08 Thread matt massie
leif- i've been wanting to have a way to implement an active alerting mechanism for a while. the development team would love some help if you're willing to donate a little time. i have an idea for a quick and smart hack (i think). gmetad is already doing the hardest part of this work. here's

Re: [Ganglia-general] Monitoring

2002-10-08 Thread Leif Nixon
Steven Wagner <[EMAIL PROTECTED]> writes: > Leif Nixon wrote: > > Steven Wagner <[EMAIL PROTECTED]> writes: > > Yes, that's what I did last week. It ain't no fun. Nagios' handling > > of passive service checks isn't flexible enough. And passive host > > checking Just Isn't Done. > > Once again, c

Re: [Ganglia-general] Monitoring

2002-10-07 Thread Steven Wagner
Leif Nixon wrote: Steven Wagner <[EMAIL PROTECTED]> writes: Yes, that's what I did last week. It ain't no fun. Nagios' handling of passive service checks isn't flexible enough. And passive host checking Just Isn't Done. Once again, considering you have the source at your disposal, I'm sure you

Re: [Ganglia-general] Monitoring

2002-10-07 Thread Leif Nixon
Steven Wagner <[EMAIL PROTECTED]> writes: > And, of course, the direction you're probably already going in - > writing an app in Perl (or Python or Java or C or C++ or Pascal or > Prolog or Pilot or COBOL or ... ) to connect to gmetad, parse the > output, and then fire off a stream of passive upda

Re: [Ganglia-general] Monitoring

2002-10-04 Thread Steven Wagner
Leif Nixon wrote: So, once you've gotten Ganglia to pull in metrics from gazillions of nodes in umpteen clusters, and got pretty graphs of everything, what do you use for monitoring the values? I mean, when a machine goes down, you don't want just a webpage to be updated, you want something to tr

[Ganglia-general] Monitoring

2002-10-04 Thread Leif Nixon
So, once you've gotten Ganglia to pull in metrics from gazillions of nodes in umpteen clusters, and got pretty graphs of everything, what do you use for monitoring the values? I mean, when a machine goes down, you don't want just a webpage to be updated, you want something to trigger the klaxons.

[Ganglia-general] monitoring core 2.4.0 and Solaris 8/SPARC woe

2002-05-28 Thread Steven Wagner
Just wondering if anyone has (anecdotal or better) evidence of getting the monitoring core working on Solaris 8. I just tried cranking up gmond on a Netra t1 test box - it compiles but dumpes core (Bus error). A little gdb work seems to indicate that it is having malloc problems setuid'ing to