di 29 March 2017 14:55:16
Subject: [Ganglia-general] [ Ganglia General ] -- Monitoring several networks in
a single Cluster
Hello everybody,
I am brand new to Ganglia.
I have to monitor a cluster of 20 nodes plus
a master node.
This single cluster has three networks.
The first is 1 Gb, the second 10 Gb, and the third is InfiniBand.
In gmetad, the IP addresses of the nodes correspond to the 1 Gb network.
In g
Hi All,
For anyone interested in monitoring Linux services, the latest Host sFlow
release can automatically track and monitor services running under systemd:
http://blog.sflow.com/2016/12/monitoring-linux-services.html
Ganglia already includes support for the sFlow metrics:
http://blog.sflow.com
Indeed it's in 3.7.1
Vladimir
On 05/05/2015 11:24 AM, Martin Knoblauch wrote:
> is the CTX stuff already in a released version? I may need to tell
> the end customer to upgrade.
Hi Vladimir,
is the CTX stuff already in a released version? I may need to tell the end
customer to upgrade.
Cheers
Martin
On Tue, May 5, 2015 at 4:12 PM, Vladimir Vuksan wrote:
> I have written one for memory fragmentation. You can find it here
>
>
> https://github.com/ganglia/gmond_python_mo
I have written one for memory
fragmentation. You can find it here
https://github.com/ganglia/gmond_python_modules/tree/master/system/mem_fragmentation
Context stuff is now in the monitor-core master
https://github.com/ganglia/monitor-core/blob/master/gmond/
Hi friends,
short question: does Ganglia provide monitor agents for context switches
and "memory fragmentation" (e.g. listing contents of /proc/buddyinfo)? I
want to avoid duplicating work if they already exist officially.
Cheers
Martin
--
--
Martin K
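For readers wondering what "listing contents of /proc/buddyinfo" could look like as a metric source, here is a minimal Python sketch. It is not the mem_fragmentation module mentioned in this thread. The function names and the sample text are hypothetical, and the line layout ("Node N, zone NAME" followed by one free-block count per order) is assumed from typical Linux kernels.

```python
def parse_buddyinfo(text):
    """Return {(node, zone): [free block count for order 0, 1, 2, ...]}."""
    result = {}
    for line in text.splitlines():
        parts = line.split()
        # Assumed shape: "Node 0, zone DMA 1 1 0 ..."
        if len(parts) < 5 or parts[0] != "Node" or parts[2] != "zone":
            continue
        node = int(parts[1].rstrip(","))
        zone = parts[3]
        result[(node, zone)] = [int(n) for n in parts[4:]]
    return result


def free_pages(counts):
    """Total free pages: a free block of order k spans 2**k pages."""
    return sum(count << order for order, count in enumerate(counts))


# Hypothetical sample; on a real host read open("/proc/buddyinfo").read()
SAMPLE = """\
Node 0, zone      DMA      1      1      0      0      2      1      1      0      1      1      3
Node 0, zone    DMA32     10      5      2      0      0      0      0      0      0      0      0
"""

if __name__ == "__main__":
    for (node, zone), counts in parse_buddyinfo(SAMPLE).items():
        print(node, zone, free_pages(counts))
```

Many small blocks and few large ones in a zone suggest fragmentation; which per-order counts to publish as metrics is a design choice left open here.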
Paul,
That's really interesting. I would appreciate it if you could share it.
--
Best Wishes,
Waleed Harbi
Dream | Do | Be
On Wed, Dec 12, 2012 at 7:09 PM, Paul Hewlett wrote:
> Hi Waleed
>
> I recently wrote a python LSF module for my last contract. It reported
>
Hi Waleed
I recently wrote a python LSF module for my last contract. It reported
metrics on the jobs submitted to LSF as opposed to
monitoring LSF itself (sbatchd,lim,res etc).
Is this what you want?
If so I could ask if the module could be made available
Regards
Paul
On 11 December 2012
I am looking for performance tuning for GPFS and LSF hosts; if more
functionality is available, that would be great. Both of them are big
products, but I am looking for the performance functions.
--
Best Wishes,
Waleed Harbi
Dream | Do | Be
On Tue, Dec 11, 2012
What are you looking to monitor ? Queue sizes ?
Vladimir
On Tue, 11 Dec 2012, Waleed Harbi wrote:
Hello, I am looking for a Ganglia gmetric to monitor IBM LSF Platform and GPFS.
I would highly appreciate
your advice or any comments. I cannot find one
under https://github.com/ganglia/gmetric.
-
Hello,
I am looking for a Ganglia gmetric to monitor IBM LSF Platform and GPFS. I
would highly appreciate your advice or any comments. I cannot find one under
https://github.com/ganglia/gmetric.
--
Best Wishes,
Waleed Harbi
Dream | Do | Be
---
I was thinking about asking O'Reilly about it. I would like to get a
printed copy. Has anyone already asked?
On Mon, Nov 26, 2012 at 3:44 PM, Dave Josephsen wrote:
> I don't suppose the contributing authors can get a complimentary print
> copy from O'Reilly? apress or pren-hall would totally ho
I don't suppose the contributing authors can get a complimentary print copy
from O'Reilly? apress or pren-hall would totally hook us up ;-)
- Original Message -
> Monitoring with Ganglia book is out from O'Reilly. Sorry for the late
> notice but you can get 50% off the Ebook today
>
> h
Monitoring with Ganglia book is out from O'Reilly. Sorry for the late
notice but you can get 50% off the Ebook today
http://shop.oreilly.com/product/0636920025573.do
All the royalties go directly to
http://www.scholarshipamerica.org
Vladimir
---
--
Message: 5
Date: Wed, 25 Jul 2012 11:30:27 -0500
From: Douglas Wagner
Subject: Re: [Ganglia-general] Modifying ganglia.
To: ganglia-general@lists.sourceforge.net
Message-ID:
Content-Type: text/plain; charset="iso-8859-1"
On Wed, Jul 25, 2012 at 2:22 AM, ka
From: Jesse Becker [haw...@gmail.com]
> Sent: Monday, May 02, 2011 3:02 PM
> To: Mostafa Ismail
> Cc: ganglia-general@lists.sourceforge.net; Bernard Li
> Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia
>
> Try running this:
> qstat -u '*'
>
> Yes, yo
0/16/161.62 lx24-amd64
> ~
> ~
> ~
> [root@sge01 tmp]#
>
> What does it mean?
>
> Thanks,
> Mostafa Ismail
>
> -Original Message-
> From: Jesse Becker [mailto:haw...@gmail.com]
> Sent: Tuesday, April 19, 2011 7:17 PM
> To: Bernard L
lx24-amd64
> ~
> ~
> ~
> [root@sge01 tmp]#
>
> What does it mean?
>
> Thanks,
> Mostafa Ismail
>
> -Original Message-
> From: Jesse Becker [mailto:haw...@gmail.com]
> Sent: Tuesday, April 19, 2011 7:17 PM
> To: Bernard Li
> Cc: Mostafa Ismail; gang
7:17 PM
To: Bernard Li
Cc: Mostafa Ismail; ganglia-general@lists.sourceforge.net
Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia
Yeah, pretty close to the same file. I'll post updates to both the
collector and PHP file later on.
On Tue, Apr 19, 2011 at 13:10, Bernard Li wrote:
t;>>
>>> Thanks,
>>> Mostafa ismail
>>>
>>> -Original Message-
>>> From: Jesse Becker [mailto:haw...@gmail.com]
>>> Sent: Tuesday, April 19, 2011 3:39 PM
>>> To: Mostafa Ismail
>>> Cc: ganglia-general@lists
>> Thanks,
>> Mostafa ismail
>>
>> -Original Message-
>> From: Jesse Becker [mailto:haw...@gmail.com]
>> Sent: Tuesday, April 19, 2011 3:39 PM
>> To: Mostafa Ismail
>> Cc: ganglia-general@lists.sourceforge.net
>> Subject: Re: [Gangli
here's any
documentation which I can follow, then get back if I have issues
Thanks,
Mostafa Ismail
-Original Message-
From: Jesse Becker [mailto:haw...@gmail.com]
Sent: Tuesday, April 19, 2011 4:01 PM
To: Mostafa Ismail
Cc: ganglia-general@lists.sourceforge.net
Subject: Re: [Ganglia-
April 19, 2011 3:39 PM
> To: Mostafa Ismail
> Cc: ganglia-general@lists.sourceforge.net
> Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia
>
> On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail
> wrote:
>> Hello,
>>
>>
>>
>> Is it possible t
[mailto:haw...@gmail.com]
Sent: Tuesday, April 19, 2011 3:39 PM
To: Mostafa Ismail
Cc: ganglia-general@lists.sourceforge.net
Subject: Re: [Ganglia-general] Monitoring SGE queues using Ganglia
On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail wrote:
> Hello,
>
>
>
> Is it possible to moni
On Tue, Apr 19, 2011 at 09:25, Mostafa Ismail wrote:
> Hello,
>
>
>
> Is it possible to monitor the SGE queues (such as all.q) using Ganglia? I
> searched the “Ganglia-general” forum and found no match.
Yes, it is possible. You need to do two things:
1) collect the metrics from SGE.
2) graph
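The collection step could be sketched in Python as follows. This is not the collector discussed in this thread; it only counts jobs per state from `qstat -u '*'` style output. The sample text is invented, the column layout (state in the fifth column) is an assumption about typical SGE output, and the function name is hypothetical.

```python
from collections import Counter


def count_job_states(qstat_output):
    """Count jobs per state (e.g. 'r', 'qw') in qstat-style text."""
    states = Counter()
    for line in qstat_output.splitlines():
        parts = line.split()
        # Job lines are assumed to start with a numeric job ID;
        # this skips the header and the dashed separator line.
        if len(parts) < 5 or not parts[0].isdigit():
            continue
        states[parts[4]] += 1  # assumed: 5th column is the job state
    return dict(states)


# Invented sample in the assumed layout
SAMPLE = """\
job-ID  prior   name       user         state submit/start at     queue          slots ja-task-ID
---------------------------------------------------------------------------------------------------
    101 0.55500 sim_a      alice        r     04/19/2011 09:12:01 all.q@node01       1
    102 0.55500 sim_b      bob          r     04/19/2011 09:13:20 all.q@node02       1
    103 0.55500 sim_c      bob          qw    04/19/2011 09:15:44                    1
"""

if __name__ == "__main__":
    for state, n in sorted(count_job_states(SAMPLE).items()):
        print(state, n)
```

Each count could then be published with gmetric under a metric name of your choosing, which covers the first of the two steps.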
Hello,
Is it possible to monitor the SGE queues (such as all.q) using Ganglia? I
searched the "Ganglia-general" forum and found no match.
Your response is highly appreciated.
Thanks,
Mostafa Ismail
Hi
I'm monitoring a switch through gmetrics. When I view the "Time
and String Metrics", the "Gmond Started", "Uptime" and other
variables appear with default values.
Searching through several files such as
(host_view.php,ganglia.php,functions.php ...) I managed to fill in the
"Uptime" using "
try this command
#gstat --all -i a_hostname_in_cluster
Chifeng
On Tue, Nov 17, 2009 at 11:02 PM, John Martyniak <
j...@beforedawnsolutions.com> wrote:
> Ok.
>
> I just ran a 'gstat --all'
>
> And only one host comes up, just the localhost.
>
> So there is something missing.
>
> any ideas?
>
> -J
Ok.
I just ran a 'gstat --all'
And only one host comes up, just the localhost.
So there is something missing.
any ideas?
-John
On Nov 17, 2009, at 9:22 AM, John Martyniak wrote:
>
> Hi everyone,
>
> Ok I got my Ganglia monitor up and working, and it was pulling
> results from the localhost
Hi everyone,
Ok I got my Ganglia monitor up and working, and it was pulling results
from the localhost.
So I enabled the hadoop-metrics.properties and made the appropriate
changes so that it pointed at my ganglia box.
I made a data_source in the gmetad.conf file, and attached the two
test
Hello Nigel:
Unfortunately there is currently no way to have different "views" for
your monitored resources. So in your example (below), you would
probably want to set up a gmetad to aggregate the metrics across
ApplicationA (and not by location). You could of course set up as
many gmetads aggre
Hi, I’m monitoring what is hopefully a fairly standard compute
configuration using Ganglia, and want to take the opportunity of a v3.1
upgrade to rationalise my configuration. I have about 40,000 cores, in
~10 geographic sites. Currently I also have a bit of a mess of Gmetad’s
and WebFrontEnd'
On Tue, Oct 28, 2008 at 05:30:25PM +1100, Adam Mitchell wrote:
>
>#!/bin/bash
>VALUE=$(df /home/ | grep /home |awk '{print $3 }')
>gmetric --name disk_nfs_used --value $VALUE --type uint32 --units Bytes
not relevant for your problem but units here should be "KB"
>gmond is running
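To illustrate the units remark above: df reports usage in 1 KB blocks by default, so the value handed to gmetric is in KB rather than bytes. The Python sketch below only builds the gmetric command line, reusing the flags from the quoted script. The function names and the sample df text are hypothetical, and the column layout assumes `df -P` style output.

```python
def used_kb(df_output, mount):
    """Return the 'Used' column (1 KB blocks) for the given mount point."""
    for line in df_output.splitlines()[1:]:  # skip the header line
        fields = line.split()
        # Count columns from the right, so a filesystem name that
        # contains spaces does not shift the result.
        if len(fields) >= 5 and fields[-1] == mount:
            return int(fields[-4])
    raise ValueError("mount point not found: " + mount)


def gmetric_args(name, kilobytes):
    """Build the argument list for gmetric, labelling the value as KB."""
    return ["gmetric", "--name", name, "--value", str(kilobytes),
            "--type", "uint32", "--units", "KB"]


# Hypothetical sample of `df -P /home` output
SAMPLE = """\
Filesystem         1024-blocks      Used Available Capacity Mounted on
head:/export/home   1032123456 412345678 619777778      40% /home
"""

if __name__ == "__main__":
    print(" ".join(gmetric_args("disk_nfs_used", used_kb(SAMPLE, "/home"))))
```

On a real node the list could be passed to `subprocess.run`. Note that a uint32 value in KB tops out near 4 TB, so a larger share would need a different type.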
Hi Everyone,
I am new to this list and looking for some help. I have searched the
archives for this list and many other corners of the web to no avail.
Our user home directories are mounted on the compute nodes via an NFS
share on the head node. User data is written to the home directories.
W
s if you are interested.
>
> Dan
> Sent via BlackBerry by AT&T
>
> -Original Message-
> From: "Craig Simpson" <[EMAIL PROTECTED]>
>
> Date: Fri, 11 Jul 2008 12:19:39
> To:
> Subje
Tried mapping asm01 to a raw device, called /dev/raw/asm01, but that doesn't
seem to be something I can run iostat against either.
I think a real trick for clustered storage is to understand the IO to
multipathed devices and graph over time.
Trying to gather (and graph my IO multipath aliases (
Craig Simpson wrote:
Does anyone have a method for monitoring Linux Multipathed Devices,
created by multipathd and dm?
Use udev to create /dev/ names that match your multipath names. On
Rhat, a rule in /etc/udev/rules.d and a script in /etc/udev/scripts
should be sufficient.
http://www.red
Does anyone have a method for monitoring Linux Multipathed Devices,
created by multipathd and dm?
An iostat will just show the DM and not the actual alias.
Would like to monitor IO via the Multipath alias name.
Example would be:
From /etc/multipath.conf asm01 is created:
multipath {
Thanks! I'll take a look at GroundWorks.
Looks like the consensus is to use Nagios, possibly with some additional
products, for event monitoring and notification.
David
Alex,
oh dear, it looks like I answered the wrong question *again*.
As I don't have test access to a running ganglia someone else
should answer.
But part of it may be to -
- configure gmetad.conf to poll the failover VIP IP or DNS name,
not the physical ones.
- Configure each server in the fai
Alex,
They are the only 2 members of the cluster?
How about this:
- The gmond.conf on host A is configured unicast and to send
data to the *physical* address (not the VIP) of Host B.
Do not configure gmond.conf to send data to itself.
The only UDP send channel is to host B
- Configure the
Second post today, separate topic...
I've got a few machines set up as active/passive clusters running
heartbeat/drbd. I am currently monitoring them with ganglia, but I
think the information I'm getting leads to a misleading picture.
Since both machines are monitored, it looks like I have 8
-
> From: [EMAIL PROTECTED] [mailto:ganglia-
> [EMAIL PROTECTED] On Behalf Of João Oliveira
> Sent: Friday, October 13, 2006 3:24 PM
> To: ganglia-general@lists.sourceforge.net
> Subject: [Ganglia-general] Monitoring one process
>
> Hi all,
>
> i was reading the docum
hi,
I created this add-on. It allows you to collect metrics of one
specific process using Ganglia.
http://www-usr.inf.ufsm.br/~veiga/gappmon/ (Portuguese only)
[]'s
-veiga
On 10/13/06, João Oliveira <[EMAIL PROTECTED]> wrote:
Hi all,
i was reading the documentation's FAQ when i read about me
You may monitor whatever you like through the use of the gmetric command.
João Oliveira wrote:
> Hi all,
>
> i was reading the documentation's FAQ when i read about metrics that
> Ganglia supports. Well, i read all of them trying to understand each
> but i couldn't find the one that interests
Hi all,
i was reading the documentation's FAQ when i read about metrics that
Ganglia supports. Well, i read all of them trying to understand each
but i couldn't find the one that interests me the most, monitoring
processes individually.
So, can i collect CPU usage time of one specific process us
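One way to get the CPU time of a single process on Linux, sketched in Python under stated assumptions: read /proc/PID/stat and add the user and system tick counters (fields 14 and 15 in proc(5)). This is not the gappmon add-on mentioned in this thread, and the function names and the sample line are hypothetical.

```python
def cpu_ticks(stat_line):
    """Return utime + stime (in clock ticks) from a /proc/PID/stat line."""
    # Field 2 (the command name) is wrapped in parentheses and may contain
    # spaces, so split only the text after the last ')'.
    rest = stat_line[stat_line.rindex(")") + 2:].split()
    # rest[0] is field 3 (state), so fields 14 and 15 are rest[11], rest[12]
    return int(rest[11]) + int(rest[12])


def cpu_seconds(stat_line, ticks_per_second):
    """Convert the tick count to seconds, e.g. with os.sysconf('SC_CLK_TCK')."""
    return cpu_ticks(stat_line) / ticks_per_second


# Hypothetical sample line; on a real host read /proc/<pid>/stat
SAMPLE = "1234 (my app) S 1 1234 1234 0 -1 4194304 500 0 0 0 150 50 0 0 20 0 1 0 100 1000000 200"

if __name__ == "__main__":
    print(cpu_seconds(SAMPLE, 100))
```

Sampling this value periodically and sending the difference through gmetric would give a per-process CPU usage graph. The sampling interval is left to the reader.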
Nagios?
Cheers
Martin
--- Dirk Roessler <[EMAIL PROTECTED]> wrote:
> Does anyone know of an easy-to-install and easy-to-use solution for
> monitoring and sending email notifications of down nodes and health
> state on a Linux HPC cluster?
>
> Dirk
Dirk Roessler wrote:
> Does anyone know of an easy-to-install and easy-to-use solution for
> monitoring and sending email notifications of down nodes and health
> state on a Linux HPC cluster?
You could use Nagios and Ganglia Python client. Basically you use the
Ganglia Python client to get metric v
Does anyone know of an easy-to-install and easy-to-use solution for
monitoring and sending email notifications of down nodes and health
state on a Linux HPC cluster?
Dirk
leif-
i've been wanting to have a way to implement an active alerting mechanism
for a while. the development team would love some help if you're willing
to donate a little time.
i have an idea for a quick and smart hack (i think). gmetad is already
doing the hardest part of this work.
here's
Steven Wagner <[EMAIL PROTECTED]> writes:
> Leif Nixon wrote:
> > Steven Wagner <[EMAIL PROTECTED]> writes:
> > Yes, that's what I did last week. It ain't no fun. Nagios' handling
> > of passive service checks isn't flexible enough. And passive host
> > checking Just Isn't Done.
>
> Once again, c
Leif Nixon wrote:
Steven Wagner <[EMAIL PROTECTED]> writes:
Yes, that's what I did last week. It ain't no fun. Nagios' handling
of passive service checks isn't flexible enough. And passive host
checking Just Isn't Done.
Once again, considering you have the source at your disposal, I'm sure you
Steven Wagner <[EMAIL PROTECTED]> writes:
> And, of course, the direction you're probably already going in -
> writing an app in Perl (or Python or Java or C or C++ or Pascal or
> Prolog or Pilot or COBOL or ... ) to connect to gmetad, parse the
> output, and then fire off a stream of passive upda
Leif Nixon wrote:
So, once you've gotten Ganglia to pull in metrics from gazillions of
nodes in umpteen clusters, and got pretty graphs of everything, what
do you use for monitoring the values? I mean, when a machine goes
down, you don't want just a webpage to be updated, you want something
to tr
So, once you've gotten Ganglia to pull in metrics from gazillions of
nodes in umpteen clusters, and got pretty graphs of everything, what
do you use for monitoring the values? I mean, when a machine goes
down, you don't want just a webpage to be updated, you want something
to trigger the klaxons.
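As a rough illustration of the "connect to gmetad and parse the output" idea raised elsewhere in this thread, the Python sketch below flags hosts whose last report is overdue. It parses an XML string only. The sample document is invented, the function name is hypothetical, and the rule "down when TN exceeds 4 times TMAX" is an assumption borrowed from how the web frontend is commonly described, not something stated in this thread.

```python
import xml.etree.ElementTree as ET


def down_hosts(xml_text, factor=4):
    """Return (cluster, host) pairs whose heartbeat looks overdue."""
    root = ET.fromstring(xml_text)
    down = []
    for cluster in root.iter("CLUSTER"):
        for host in cluster.iter("HOST"):
            tn = int(host.get("TN", 0))      # seconds since last report
            tmax = int(host.get("TMAX", 0))  # expected reporting interval
            if tmax and tn > factor * tmax:
                down.append((cluster.get("NAME"), host.get("NAME")))
    return down


# Invented sample in the shape of gmetad's XML dump
SAMPLE = """\
<GANGLIA_XML VERSION="3.1.7" SOURCE="gmetad">
 <GRID NAME="example">
  <CLUSTER NAME="compute">
   <HOST NAME="node01" TN="12" TMAX="20"/>
   <HOST NAME="node02" TN="312" TMAX="20"/>
  </CLUSTER>
 </GRID>
</GANGLIA_XML>
"""

if __name__ == "__main__":
    for cluster, host in down_hosts(SAMPLE):
        print("DOWN:", cluster, host)
```

In practice the XML would be read from gmetad's XML port (often 8651, but check your gmetad.conf), and the resulting list handed to whatever pages the operator, for example as passive check results for Nagios.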
Just wondering if anyone has (anecdotal or better) evidence of getting
the monitoring core working on Solaris 8. I just tried cranking up
gmond on a Netra t1 test box - it compiles but dumps core (Bus error).
A little gdb work seems to indicate that it is having malloc problems
setuid'ing to
57 matches