Well, we don't see these spikes in the other monitoring software (e.g.
nmon) we've run on the same nodes.

I'd prefer to prevent them from occurring than try to "remove" them.

I'd like to understand how gmond gathers the bytes_in/out stats.  As far
as I can tell, the packets_in/out stats don't suffer from the same
problem.

David Wong
Senior Systems Engineer
Management Dynamics, Inc.
Phone: 201-804-6127
[EMAIL PROTECTED]

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of
Vladimir Vuksan
Sent: Wednesday, March 28, 2007 10:15 AM
To: [email protected]
Subject: Re: [Ganglia-general] Help! I have a petabyte/s network

You may want to run one of the removespikes.pl or killspikes scripts. I 
have seen this before especially with MRTG when e.g. a firewall is 
restarted or similar. You need to act quickly before the Petabytes gets 
into the averages.

Another note, before you run removespikes.pl make sure you back up all 
your RRDs.

Good luck,

Vladimir

David Wong wrote:
> Ganglia is reporting that I'm pushing up to 200 Petabytes/s through my
> network.  Nobody tell the network admin!
>
> I'm running Ganglia 3.0.4 with the Power5 add-ons on AIX5.3
>
> Bytes in and out statistics generally appear to have the right values.
> However at random times, I get spikes in the petabytes/s range.
>
> Here's a dump of the bytes_in database.  At first, I suspected perhaps
> these coincide with some counters getting reset, but they don't occur
at
> regular intervals.
>
>                         <!-- 2007-03-27 20:42:00 GMT / 1175028120 -->
> <row><v> 1.9268390706e+05 </v></row>
>                         <!-- 2007-03-27 20:48:00 GMT / 1175028480 -->
> <row><v> 1.5833184975e+05 </v></row>
>                         <!-- 2007-03-27 20:54:00 GMT / 1175028840 -->
> <row><v> 1.6838302753e+05 </v></row>
>                         <!-- 2007-03-27 21:00:00 GMT / 1175029200 -->
> <row><v> 1.3766069592e+05 </v></row>
>                         <!-- 2007-03-27 21:06:00 GMT / 1175029560 -->
> <row><v> 2.1711888414e+05 </v></row>
>                         <!-- 2007-03-27 21:12:00 GMT / 1175029920 -->
> <row><v> 4.9959709273e+16 </v></row>
>                         <!-- 2007-03-27 21:18:00 GMT / 1175030280 -->
> <row><v> 1.7401339783e+05 </v></row>
>                         <!-- 2007-03-27 21:24:00 GMT / 1175030640 -->
> <row><v> 2.0955720861e+05 </v></row>
>                         <!-- 2007-03-27 21:30:00 GMT / 1175031000 -->
> <row><v> 1.9032255300e+05 </v></row>
>                         <!-- 2007-03-27 21:36:00 GMT / 1175031360 -->
> <row><v> 1.9162727036e+05 </v></row>
>                         <!-- 2007-03-27 21:42:00 GMT / 1175031720 -->
> <row><v> 1.2703790825e+05 </v></row>
>
> Can anyone shed light on what might be happening?  Any pointers for
> debugging?
>
> David Wong
> Senior Systems Engineer
> Management Dynamics, Inc.
> Phone: 201-804-6127
> [EMAIL PROTECTED]
>
>
>
>
------------------------------------------------------------------------
-
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to
share your
> opinions on IT & business topics through brief surveys-and earn cash
>
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDE
V
> _______________________________________________
> Ganglia-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>   


------------------------------------------------------------------------
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share
your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDE
V
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general



Reply via email to