Here is another example from gstat:
 CPUs (Procs/Total) [     1,     5, 15min] [  User,  Nice, System, Idle,
Wio]
    8 (    8/  122) [  4.59,  2.04,  1.35] [  99.8,   0.0,
0.2,-67062349824.0,-67062349824.0] OFF

Looking at the source code for AIX metrics
(https://github.com/ganglia/monitor-core/blob/master/libmetrics/aix/metrics.
c), it appears that negative values should be converted to 0.  This is
either not happening or the metrics are somehow being modified after the
fact.

g_val_t
cpu_wio_func ( void )
{
   g_val_t val;
   
   get_cpuinfo();
   val.f = CALC_CPUINFO(wait);


   if(val.f < 0) val.f = 0.0;
   return val;
}

g_val_t
cpu_idle_func ( void )
{
   g_val_t val;


   get_cpuinfo();
   val.f = CALC_CPUINFO(idle);


   if(val.f < 0) val.f = 0.0;
   return val;
}


From: K. Hansen
Sent: Wednesday, October 02, 2013 4:50 PM
To: ganglia-general@lists.sourceforge.net
Subject: Insane negative values for cpu_idle and cpu_wio when node is CPU
bound

Environment:
AIX 6.1 TL7 SP7
gmond 3.6.0 (from http://www.perzl.org/ganglia/)

I noticed that a particular node would send insanely high negative values
for cpu_idle and cpu_wait metrics when cpu_user + cpu_system were near 100%,
i.e. the node is completely CPU bound.  The result is major skewing of the
node's cpu_idle and cpu_wio graphs so that no true positive values are
visible, and the cpu_report graph for the node, cluster, and grid become
corrupted.

Here is an example of what I am talking about:  http://imgur.com/a/aIzyU

I am able to replicate this behavior on any AIX node by running the
following command to generate CPU load:

perl -e 'while (--$ARGV[0] and fork) {}; while () {}' 8

Where the last digit is the number of threads available to the server.  For
example, if a server has 2 POWER7 vCPU, then it has 8 threads (logical CPU)
due to 4-way simultaneous multithreading (SMT).

Has anyone else experienced this on AIX or Linux?

Thanks!



------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to