Hi dev zero :)
We use Ganglia to monitor our AIX boxes (AIX 5.3 TL 11) and saw the
same differences in CPU usage/idle between the output of vmstat and
Ganglia.
First, you can try the AIX RPM packages from this site [
http://www.perzl.org/ganglia/ ]. I have not yet tested the latest
version (3.1.2); maybe it will fix the problem.
In my case I replaced the default gmond metrics with values from vmstat.
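A minimal sketch of how the CPU columns could be pulled out of vmstat output, assuming the usual AIX layout in which the header row ends with the columns "us sy id wa" (possibly followed by extra columns on shared LPARs). The function name parse_vmstat_cpu and the returned key names are hypothetical, chosen only for illustration; this is not the exact script used here.

```python
def parse_vmstat_cpu(output):
    """Return the CPU percentages from the last data row of vmstat output.

    Assumes a header row containing the column names 'us', 'sy', 'id', 'wa'
    in that order, followed by one or more purely numeric data rows.
    """
    header = None
    last_row = None
    for line in output.splitlines():
        tokens = line.split()
        if "us" in tokens and "id" in tokens:
            # Column-name row, e.g. " r  b  avm  fre ... us sy id wa"
            header = tokens
        elif header is not None and tokens and all(
            t.replace(".", "", 1).isdigit() for t in tokens
        ):
            # Numeric sample row; keep the most recent one
            last_row = tokens
    if header is None or last_row is None:
        raise ValueError("no vmstat header or data row found")

    # 'sy' appears twice (syscalls and system CPU), so locate the CPU block
    # through 'us', which only appears once.
    start = header.index("us")
    names = ("user", "system", "idle", "wait")
    return {name: float(last_row[start + i]) for i, name in enumerate(names)}
```

The resulting values could then be published as custom metrics, for example with Ganglia's gmetric tool; how they are fed back to gmond is left open here.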
Regards,
Ron
On 20.10.2010 18:15, dev Zero wrote:
Have noticed that the metrics.c file has been coded using the
perfstat_cpu_total system call, and was wondering about the reasoning
in using the perfstat_cpu_total CPU stats in preference to those
produced via the perfstat_partition_total system call?
The reason I ask is that there seems to be a wide level of variance
between the CPU User and CPU Idle metrics produced by both system calls.
If someone understands the difference between the CPU metrics provided
by these two system calls, and why they would provide considerably
different CPU User and Idle metrics on a 2 CPU AIX 5.3 LPAR with no
sharing of CPUs involved, I would be very interested to hear from you.
The standard AIX performance tools (vmstat, sar, topas, nmon) all seem
to be using the metrics provided by perfstat_partition_total. This
means (in my case) that Ganglia's CPU User and CPU Idle graphs do not
match what is shown to be happening on that LPAR with the standard AIX
performance tools.
(i.e. Ganglia graphs show LPARs with 30%-ish idle time, but vmstat etc.
show no idle time).
Documentation on the perfstat_cpu_total system call seems to indicate
that the perfstat_partition_total system call should be used instead.
http://publib.boulder.ibm.com/infocenter/aix/v6r1/index.jsp?topic=/com.ibm.aix.basetechref/doc/basetrf1/perfstat_cputot.htm
Quote : All data consumers requiring processor use must be modified to
support PURR-based computations as shown in the example for the
perfstat_partition_total interface under Perfstat API programming.
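As a rough illustration of what a PURR-based computation looks like, here is a sketch of the arithmetic only, under stated assumptions: two snapshots of the PURR tick counters are taken some interval apart, the partition is dedicated (no CPU sharing), and the dictionary keys puser, psys, pidle and pwait stand in for the correspondingly named counters of perfstat_partition_total. The function name is hypothetical and this is not the IBM example itself.

```python
def purr_cpu_percentages(prev, curr):
    """Compute CPU percentages from two snapshots of PURR tick counters.

    prev and curr are dicts with the keys 'puser', 'psys', 'pidle', 'pwait'
    (assumed to hold cumulative PURR ticks). For a dedicated partition each
    percentage is taken relative to the total PURR ticks in the interval.
    """
    fields = ("puser", "psys", "pidle", "pwait")
    deltas = {f: curr[f] - prev[f] for f in fields}
    total = sum(deltas.values())
    if total <= 0:
        raise ValueError("no PURR ticks elapsed between snapshots")
    return {f: 100.0 * deltas[f] / total for f in fields}
```

The same delta-and-normalise arithmetic applied to the tick counters of perfstat_cpu_total (user, sys, idle, wait) would give the clock-tick-based figures, which is one way to compare the two sources side by side on the same LPAR.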
While I'm confident that there is a good reason for the use of
perfstat_cpu_total in the aix/metrics.c source, I would like to
understand the reasoning and so banish any nervousness in trusting the
stats provided by the Ganglia product.
Appreciate any information shedding light on this.
Thanks and Best Regards to all.
PS. My hat is off to the Ganglia Team/Community. :-) Very impressive
product.
------------------------------------------------------------------------------
Nokia and AT&T present the 2010 Calling All Innovators-North America contest
Create new apps& games for the Nokia N8 for consumers in U.S. and Canada
$10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing
Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store
http://p.sf.net/sfu/nokia-dev2dev
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general