Michael Perzl schrieb:
> Hi Markus,
>
> in case you want to save some time, you can get RRDTool compiled for 
> AIX and packaged as RPM from my website at
> http://www.perzl.org/aix/.
>
> The direct link is http://www.perzl.org/aix/index.php?n=Main.Rrdtool.
>
> Regards,
> Michael
>
> Markus Reusch wrote:
>> Aroop Maliakkal schrieb:
>>   
>>> It might be worth checking the disk IO... Disk IO is the main 
>>> constraint on monitoring clusters/grids actually.
>>>
>>> On Dec 12, 2007 5:42 PM, Markus Reusch < [EMAIL PROTECTED] 
>>> <mailto:[EMAIL PROTECTED]>> wrote:
>>>
>>>     Hi list(eners),
>>>
>>>     we are using Ganglia for about 20-30 AIX-Systems. All data is
>>>     collected
>>>     on 2 LPARs which run gmetad, rrdtool and the Apache, so it's
>>>     accessible
>>>     for the users with the supplied PHP web frontend.
>>>     Our rrd-database has currently a size of about 400 MB and consists of
>>>     about 4800 files. I have no idea if this size or number of files is a
>>>     problem at all.
>>>     Every day, several users connect to the web frontend and open those
>>>     graphs, which results in some rrdtool calls, exhausting all the
>>>     available CPU time of the machine. No matter if it's just 4 rrdtool
>>>     processes or 20 of them. The machine is already at it's limit when
>>>     working with 4 rrdtool calls. When about 20 of them are started, a 2
>>>     digit number of kernel threads is being queued...
>>>     When all this occurs, you are not able to type properly in your ssh
>>>     session and even some shell scripts being called by cron are having
>>>     problems and give delayed output. This is no peak behaviour - it's
>>>     constantly like this.
>>>
>>>     The LPAR has AIX 5.3 with 1 * 1,7 MHz CPU and 2 GB RAM. VMM is
>>>     tuned and
>>>     machine is usually not paging at all. Not a strong machine but we
>>>     thought sufficient for Ganglia + rrdtool + Apache.
>>>
>>>     Other applications on the machine are not noticable, as you can
>>>     see (topas):
>>>     Name            PID  CPU%  PgSp Owner
>>>     rrdtool      458924  26.0   0.5 nobody
>>>     rrdtool      966772  25.4   0.4 nobody
>>>     rrdtool     1302770  23.0   3.6 root
>>>     rrdtool     1183858  22.4   0.4 nobody
>>>     gmetad      1110100   8.2   5.3 nobody
>>>     topas       1159188   0.6   1.7 root
>>>     gmond        352448   0.1   1.7 nobody
>>>     java         848106   0.0  16.2 root
>>>     sched         12294   0.0   0.4 root
>>>     clstrmgr     405650   0.0  22.5 root
>>>
>>>
>>>
>>>     A snippet from vmstat to see yourself (currently 4 rrdtools are
>>>     running):
>>>     kthr    memory              page              faults        cpu
>>>     ----- ----------- ------------------------ ------------ -----------
>>>      r  b   avm   fre  re  pi  po  fr   sr  cy  in   sy  cs us sy id wa
>>>      5  1 304078  4075   0   0   0   0    0   0 285 22896 1079 90 10
>>>      0  0
>>>      6  1 303783  4373   0   0   0   0    0   0 249 2038 787 98  2  0  0
>>>      6  0 304004  4145   0   0   0   0    0   0 239 17285 1001 92  8  0  0
>>>
>>>
>>>     Here is a sample process of rrdtool taken with ps:
>>>     /usr/bin/rrdtool graph --start 1197456160 --end 1197459760 --width
>>>     300
>>>     --height 75 --title weight - DEF:sum=/var/lib/ganglia/rrds/PL1650
>>>     Linie
>>>     SRZ/<somehostname>/weight.rrd:sum:AVERAGE
>>>     AREA:sum#0000ff:<somehostname>
>>>     last hour (now -1.00)
>>>
>>>     My questions:
>>>     Is it normal behaviour/configuration, that so many rrdtools are
>>>     started
>>>     when people are doing some requests to the web frontend?
>>>     Do you guys spend a "big" box just running Ganglia + rrdtool +
>>>     Apache or
>>>     maybe just a small Intel/Linux box and it runs smooth?
>>>
>>>     Any suggestions/hints are welcome, thank you in forward.
>>>
>>>     Greetings
>>>     Markus
>>>
>>>     
>>> -------------------------------------------------------------------------
>>>     SF.Net email is sponsored by:
>>>     Check out the new SourceForge.net Marketplace.
>>>     It's the best place to buy or sell services for
>>>     just about anything Open Source.
>>>     http://sourceforge.net/services/buy/index.php
>>>     _______________________________________________
>>>     Ganglia-general mailing list
>>>     [email protected]
>>>     <mailto:[email protected]>
>>>     https://lists.sourceforge.net/lists/listinfo/ganglia-general
>>>     <https://lists.sourceforge.net/lists/listinfo/ganglia-general>
>>>
>>>
>>>
>>>
>>> -- 
>>> ~=~=~=~=~=~=~=~=~=~=~=~=~=~=~=
>>> Regards,
>>> Aroop
>>> ~=~=~=~=~=~=~=~=~=~=~=~=~=~=~= 
>>>     
>> Hi Aroop,
>>
>> thank you for the hint - we don't have a disk issue. I/O wait is about 
>> zero and iostat shows only low traffic on the disks both on the 
>> percentage of time for interval being used and the bandwith and 
>> transfers. They are all low.
>> Meanwhile I found this in the archives:
>> http://sourceforge.net/mailarchive/forum.php?forum_name=ganglia-general&max_rows=25&style=nested&viewmonth=200712
>> So I am trying to compile the latest stable version of rrdtool 1.2.26.
>>
>> Later
>> Markus
>>
>>   

Thanks a lot all,

rrdtool 1.2.26 has been compiled successfully and seems to work 
different now than before, ie. better now. We currently don't have that 
many rrdtool processes open at all - to be honest I just did a query via 
the web frontend and it seems so fast now, that I didn't even see a 
rrdtool process.
On the other hand we have a disk issue now and the machine is now paging 
too but that is ok. 2 more GB of RAM are on the way and we try to get 
Ganglia and rrdtool etc. on other disks outside rootvg.

For the part of the metrics (I am not that deep into Ganglia yet) I only 
know that we have additional custom metrics added to the default metrics 
and are using Michael's AIX Ganglia package.

-------------------------------------------------------------------------
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to