Ahh yes, aggregating data in different ways after the fact.
We had a need to do that, and also a need to provide more than one
cluster heirachy (e.g. clusters grouped by region, but also clusters
grouped by technology owner (say)).

I have written some perl code to do this - sucking the data out of
defined
clusters and manually calling rrdtool update for different aggregate
views.
Doing it is a little tricky I must say, at least for me. The other thing
that
is a bit disappointing is that if you extract the data from some time
range,
if the finest grain data does not go back that far, it will use the
coarser grained
data for the full extract - even in the timeframe where there is finer
data.

The other step is to get a ganglia instance to understand enough to
display this
other rollup data. Your choices include faking up appropriate XML on
port 8652
to convince a 2nd gmetad instance to display the data, or hacking a 2nd
copy of the
php tree and replace the code that asks gmetad for data with file based
data (say).

The perl code is attached for, but this is only for interest. It is too
horrible to
be usable by others.

best regards,
richard

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Alex
Balk
Sent: 09 June 2006 20:32
To: Bernard Li
Cc: Stackpole, Chris; [email protected]
Subject: Re: [Ganglia-general] Ganglia Alert and Tracking




Bernard Li wrote:

>> I am trying to write a script that pulls the info from netcat
>> and averages out some numbers but I believe that there is a 
>> easier way. Does ganglia store data in such a way that I 
>> could pull this type of information? This appears so useful 
>> to me that I am sure that there are others that have tried 
>> this, are there any ideas and suggestions?
>>     
>
> Sorry for hijacking your thread Chris but your question leads me to 
> think that there are some interesting data stored in the RRD database,

> perhaps we could write a script to mine this data and provide some 
> interesting historical reports?
>   

Actually, my patch for "custom graphs" accomplishes exactly what you're
talking about. It allows you to create a template and then load it for
whatever view (meta, cluster, host) you desire. Couple this with
gmetrics and you can pretty much generate a graph for anything (read -
visually represent any aspect of your data). It also supports rrdtool's
CDEFs, so you can do data transformations as well. Oh, and the rendering
backend may be called from within an <IMG SRC=...> which allows creating
"customized dashboards". I've started working on one where customers can
view different utilizations graphs based on the cluster specialty
(batch, interactive, infrastructure), NFS statistics, parallel job
utilization (how much does process named X consume across multiple
hosts), etc.


What I'm really missing is a method to "generate" aggregate data on the
fly. Something like "take these 3 hosts, all from different clusters,
and show me their aggregate CPU consumption".

Cheers,
Alex


_______________________________________________
Ganglia-general mailing list [email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

------------------------------------------------------------------------
For more information about Barclays Capital, please visit our web site at 
http://www.barcap.com.

Internet communications are not secure and therefore the Barclays Group does 
not accept legal responsibility for the contents of this message.  Although the 
Barclays Group operates anti-virus programmes, it does not accept 
responsibility for any damage whatsoever that is caused by viruses being 
passed.  Any views or opinions presented are solely those of the author and do 
not necessarily represent those of the Barclays Group.  Replies to this email 
may be monitored by the Barclays Group for operational or business reasons.
------------------------------------------------------------------------

Attachment: rollup
Description: rollup

Reply via email to