Ahh yes, aggregating data in different ways after the fact. We had a need to do that, and also a need to provide more than one cluster heirachy (e.g. clusters grouped by region, but also clusters grouped by technology owner (say)).
I have written some perl code to do this - sucking the data out of defined clusters and manually calling rrdtool update for different aggregate views. Doing it is a little tricky I must say, at least for me. The other thing that is a bit disappointing is that if you extract the data from some time range, if the finest grain data does not go back that far, it will use the coarser grained data for the full extract - even in the timeframe where there is finer data. The other step is to get a ganglia instance to understand enough to display this other rollup data. Your choices include faking up appropriate XML on port 8652 to convince a 2nd gmetad instance to display the data, or hacking a 2nd copy of the php tree and replace the code that asks gmetad for data with file based data (say). The perl code is attached for, but this is only for interest. It is too horrible to be usable by others. best regards, richard -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Alex Balk Sent: 09 June 2006 20:32 To: Bernard Li Cc: Stackpole, Chris; [email protected] Subject: Re: [Ganglia-general] Ganglia Alert and Tracking Bernard Li wrote: >> I am trying to write a script that pulls the info from netcat >> and averages out some numbers but I believe that there is a >> easier way. Does ganglia store data in such a way that I >> could pull this type of information? This appears so useful >> to me that I am sure that there are others that have tried >> this, are there any ideas and suggestions? >> > > Sorry for hijacking your thread Chris but your question leads me to > think that there are some interesting data stored in the RRD database, > perhaps we could write a script to mine this data and provide some > interesting historical reports? > Actually, my patch for "custom graphs" accomplishes exactly what you're talking about. It allows you to create a template and then load it for whatever view (meta, cluster, host) you desire. Couple this with gmetrics and you can pretty much generate a graph for anything (read - visually represent any aspect of your data). It also supports rrdtool's CDEFs, so you can do data transformations as well. Oh, and the rendering backend may be called from within an <IMG SRC=...> which allows creating "customized dashboards". I've started working on one where customers can view different utilizations graphs based on the cluster specialty (batch, interactive, infrastructure), NFS statistics, parallel job utilization (how much does process named X consume across multiple hosts), etc. What I'm really missing is a method to "generate" aggregate data on the fly. Something like "take these 3 hosts, all from different clusters, and show me their aggregate CPU consumption". Cheers, Alex _______________________________________________ Ganglia-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/ganglia-general ------------------------------------------------------------------------ For more information about Barclays Capital, please visit our web site at http://www.barcap.com. Internet communications are not secure and therefore the Barclays Group does not accept legal responsibility for the contents of this message. Although the Barclays Group operates anti-virus programmes, it does not accept responsibility for any damage whatsoever that is caused by viruses being passed. Any views or opinions presented are solely those of the author and do not necessarily represent those of the Barclays Group. Replies to this email may be monitored by the Barclays Group for operational or business reasons. ------------------------------------------------------------------------
rollup
Description: rollup

