Nice,
Still leaves (slow) I/O as a bottleneck, but I'll manage. I'll make the
RRAs a little bigger so that no values are lost while copying, or
something like that ;)
Thanks.
- Ramon.
Matt Massie wrote:
actually, i just updated gmetad to allow custom RRAs to be defined.
i just dropped the code into CVS, so if you use the CVS code (which
will be released as 3.0.1 very soon)... you can specify
RRAs "RRA:AVERAGE:0.5:1:240" \
"RRA:AVERAGE:0.5:24:240" \
"RRA:AVERAGE:0.5:168:240" \
"RRA:AVERAGE:0.5:672:240" \
"RRA:AVERAGE:0.5:5760:370"
in gmetad.conf to alter the round-robin archive format. this was a
simple feature to add and i know it's in high demand ... no sense
waiting until later to add it.
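(for reference, each spec reads RRA:CF:xff:steps:rows. assuming
gmetad's default 15-second polling step, the example above would
retain roughly:

  RRA:AVERAGE:0.5:1:240      240 rows x 15 s   = 1 hour of raw samples
  RRA:AVERAGE:0.5:24:240     240 rows x 6 min  = 1 day
  RRA:AVERAGE:0.5:168:240    240 rows x 42 min = 1 week
  RRA:AVERAGE:0.5:672:240    240 rows x 2.8 h  = 4 weeks
  RRA:AVERAGE:0.5:5760:370   370 rows x 1 day  = ~1 year

the 0.5 is the xfiles factor: a consolidated row is still recorded as
long as at least half of its source samples are known.)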
forget everything that i wrote below... just use CVS for now or wait
for 3.0.1. :)
-matt
Matt Massie wrote:
here is an idea you might try.
all the rrd code is in ./gmetad/rrd_helpers.c
the function for creating rrds is RRD_create(). you can alter the
format of the round-robin archives there without breaking
compatibility (an upcoming version of gmetad will allow you to
specify the archives in the configuration file).
it's important that you do not change any line starting with "DS"
(data source) since that _will_ break compatibility.
you could change your round-robin granularity there.
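for instance (assuming the default 15-second step), keeping every raw
point for a week needs 7 x 86400 / 15 = 40320 rows, i.e. a line like

  "RRA:AVERAGE:0.5:1:40320"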
with an RRA like that, rrdtool saves all the raw data points for a
week. then you just need a cron job that copies the database to
another location once a week. for example...
cp -r /my/gmetad/data/root "/my/gmetad_archive/$(date +%F)/"
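a minimal crontab entry for that could look like the following (note
that cron wants % escaped as \%):

  # archive the whole rrd tree every sunday at 03:00
  0 3 * * 0  cp -r /my/gmetad/data/root "/my/gmetad_archive/$(date +\%F)/"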
you would then need to write a few simple scripts using rrdtool for
querying data for a particular time period.
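for example, to pull a day of 15-second averages back out of one
archived rrd (the path is only illustrative; gmetad lays files out as
<rootdir>/<cluster>/<host>/<metric>.rrd):

  rrdtool fetch /my/gmetad_archive/2005-01-02/mycluster/node01/cpu_user.rrd \
      AVERAGE --start 20050101 --end 20050102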
-matt
Ramon Bastiaans wrote:
Jason A. Smith wrote:
If you really want long term storage of the raw or nearly raw data,
then rrdtool is probably not the right tool to use. You would be
better off writing your own ganglia frontend client that would
collect the xml data from gmetad at the interval you need, parse it
and store it into some other database or archive. This could also be
done from another computer, so it would have a negligible impact on
the gmetad host.
~Jason
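(A minimal sketch of such a collector, assuming gmetad's default
xml_port of 8651; the hostname and archive path are placeholders:

  # grab gmetad's full XML dump every 15 seconds and append it,
  # compressed, to a per-day archive file
  while true; do
      nc gmetad-host 8651 | gzip >> "/archive/ganglia-$(date +%F).xml.gz"
      sleep 15
  done

gunzip reads the concatenated gzip streams back as a single stream.)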
I have thought about this too.
The problem with this is that if I go to something SQL-ish or
similar, I will have to store about 25 billion rows: 43 metrics x 275
hosts x (31,536,000 seconds per year / one sample every 15 seconds)
~= 2.5 x 10^10, since I'd want to store about 1 year's worth of
metrics at the detailed view's resolution, i.e. a new value every 15
seconds per host per metric.
I am already having nightmares about working with an SQL database
with 25+ billion rows; I doubt it will ever work on the hardware I
have available for the project.
It would almost be more usable (performance- and storage-wise) to
just write additional .rrd files in the same manner gmetad does, and
perhaps use a ramdisk for this.
I agree an SQL database would be much more desirable; still, I am
very tempted to just write a tool that grabs the xml and stores it in
additional RRDs. Using round-robin databases to archive data does go
against their whole concept, though.
If you have a good idea or suggestion on how to store this amount of
data efficiently, without needing an extra cluster just to store and
use the values, I would love to hear it.
- Ramon.