Ron Reeder wrote:
This has bitten me off and on for a couple of years.

See this example of a chunky load graph for the summary of a cluster of servers:

http://www.rreeder.net/graph.load.php.gif

What was going on?

There is not enough 'time' in the first level of the RRD database, so it drops down to the second level to get the requested
one-hour duration.

The gmetad daemon was building the RRD database with 240 intervals of _13_ seconds, _not_ 15 seconds. I don't know why gmetad was doing this, but I could see it in the output of: rrdtool info <hostname-metric.rrd>
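
To make the arithmetic concrete, here is a minimal Python sketch of how much time the first level covers at each interval length. The function name and the pdp_per_row parameter are illustrative assumptions (one data point per row is assumed for the first level); only the numbers 240, 13 and 15 come from the message above.

```python
# Sketch only: rra_coverage_seconds is a hypothetical helper, not part of
# gmetad or rrdtool. It assumes one primary data point per row by default.
def rra_coverage_seconds(step_seconds, rows, pdp_per_row=1):
    """Return the span of time (in seconds) that one RRD level can hold."""
    return step_seconds * pdp_per_row * rows


HOUR = 3600  # the one-hour duration the graph asks for

for step in (13, 15):
    covered = rra_coverage_seconds(step, 240)
    enough = "enough" if covered >= HOUR else "not enough"
    print(f"{step}s x 240 rows = {covered}s ({enough} for one hour)")
```

With 13-second intervals the first level holds 3120 seconds (52 minutes), which is short of an hour; with 15-second intervals it holds exactly 3600 seconds.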

Since the number 240 for intervals is _hard coded_ in the gmetad server, I don't know why you don't go ahead and hard code the interval length to 15 seconds as well. I have done this at my site, and built
a patch and a new rpm based on that patch.

I have also written a simple script to update the RRD databases to fix the interval problem. It just adds a few intervals
so we have enough for a full hour.
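
The script itself is not reproduced here. As a rough sketch of the idea only, and not the actual script, the following Python computes how many intervals a database is short and builds an rrdtool resize command line for it. The helper names, the example file name, and the assumption that the first level is RRA index 0 are all hypothetical; note that rrdtool resize leaves the original file alone and writes its result to resize.rrd in the current directory.

```python
import math


# Sketch only: both helpers are hypothetical and not taken from the script
# mentioned above.
def rows_to_add(step_seconds, current_rows, window_seconds=3600):
    """Return how many extra rows are needed to cover window_seconds."""
    needed = math.ceil(window_seconds / step_seconds)
    return max(0, needed - current_rows)


def resize_command(rrd_path, extra_rows, rra_index=0):
    """Build (but do not run) an 'rrdtool resize ... GROW' command line.

    rra_index=0 assumes the short first level is the first RRA in the file;
    check the output of 'rrdtool info' before relying on that.
    """
    return ["rrdtool", "resize", rrd_path, str(rra_index), "GROW", str(extra_rows)]


extra = rows_to_add(13, 240)
print(extra)
print(" ".join(resize_command("load_one.rrd", extra)))
```

For a 13-second interval this asks for 37 extra rows (277 in total). The command is only printed, not executed, so it can be checked by hand before touching any real database.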

Did this issue ever get addressed in the production code?

Is the script you sent meant to fix the old databases so that they work with the new gmetad? Will all the historic information still be present if you don't run the script?

Thanks,


Sean
