Sorry this is so late, I just noticed your post.

Have you looked into pnp4nagios?

It pulls the perfdata from your check results into an RRD database and then uses RRDtool to graph the results. It's pretty customizable: I'm not an RRD/PHP guy, but I managed to get IOPS/latency reports per volume and as an aggregate from our storage, etc.
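
In case it helps to see what "perfdata" looks like, here is a rough Python sketch of parsing the string a check plugin prints after the "|" in its output, following the standard Nagios plugin format (label=value[UOM];warn;crit;min;max). This is only an illustration of the data pnp4nagios consumes, not how pnp4nagios itself is implemented; the function name and the sample labels (vol1 iops, latency) are made up for the example.

```python
import re
import shlex

# Leading number, then whatever follows it as the unit of measure (may be empty).
_VALUE = re.compile(r"^(-?\d+(?:\.\d+)?)(.*)$")


def parse_perfdata(perfdata):
    """Return a list of dicts, one per metric in a Nagios perfdata string.

    Hypothetical helper for illustration only.
    """
    metrics = []
    # shlex.split handles single-quoted labels that contain spaces.
    for token in shlex.split(perfdata):
        label, sep, rest = token.rpartition("=")
        if not sep:
            continue  # not a label=value pair
        fields = rest.split(";")
        match = _VALUE.match(fields[0])
        if match is None:
            continue  # e.g. "U", meaning the value could not be determined
        metrics.append({
            "label": label,
            "value": float(match.group(1)),
            "uom": match.group(2),
            "warn": fields[1] if len(fields) > 1 else "",
            "crit": fields[2] if len(fields) > 2 else "",
        })
    return metrics


# Made-up sample: two metrics, the first with a quoted label.
sample = "'vol1 iops'=1250;2000;3000 latency=4.2ms;10;20;0"
for metric in parse_perfdata(sample):
    print(metric["label"], metric["value"], metric["uom"])
```

Each of those label/value pairs is the kind of thing that ends up as a data source you can graph over time.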

This does not address the "am I cool, hot or ON FIRE!!!" question, but it does let you as an admin go in later and see "hey look, these 3 web servers hung up when the database went down even though they are supposed to be orthogonal." It also makes cyclic stuff easy to spot (e.g. high utilization on the 3rd Tuesday of every month).

It should work with any Nagios-based system: you basically send your check results to a Perl script.



_______________________________________________
Tech mailing list
Tech@lists.lopsa.org
https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech
This list provided by the League of Professional System Administrators
http://lopsa.org/
