I have used nagios successfully for many years to monitor everything
from 1 server at a client's location to > 1000 servers at $WORK. It
works very well and can be adapted to monitor pretty much anything. We
have adapted it to monitor everything from the applications running on
the servers to the temperatures in the computer room. It handles all our
on-call paging.
Another neat tool I use is Ganglia, it's quite powerful and extensible
and provides detailed RRDtool generated graphs for lots of system
metrics. It however does not handle paging/alerting.
Greg Saunders wrote:
Hi all, I'm looking for recommendations on server monitoring, reporting,
notification toolkit(s). Something to run as a cron job and let me know
if my drive is filling up, CPU has been running at 100% for days on end,
there's no RAM left and you could fry bacon on the HDD because of all
the swapping, that kind of thing. I'm starting to get too many servers
to worry about and I don't want to log in to each of them every day to
check on things ... I'm lazy :)
So, what's been working for ya all?
Thanks!
Greg
------------------------------------------------------------------------
_______________________________________________
clug-talk mailing list
[email protected]
http://clug.ca/mailman/listinfo/clug-talk_clug.ca
Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
**Please remove these lines when replying
_______________________________________________
clug-talk mailing list
[email protected]
http://clug.ca/mailman/listinfo/clug-talk_clug.ca
Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
**Please remove these lines when replying