I have used nagios successfully for many years to monitor everything from 1 server at a client's location to > 1000 servers at $WORK. It works very well and can be adapted to monitor pretty much anything. We have adapted it to monitor everything from the applications running on the servers to the temperatures in the computer room. It handles all our on-call paging.

Another neat tool I use is Ganglia, it's quite powerful and extensible and provides detailed RRDtool generated graphs for lots of system metrics. It however does not handle paging/alerting.

Greg Saunders wrote:
Hi all, I'm looking for recommendations on server monitoring, reporting, notification toolkit(s). Something to run as a cron job and let me know if my drive is filling up, CPU has been running at 100% for days on end, there's no RAM left and you could fry bacon on the HDD because of all the swapping, that kind of thing. I'm starting to get too many servers to worry about and I don't want to log in to each of them every day to check on things ... I'm lazy :)

So, what's been working for ya all?

Thanks!
Greg


------------------------------------------------------------------------

_______________________________________________
clug-talk mailing list
[email protected]
http://clug.ca/mailman/listinfo/clug-talk_clug.ca
Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
**Please remove these lines when replying


_______________________________________________
clug-talk mailing list
[email protected]
http://clug.ca/mailman/listinfo/clug-talk_clug.ca
Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
**Please remove these lines when replying

Reply via email to