Has anyone got something like this going... -
We’re going to have an
environment where we’ve got lots of servers out in the field (i.e., at over
a thousand locations, two servers per locations) that are running mon to watch
both themselves and their local standby server. If something goes awry
out in the field, the remote servers are going to alert the centralized
monitoring console back at the corporate office. -
The corporate office server’s
purpose is to make sure that everyone out in the field is working correctly,
track uptime statistics, and alert the IT staff if/when something goes wrong
out in the field. I’m thinking about setting up the corporate
monitoring server to just expect mon traps every “x” minutes from
each of the servers, and if it doesn’t get a timely response, alert on
that as well. What I’m trying to get running back at corporate is: -
A good looking display. The
mon.cgi web page is perfect for each of the branch servers (we’re going
to be running it out on each of the remote servers), but I don’t think
its going to scale well back at corporate trying to show a picture of all the
checks it is expecting from 2000+ different servers. -
A way to perform checks upon demand,
but not try to be a full-fledged polling system. Basically, expect
reports in from the servers and allow an ad-hoc request. -
Good roll up views (i.e., geography
based or company division based). -
Statistics available for each
branch, the environment as a whole, and hopefully a division/geography based
breakout as well. Does anyone have anything like Nagios tied into mon servers for
that kind of centralized console? Or is there perhaps another way of
doing this? Suggestions are welcome. Thanks, Tim |
_______________________________________________ mon mailing list mon@linux.kernel.org http://linux.kernel.org/mailman/listinfo/mon