Has anyone got something like this going?

 

-          We’re going to have an environment with lots of servers out in the field (over a thousand locations, two servers per location), each running mon to watch both itself and its local standby server.  If something goes awry out in the field, the remote servers will alert the centralized monitoring console back at the corporate office.

-          The corporate office server’s purpose is to make sure that everyone out in the field is working correctly, track uptime statistics, and alert the IT staff if/when something goes wrong out in the field.  I’m thinking about setting up the corporate monitoring server to just expect a mon trap every “x” minutes from each of the servers, and if a trap doesn’t arrive on time, alert on that as well.
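
To make the "expect a trap every x minutes" idea concrete, here is a rough sketch of the bookkeeping the corporate server would need. This is not mon code and does not use mon's API; the class name, method names, and server names are made up for illustration, and it assumes something else receives the traps and calls record_trap().

```python
import time
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class HeartbeatTracker:
    """Hypothetical helper: remembers when each field server last reported in."""

    max_silence_seconds: float
    last_seen: Dict[str, float] = field(default_factory=dict)

    def record_trap(self, server: str, now: Optional[float] = None) -> None:
        # Called whenever a trap arrives from a field server.
        self.last_seen[server] = time.time() if now is None else now

    def overdue(self, expected: List[str], now: Optional[float] = None) -> List[str]:
        # A server is overdue if it has never reported, or if its last
        # trap is older than the allowed silence window.
        now = time.time() if now is None else now
        late = []
        for server in expected:
            seen = self.last_seen.get(server)
            if seen is None or now - seen > self.max_silence_seconds:
                late.append(server)
        return late
```

The corporate server would run overdue() on a timer against the full list of expected servers and raise an alert for whatever comes back. Treating a never-seen server as overdue is a design choice in this sketch; it catches branches that were never brought online.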

 

What I’m trying to get running back at corporate is:

 

-          A good-looking display.  The mon.cgi web page is perfect for each of the branch servers (we’re going to be running it on each of the remote servers), but I don’t think it’s going to scale well back at corporate, where it would have to show every check expected from 2000+ different servers.

-          A way to perform checks on demand, without trying to be a full-fledged polling system.  Basically, expect reports in from the servers and allow ad-hoc requests.

-          Good roll up views (i.e., geography based or company division based).

-          Statistics available for each branch, the environment as a whole, and hopefully a division/geography based breakout as well.
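
As a rough illustration of the roll-up views and per-group statistics I have in mind, here is a small sketch. It assumes each status report can be reduced to a (group, server, is_ok) tuple, where the group is a division or geography label; the function names and that tuple layout are hypothetical, not anything mon provides.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def roll_up(statuses: Iterable[Tuple[str, str, bool]]) -> Dict[str, Dict[str, int]]:
    """Count healthy and failed servers per group (division or geography)."""
    summary: Dict[str, Dict[str, int]] = defaultdict(lambda: {"ok": 0, "failed": 0})
    for group, _server, is_ok in statuses:
        summary[group]["ok" if is_ok else "failed"] += 1
    return dict(summary)


def percent_ok(counts: Dict[str, int]) -> float:
    """Share of servers in one group that are currently healthy, as a percentage."""
    total = counts["ok"] + counts["failed"]
    return 100.0 * counts["ok"] / total if total else 0.0
```

The same counts could be summed across all groups for the environment-wide figure, so a console would only need to drill down to individual branches when a group shows failures.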

 

Does anyone have anything like Nagios tied into mon servers for that kind of centralized console?  Or is there perhaps another way of doing this?

 

Suggestions are welcome.

 

Thanks,

Tim

 

_______________________________________________
mon mailing list
mon@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/mon
