Background: I'm looking to revamp our ColdFusion monitoring processes to determine that our shared hosting applications are "up". Right now, "up" is determined by doing a 'select tname from tab' for every datasource that exists... every 5 minutes. I don't know the reasoning behind it, it just exists. Every CF app that we host has at least one datasource, so in theory it's a valid test. Except... if our database happens to go down (and it does... there's no failover), every production CF server will eventually crash. And it usually crashes due to excessive amounts of threads stuck trying to open connections to the database. Servers that aren't being monitored in this manor (e.g. staging and test boxes) are fine.
So, it's time to refactor the way we monitor our applications. The monitoring team is moving to Nagios within the next 6 months, but we (the CF team) would like to do something sooner. What do others that host shared applications use to determine that those applications are up? One final note... we have a request in for FusionReactor, and we'll be receiving it sometime in the next month. However, the external monitor will still need something to test against... hence, we'll still need to determine a new way to define an application as being "up". Matthew Williams ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Create robust enterprise, web RIAs. Upgrade & integrate Adobe Coldfusion MX7 with Flex 2 http://ad.doubleclick.net/clk;56760587;14748456;a?http://www.adobe.com/products/coldfusion/flex2/?sdid=LVNU Archive: http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:264043 Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4

