Our configuration on the master server (running nagios 2.11) includes the NDUUtils module which writes Nagios data into a set of MySQL tables. The MySQL server is in a separate rack from the Nagios master server. Late yesterday evening (Sunday) there was a network switch problem which meant (among other things that you do not need to know about) that the Nagios process lost contact with the MySQL server. From that point on there were no notifications nor event-handlers run. My assumption is that the loss of contact to the MySQL server caused the single-threaded part of the Nagios process to stall until contact was restored; as a result notifications and event-handlers did not run as they are also in the single-threaded part of the code. Is my assumption correct? If not, can anyone suggest an alternative explanation? As far as I can tell the Nagios process continued to run as the log continued to record events - however log switching (at midnight) did not happen (also in the single-threaded part of the code).
Jonathan Wheeler e-Science Centre Rutherford Appleton Laboratory -- Scanned by iCritical. ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Nagios-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
