Looks like the restarter script is coded to restart the service on the 3rd SOFT non-OK status.
The log shows SOFT 1, SOFT 2, then HARD 3, so the 3rd SOFT state the script waits for never occurs. You either have to restart the service in the script on the 2nd SOFT state to avoid the HARD state, or raise the retry count (max_check_attempts) to allow 3 SOFT states before the HARD one. That being said, the service should still have been restarted following the HARD CRITICAL call.

-----Original Message-----
From: Alex Huth [mailto:[email protected]]
Sent: Tuesday, September 08, 2009 9:25 AM
To: [email protected]
Subject: [Nagios-users] Remote restart by ssh

Hi!

I have a service on a remote machine which dies frequently. I want to restart it via ssh. I followed the steps in the guide, but when I stop it, nothing happens. This is what I have done so far:

commands.cfg

    define command{
        command_name    restart_saprouter
        command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -t 175 -l admin -i /adm/etc/nagios/id_nagios -C 'usr/local/libexec/eventhandlers/restart_saprouter.sh $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$'
    }

services.cfg

    define service{
        name                    xxxx.base.saprouter-running
        use                     xxxx.base-service
        hostgroup_name          xxxx.base.saprouters
        service_description     SAP router process present
        check_command           ssh_check_procs!1:1!1:1!saprouter
        event_handler           restart_saprouter
    }

    define servicedependency{
        hostgroup_name                  xxxx.base.saprouters
        dependent_hostgroup_name        xxxx.base.saprouters
        service_description             SSH
        dependent_service_description   SAP router process present
        execution_failure_criteria      n
        notification_failure_criteria   w,u,c
    }

The event handler script on the remote machine:

    #!/bin/sh
    case "$1" in
    OK)
        ;;
    WARNING)
        ;;
    UNKNOWN)
        ;;
    CRITICAL)
        case "$2" in
        SOFT)
            case "$3" in
            3)
                echo -n "Restarting SAP-Router service (3rd soft critical state)..."
                /etc/init.d/saprouter.sh restart
                ;;
            esac
            ;;
        HARD)
            echo -n "Restarting SAP-Router service..."
            /etc/init.d/saprouter.sh restart
            ;;
        esac
        ;;
    esac
    exit 0

Nagios log:

    [1252415755] SERVICE ALERT: xxxxxxxx;SAP router process present;CRITICAL;SOFT;1;PROCS CRITICAL: 0 processes with command name 'saprouter'
    [1252415755] SERVICE EVENT HANDLER: xxxxxxxxxxxxx;SAP router process present;(null);(null);(null);restart_saprouter
    [1252415815] SERVICE ALERT: xxxxxxxxxxxx;SAP router process present;CRITICAL;SOFT;2;PROCS CRITICAL: 0 processes with command name 'saprouter'
    [1252415815] SERVICE EVENT HANDLER: xxxxxxxxxxx;SAP router process present;(null);(null);(null);restart_saprouter
    [1252415875] SERVICE ALERT: xxxxxxxxxx;SAP router process present;CRITICAL;HARD;3;PROCS CRITICAL: 0 processes with command name 'saprouter'
    [1252415875] SERVICE EVENT HANDLER: xxxxxxxxxxxxx;SAP router process present;(null);(null);(null);restart_saprouter

Can someone shed some light on where my mistake is?

Thx
Alex

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day
trial. Simplify your report design, integration and deployment - and focus on
what you do best, core application coding. Discover what's new with
Crystal Reports now. http://p.sf.net/sfu/bobj-july
_______________________________________________
Nagios-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
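
For reference, the first suggestion in the reply (restart on the 2nd SOFT state) could look roughly like the sketch below. It assumes max_check_attempts is 3, which the SOFT 1, SOFT 2, HARD 3 sequence in the log suggests but the posted config does not show. The helper name should_restart is made up for illustration; the init script path is the one from the original script.

```shell
#!/bin/sh
# Sketch of an event handler that restarts on the 2nd SOFT CRITICAL state.
# Arguments: $1 = $SERVICESTATE$, $2 = $SERVICESTATETYPE$, $3 = $SERVICEATTEMPT$

# should_restart (hypothetical helper) returns 0 when a restart is due.
should_restart() {
    case "$1" in
    CRITICAL)
        case "$2" in
        SOFT)
            # Assumption: max_check_attempts is 3, so attempt 2 is the
            # last SOFT state before the service goes HARD.
            if [ "$3" = "2" ]; then
                return 0
            fi
            ;;
        HARD)
            # Fallback: still try a restart once the state is HARD.
            return 0
            ;;
        esac
        ;;
    esac
    return 1
}

if should_restart "$1" "$2" "$3"; then
    echo "Restarting SAP-Router service..."
    /etc/init.d/saprouter.sh restart
fi
```

Keeping the decision in one function makes it easy to check by hand which state/type/attempt combinations trigger a restart before wiring the script into Nagios.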
