Looks like the restarter script is coded to restart the service on the 3rd SOFT 
non-OK status.

The log shows Soft1, Soft2 then Hard3.  Looks like you have to restart service 
in the script on the 2nd Soft status to avoid the HARD state or change the 
retry count to allow for 3 SOFT status before the HARD.

That being said, the service should have been restarted following the HARD, 
CRITICAL call.

-----Original Message-----
From: Alex Huth [mailto:[email protected]] 
Sent: Tuesday, September 08, 2009 9:25 AM
To: [email protected]
Subject: [Nagios-users] Remote restart by ssh

Hi!

I have a service on a remote machine which dies frequently. I want to restart
him by ssh. I followed the steps in the guide, but when i stop him nothing
happens. 
That's what i have done so far:

commands.cfg

define command{
        command_name    restart_saprouter
command_line $USER1$/check_by_ssh -H $HOSTADDRESS$ -t 175 -l admin -i
/adm/etc/nagios/id_nagios -C
'usr/local/libexec/eventhandlers/restart_saprouter.sh $SERVICESTATE$
$SERVICESTATETYPE$ $SERVICEATTEMPT$'    
        }


services.cfg

define service{
        name                            xxxx.base.saprouter-running
        use                             xxxx.base-service
        hostgroup_name                  xxxx.base.saprouters
        service_description             SAP router process present
        check_command                   ssh_check_procs!1:1!1:1!saprouter
        event_handler                   restart_saprouter
        }
define servicedependency{
        hostgroup_name                  xxxx.base.saprouters
        dependent_hostgroup_name        xxxx.base.saprouters
        service_description             SSH
        dependent_service_description   SAP router process present
        execution_failure_criteria      n
        notification_failure_criteria   w,u,c
        }

the eventhandler script on the remote machine:

#!/bin/sh

case "$1" in
OK)
        ;;
WARNING)
        ;;
UNKNOWN)
        ;;
CRITICAL)
        case "$2" in

        SOFT)

                case "$3" in

                3)
                        echo -n "Restarting SAP-Router service (3rd soft
critical state)..."
                        /etc/init.d/saprouter.sh restart
                        ;;
                        esac
                ;;

        HARD)
                echo -n "Restarting SAP-Router service..."
                /etc/init.d/saprouter.sh restart
                ;;
        esac
        ;;
esac
exit 0


log of nagios:

[1252415755] SERVICE ALERT: xxxxxxxx;SAP router process
present;CRITICAL;SOFT;1;PROCS CRITICAL: 0 processes with command name
'saprouter'
[1252415755] SERVICE EVENT HANDLER: xxxxxxxxxxxxx;SAP router process
present;(null);(null);(null);restart_saprouter
[1252415815] SERVICE ALERT: xxxxxxxxxxxx;SAP router process
present;CRITICAL;SOFT;2;PROCS CRITICAL: 0 processes with command name
'saprouter'
[1252415815] SERVICE EVENT HANDLER: xxxxxxxxxxx;SAP router process
present;(null);(null);(null);restart_saprouter
[1252415875] SERVICE ALERT: xxxxxxxxxx;SAP router process
present;CRITICAL;HARD;3;PROCS CRITICAL: 0 processes with command name
'saprouter'
[1252415875] SERVICE EVENT HANDLER: xxxxxxxxxxxxx;SAP router process
present;(null);(null);(null);restart_saprouter

Can someone give me light where my fault is?

Thx

Alex

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Nagios-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Nagios-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Reply via email to