Re: [Nagios-users] Nagios Warning Bug/Misconfiguration

2009-04-08 Thread Thomas Donnelly
Got the issue. Turns out someone installed the Cisco MIBS along with the 
net snmp mibs. Thank you very much for your help.

For anyone searching and finds this check

/usr/local/share/snmp/mibs

and see if you have some-snmp-mib.txt and some-snmp-mib.my


Thanks Again!
-=Tom Donnelly


Patrick Morris wrote:
 On Tue, 07 Apr 2009, Thomas Donnelly wrote:

   
 Thanks for the quick reply!

 Ran from the command line got:

 # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95: -c 75:
 SNMP WARNING - 100 | SNMPv2-SMI::mib-2.33.1.2.4.0=100

 # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95 -c 75:
 SNMP WARNING - *100* | SNMPv2-SMI::mib-2.33.1.2.4.0=100

 So by intentionally triggering it again (remove :), it shows the *'s
 

 How about if you add a -v to get verbose output?

 Also, you may want to check the return code from the manual run on an OK
 resultx (for example, by running echo $? aafter your check_snmp
 command to make sure it matches what you see in the output).

 What happens in my case occasionally is that I install a screwed-up MIB
 for an unrelated service. It won't show any obvious errors, but it will
 cause check_snmp to return a warning result code regardless of whether
 the SNMP result falls within my thresholds.

 In effect, it's warning me that my MIBs are hosed, based on
 the fact that it got a non-OK result from snmpget (which is what
 check_snmp calls to do the actual SNMP getting).



   
 Not really sure what they mean by:

 1. Prevent check_snmp from loading the MIBs (default behaviour) by using 
 numeric oids AND using the -m : option
 

 If you a numeric OID rather than mib-2.33.1.2.4.0 and pass the -m :
 then check_snmp (and, by extension, snmpget) don't need to load the MIBs
 at all, so you don't get an error if you've got a bad MIB.

   


 Patrick Morris wrote:
 
 What happens when you run it manually? 

 This, maybe?

 http://www.nagios.org/faqs/viewfaq.php?faq_id=208

 On Tue, 07 Apr 2009, Thomas Donnelly wrote:

   
   
 Hi all,

 I am having an issue with all of the devices I added showing warning all 
 the time. It is a simple snmp check to see if the amps are above 
 160warn/180critical. They always say warning even though they are less 
 than the specified 160. One thing to note is once it hits the 160 mark 
 it gets the * value * in turn, showing that it is actually in the 
 warning range. I have shown the neccesary data I hope below. Any/all 
 help is greatly appreciated.



 # uname -a
 FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue 
 Oct  9 22:39:13 EST 2007 
 r...@server.example.net:/usr/obj/usr/src/sys/MONITOR  i386

 Nagios
 Version 2.0b3


 # ./check_snmp -V
 check_snmp (nagios-plugins 1.4.3) 1.58



  From the webui

 APC-RR-R3-1.hou
  check_rr_amp
  WARNING  04-07-2009 13:00:17   8d 3h 17m 2s  10/10  SNMP WARNING - 90

  APC-RR-R3-2.hou
  check_rr_amp
  WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*

 ^note the * 160 * for the one that actually is in the warning range.


 from checkcommands.cfg

 define command {
command_name check_rr_amp
command_line$USER1$/check_snmp -H $HOSTADDRESS$ -o 
 mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
 }


  From the hosts config file.

 define service{
host_name   APC-RR-R1-1.hou
service_description check_rr_amp
check_command   check_rr_amp!159!179
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  backup-admins
 }



 Thanks!
 -=Tom


 --
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when 
 reporting any issue. 
 ::: Messages without supporting info will risk being sent to /dev/null
 
 


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS

[Nagios-users] Nagios Warning Bug/Misconfiguration

2009-04-07 Thread Thomas Donnelly
Hi all,

I am having an issue with all of the devices I added showing warning all 
the time. It is a simple snmp check to see if the amps are above 
160warn/180critical. They always say warning even though they are less 
than the specified 160. One thing to note is once it hits the 160 mark 
it gets the * value * in turn, showing that it is actually in the 
warning range. I have shown the neccesary data I hope below. Any/all 
help is greatly appreciated.



# uname -a
FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue 
Oct  9 22:39:13 EST 2007 
r...@server.example.net:/usr/obj/usr/src/sys/MONITOR  i386

Nagios
Version 2.0b3


# ./check_snmp -V
check_snmp (nagios-plugins 1.4.3) 1.58



 From the webui

APC-RR-R3-1.hou
 check_rr_amp
 WARNING  04-07-2009 13:00:17   8d 3h 17m 2s  10/10  SNMP WARNING - 90

 APC-RR-R3-2.hou
 check_rr_amp
 WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*

^note the * 160 * for the one that actually is in the warning range.


from checkcommands.cfg

define command {
   command_name check_rr_amp
   command_line$USER1$/check_snmp -H $HOSTADDRESS$ -o 
mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
}


 From the hosts config file.

define service{
   host_name   APC-RR-R1-1.hou
   service_description check_rr_amp
   check_command   check_rr_amp!159!179
   max_check_attempts  10
   normal_check_interval   5
   retry_check_interval3
   check_period24x7
   notification_interval   30
   notification_period 24x7
   notification_optionsw,c,r
   contact_groups  backup-admins
}



Thanks!
-=Tom


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios Warning Bug/Misconfiguration

2009-04-07 Thread Thomas Donnelly
Thanks for the quick reply!

Ran from the command line got:

# ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95: -c 75:
SNMP WARNING - 100 | SNMPv2-SMI::mib-2.33.1.2.4.0=100

# ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95 -c 75:
SNMP WARNING - *100* | SNMPv2-SMI::mib-2.33.1.2.4.0=100

So by intentionally triggering it again (remove :), it shows the *'s



Not really sure what they mean by:

1. Prevent check_snmp from loading the MIBs (default behaviour) by using 
numeric oids AND using the -m : option




Patrick Morris wrote:
 What happens when you run it manually? 

 This, maybe?

 http://www.nagios.org/faqs/viewfaq.php?faq_id=208

 On Tue, 07 Apr 2009, Thomas Donnelly wrote:

   
 Hi all,

 I am having an issue with all of the devices I added showing warning all 
 the time. It is a simple snmp check to see if the amps are above 
 160warn/180critical. They always say warning even though they are less 
 than the specified 160. One thing to note is once it hits the 160 mark 
 it gets the * value * in turn, showing that it is actually in the 
 warning range. I have shown the neccesary data I hope below. Any/all 
 help is greatly appreciated.



 # uname -a
 FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue 
 Oct  9 22:39:13 EST 2007 
 r...@server.example.net:/usr/obj/usr/src/sys/MONITOR  i386

 Nagios
 Version 2.0b3


 # ./check_snmp -V
 check_snmp (nagios-plugins 1.4.3) 1.58



  From the webui

 APC-RR-R3-1.hou
  check_rr_amp
  WARNING  04-07-2009 13:00:17   8d 3h 17m 2s  10/10  SNMP WARNING - 90

  APC-RR-R3-2.hou
  check_rr_amp
  WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*

 ^note the * 160 * for the one that actually is in the warning range.


 from checkcommands.cfg

 define command {
command_name check_rr_amp
command_line$USER1$/check_snmp -H $HOSTADDRESS$ -o 
 mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
 }


  From the hosts config file.

 define service{
host_name   APC-RR-R1-1.hou
service_description check_rr_amp
check_command   check_rr_amp!159!179
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  backup-admins
 }



 Thanks!
 -=Tom


 --
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when reporting 
 any issue. 
 ::: Messages without supporting info will risk being sent to /dev/null
 


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Hosts randomly disappearing

2008-12-08 Thread Thomas Donnelly
Hello All,

I have about 115 hosts in my nagios pool and I recently added a group of 
7 UPSs. I made my own check using the check_snmp to check the load 
percentage. Funny thing is, some times they show up in the host detail 
and some times they do not. Refreshing a few times will get them to 
show, then another refresh usually causes them to disappear. Opening one 
of the hosts usually results in the message: Error: Host Status 
Information Not Found! a few refreshes gets the host state to become 
viewable. I find this behavior only with the 7 hosts i just added. The 
config file is:

#cat ups.cfg

#UPS1
define host{
host_name   ups-r1.hou.example.com
alias   UPS Rack 1
address 192.168.96.8
check_command   check-host-alive
max_check_attempts  10
process_perf_data   1
retain_status_information   1
retain_nonstatus_information0
contact_groups  web-admins
notification_optionsd,r
notification_interval   30
notification_period 24x7
parents co1.hou.example.com
}
define hostextinfo{
host_name   ups-r1.hou.example.com
icon_image  www.jpg
statusmap_image www.gd2
}


define service{
host_name   ups-r1.hou.example.com
service_description check_ping
check_command   check_ping!140,25%!180,50%
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  web-admins
}

define service{
host_name   ups-r1.hou.example.com
service_description check_ups_load
check_command   check_ups_load!84!89
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  web-admins
}

#UPS2
define host{
host_name   ups-r2.hou.example.com
alias   UPS Rack 2
address 192.168.96.9
check_command   check-host-alive
max_check_attempts  10
process_perf_data   1
retain_status_information   1
retain_nonstatus_information0
contact_groups  web-admins
notification_optionsd,r
notification_interval   30
notification_period 24x7
parents co1.hou.example.com
}
define hostextinfo{
host_name   ups-r2.hou.example.com
icon_image  www.jpg
statusmap_image www.gd2
}


define service{
host_name   ups-r2.hou.example.com
service_description check_ping
check_command   check_ping!140,25%!180,50%
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  web-admins
}

define service{
host_name   ups-r2.hou.example.com
service_description check_ups_load
check_command   check_ups_load!84!89
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  web-admins
}

#UPS3
define host{
host_name   ups-r3.hou.example.com
alias   UPS Rack 3
address 192.168.96.10
check_command   check-host-alive
max_check_attempts  10
process_perf_data   1
retain_status_information   1
retain_nonstatus_information0
contact_groups  web-admins