Re: [Nagios-users] serious performance issue
I ran into almost the same issue. After setting enable_embedded_perl=0, the load average went up but latencies became lower.

On Wed, Apr 8, 2009 at 11:54 AM, shadih rahman shadhi...@gmail.com wrote:

I am seeing a ton of orphaned error messages for both services and hosts. I am running Nagios on a quad core 2.2 GHZ machine running 4 GHZ memory. I will paste my configuration file below. I have the machine sending ndo to a local database sitting on a 170 GB hard drive. Nagios is obsessing over both hosts and services and sending data to a machine with an identical configuration. I am doing failover using NSCA. Please advise on this.

nagios.cfg

log_file=/var/log/nagios/nagios.log
cfg_file=/etc/nagios/commands.cfg
cfg_file=/etc/nagios/contacts.cfg
cfg_file=/etc/nagios/timeperiods.cfg
cfg_file=/etc/nagios/templates.cfg
cfg_dir=/etc/nagios/hosts
cfg_dir=/etc/nagios/services
object_cache_file=/var/log/nagios/objects.cache
precached_object_file=/var/log/nagios/objects.precache
resource_file=/etc/nagios/resource.cfg
status_file=/var/log/nagios/status.dat
status_update_interval=60
nagios_user=nagios
nagios_group=nagios
check_external_commands=1
command_check_interval=-1
command_file=/var/log/nagios/rw/nagios.cmd
external_command_buffer_slots=8192
lock_file=/var/log/nagios/nagios.lock
temp_file=/var/log/nagios/nagios.tmp
temp_path=/tmp
event_broker_options=8
broker_module=/usr/lib64/nagios/ndomod.o config_file=/etc/nagios/ndomod.cfg
log_rotation_method=m
log_archive_path=/var/log/nagios/archives
use_syslog=1
log_notifications=1
log_service_retries=1
log_host_retries=1
log_event_handlers=1
log_initial_states=0
log_external_commands=1
log_passive_checks=1
service_inter_check_delay_method=n
max_service_check_spread=30
service_interleave_factor=s
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0
check_result_reaper_frequency=2
max_check_result_reaper_time=10
check_result_path=/var/log/nagios/spool/checkresults
max_check_result_file_age=3600
cached_host_check_horizon=15
cached_service_check_horizon=15
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
soft_state_dependencies=1
auto_reschedule_checks=1
auto_rescheduling_interval=30
auto_rescheduling_window=180
sleep_time=0.25
service_check_timeout=30
host_check_timeout=20
event_handler_timeout=30
notification_timeout=60
ocsp_timeout=5
perfdata_timeout=5
retain_state_information=1
state_retention_file=var/log/nagios/retention.dat
retention_update_interval=60
use_retained_program_state=1
use_retained_scheduling_info=1
retained_host_attribute_mask=0
retained_service_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
interval_length=60
use_aggressive_host_checking=0
execute_service_checks=1
accept_passive_service_checks=1
execute_host_checks=1
accept_passive_host_checks=1
enable_notifications=1
enable_event_handlers=1
process_performance_data=0
obsess_over_services=1
ocsp_command=send_service_check
ochp_command=send_host_check
obsess_over_hosts=1
translate_passive_host_checks=0
passive_host_checks_are_soft=0
check_for_orphaned_services=1
check_for_orphaned_hosts=1
check_service_freshness=1
service_freshness_check_interval=60
check_host_freshness=0
host_freshness_check_interval=60
additional_freshness_latency=15
enable_flap_detection=1
low_service_flap_threshold=5.0
high_service_flap_threshold=20.0
low_host_flap_threshold=5.0
high_host_flap_threshold=20.0
date_format=us
enable_embedded_perl=1
use_embedded_perl_implicitly=1
illegal_object_name_chars=`~!$%^*|'?,()=
illegal_macro_output_chars=`~$|'
use_regexp_matching=0
use_true_regexp_matching=0
admin_email=sr2...@columbia.edu
daemon_dumps_core=0
use_large_installation_tweaks=1
enable_environment_macros=1
debug_level=-1
debug_verbosity=2
debug_file=/var/log/nagios/nagios.debug
max_debug_file_size=100

my nagiostats output

[sr2690nagiostats
Nagios Stats 3.0.6
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 12-01-2008
License: GPL

CURRENT STATUS DATA
--
Status File: /var/log/nagios/status.dat
Status File Age: 0d 0h 0m 19s
Status File Version: 3.0.6
Program Running Time: 0d 2h 5m 28s
Nagios PID: 12139
Used/High/Total Command Buffers: 0 / 0 / 8192
Total Services: 2783
Services Checked: 2783
Services Scheduled: 2782
Services Actively Checked: 2783
Services Passively Checked: 0
Total Service State Change: 0.000 / 52.830 / 0.263 %
Active Service Latency: 1.304 /
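When comparing a change such as enable_embedded_perl=0 against a long configuration like the one quoted above, it can help to load the directives into a structure first. The following is only a minimal sketch, assuming plain name=value lines with # comments; the function name parse_nagios_cfg is hypothetical and not part of Nagios.

```python
from collections import defaultdict


def parse_nagios_cfg(text):
    """Parse nagios.cfg-style 'name=value' lines into {name: [values]}.

    Directives such as cfg_file and cfg_dir may repeat, so every value
    is kept in a list, in file order.
    """
    directives = defaultdict(list)
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Split on the first '=' only; values may contain '=' themselves.
        name, sep, value = line.partition("=")
        if not sep:
            continue  # not a directive line
        directives[name.strip()].append(value.strip())
    return dict(directives)


sample = """\
# excerpt from the configuration quoted above
cfg_dir=/etc/nagios/hosts
cfg_dir=/etc/nagios/services
broker_module=/usr/lib64/nagios/ndomod.o config_file=/etc/nagios/ndomod.cfg
enable_embedded_perl=1
use_large_installation_tweaks=1
"""

cfg = parse_nagios_cfg(sample)
print(cfg["enable_embedded_perl"])
print(cfg["cfg_dir"])
```

Two such dictionaries, one per server or one per config revision, can then be compared key by key to see exactly which directives differ.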
Re: [Nagios-users] Interesting problem while trying to monitor Oracle RAC services [Solved]
check the environment of the users launching the script. Which user do you use to launch the script locally? And which one from remote?

On the Nagios server I have tried executing it as the root user as well as the nagios user, but the problem remains.

Hello all,

I made a mistake in the plug-in. Both the Perl script and the KSH script reside in the same directory. When executed locally, the Perl script knew where to look for the external shell script. The code was as follows:

    my $PIPED = qx# ksh check_oracle_services.sh $SERVICE #;

But when executed from the Nagios server, the NRPE daemon on the monitored host wouldn't know where to look for the shell script, hence the wrong output. Adding the absolute path to check_oracle_services.sh fixed the problem:

    my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh $SERVICE #;

The new code is as follows (maybe someone will find it useful):

check_oracle_services.pl

    #!/usr/bin/env perl
    use strict;
    use Getopt::Std;

    # Nagios plugin exit codes
    my %return_value = ( OK => 0, CRIT => 2, UNKNOWN => 3 );
    my $message = "nagios";
    my $exit_status;
    my %opt = ();
    getopts("p:h", \%opt);

    sub usage {
        print "Usage: $0 -p service_name\n";
        exit $return_value{'UNKNOWN'};
    }

    usage() if defined $opt{'h'};
    usage() unless defined $opt{'p'};
    my $SERVICE = $opt{'p'};

    # Absolute path, so the helper is found whatever the working directory is
    my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh $SERVICE #;

    if ($PIPED =~ /OFFLINE/) {
        $exit_status = $return_value{'CRIT'};
        $message = "Critical: $SERVICE is not running.";
    } else {
        $exit_status = $return_value{'OK'};
        $message = "OK: $SERVICE is running.";
    }
    print "$message\n";
    exit $exit_status;

check_oracle_services.sh

    #!/usr/bin/ksh
    RSC_KEY=$1
    /oracle/crs_home/bin/crs_stat -u | awk \
     'BEGIN { FS="="; state = 0; } \
     $1~/NAME/ && $2~/'$RSC_KEY'/ {appname = $2; state=1}; \
     state == 0 {next;} \
     $1~/TARGET/ && state == 1 {apptarget = $2; state=2;} \
     $1~/STATE/ && state == 2 {appstate = $2; state=3;} \
     state == 3 {printf "%-45s %-18s\n", appname, appstate; state=0;}'

Sorry for the inconvenience caused.
Thanks -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
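The fix above hard-codes an absolute path. An alternative is to derive the helper's location from the plugin's own path, so the pair can be moved together. The sketch below shows that idea in Python rather than Perl, purely as an illustration; helper_command and the service name are hypothetical and do not come from the original plug-in.

```python
import os


def helper_command(plugin_file, helper_name, service):
    """Build the argv for a helper script that lives next to the plugin.

    The helper path is derived from the plugin's own location, so it does
    not depend on the working directory of whatever launches the plugin
    (an interactive shell, or a daemon such as NRPE).
    """
    plugin_dir = os.path.dirname(os.path.abspath(plugin_file))
    return ["ksh", os.path.join(plugin_dir, helper_name), service]


# Hypothetical service name, for illustration only.
cmd = helper_command(
    "/home/nagios/nrpe/libexec/check_oracle_services.pl",
    "check_oracle_services.sh",
    "example_service",
)
print(cmd)
```

In a real plugin the first argument would be the script's own path (for example `__file__`), and the list could be handed to `subprocess.run`, which also avoids passing the service name through a shell.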
Re: [Nagios-users] monitor primergy servers with esx
hi,

I have done some tests. One needs the RAID.mib file, and this file is only in the Windows package for serverviewraid (tsk, tsk). Anyway, once we have this file we can test stuff.

Important OIDs:

.1.3.6.1.4.1.231.2.49.1.5.2.1 - .iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrPhysicalDeviceInfo.svrPhysicalDeviceTable.svrPhysicalDeviceEntry
.1.3.6.1.4.1.231.2.49.1.3 - .iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrStatus

Situation 1: disks are online and working fine:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: ok(1)

[j...@pc2668-210307 mibs]$ snmpwalk izvm01 -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.2.3.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.2.3.0 = INTEGER: 2
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.2.3.0 = INTEGER: 3
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.2.3.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.1.0.0 = STRING: ST373455SS
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.2.3.0 = STRING: ST373455SS
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.1.0.0 = STRING: SEAGATE
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.2.3.0 = STRING: SEAGATE
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.1.0.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.2.3.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.1.0.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.2.3.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceType.1.1.0.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceType.1.2.3.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.1.0.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.2.3.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.1.0.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.2.3.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.1.0.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.2.3.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.1.0.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.2.3.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.1.0.0 = STRING: 1651
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.2.3.0 = STRING: 1651
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.1.0.0 = STRING: 3LQ0DA03
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.2.3.0 = STRING: 3LQ0DAD7
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.1.0.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.2.3.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.1.0.0 = INTEGER: 11
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.2.3.0 = INTEGER: 12
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.1.0.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.2.3.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.1.0.0 = INTEGER: 70007
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.2.3.0 = INTEGER: 70007

Disks are online.

Situation 2: I remove one disk from its bay:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: failure(3)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

Everything is 'prefailure' except svrStatusPhysicalDevices.0, which is 'failure' (the disk is physically removed from the bay). I forgot to check the other OID for this one; I'll post the results later.

Situation 3: the 'failed' disk is back in the bay, rebuilding starts:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

Everything is 'prefailure' except svrStatusPhysicalDevices.0, which is 'ok' (the disk is in the bay).

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.0.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.3.0.3 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.0.0.0 =
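The svrStatus walks in the three situations above suggest a simple way to turn this output into a Nagios state. The sketch below is only an illustration: the mapping of ok/prefailure/failure to OK/WARNING/CRITICAL is an assumption rather than something taken from the MIB, and worst_status is a hypothetical helper name.

```python
import re

# Assumed mapping from the status names seen in the walks above to
# Nagios plugin exit codes; adjust to local policy.
STATUS_TO_EXIT = {"ok": 0, "prefailure": 1, "failure": 2}
UNKNOWN = 3

# Matches lines such as:
#   FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)
LINE_RE = re.compile(r"^(\S+)::(\S+) = INTEGER: (\w+)\((\d+)\)$")


def worst_status(snmpwalk_output):
    """Return (exit_code, details) for the worst status in the output.

    A state name missing from STATUS_TO_EXIT counts as UNKNOWN (3),
    which in this sketch sorts above CRITICAL.
    """
    worst = None
    details = []
    for line in snmpwalk_output.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # plain integers, strings and counters are ignored
        _mib, name, state, _number = match.groups()
        code = STATUS_TO_EXIT.get(state, UNKNOWN)
        details.append(f"{name}={state}")
        if worst is None or code > worst:
            worst = code
    if worst is None:
        return UNKNOWN, ["no status lines found"]
    return worst, details


situation_2 = """\
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: failure(3)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)
"""

code, details = worst_status(situation_2)
print(code, ", ".join(details))
```

Taking the worst of the four values means the removed disk in situation 2 is reported as critical even though svrStatusOverall.0 only says prefailure.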
[Nagios-users] Nagios and Cacti
Hi all,

I've been looking into using Cacti as an SNMP management tool running alongside my Nagios instance. Ideally, I would like Cacti to monitor various SNMP-exposed metrics on my hosts, and then have a service check in Nagios which parses Cacti's results (which I believe are RRD files) and sends alerts etc. Nagios itself will still be used for directly running checks for running services, errors in log files etc.

Does this approach make sense? One issue that I can think of is the difficulty of keeping the config files of Nagios and Cacti synchronised. I was planning on using Lilac Platform as my Nagios config file management tool, but how that is kept in sync with Cacti is a problem.

Has anyone ever set up an arrangement like this before?

Cheers,
Chris
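For the "service check which parses Cacti's results" part, the alerting half could look roughly like the sketch below. It assumes the latest sample has already been read from the graphing tool's storage (how to read the RRD file is deliberately left out), and evaluate_metric, the label and the thresholds are all hypothetical.

```python
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
STATE_NAMES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def evaluate_metric(label, value, warn, crit):
    """Return (exit_code, output_line) in Nagios plugin style.

    'value' is assumed to be the most recent sample, or None when no
    recent sample could be read.
    """
    if value is None:
        return UNKNOWN, f"UNKNOWN: no recent sample for {label}"
    if value >= crit:
        code = CRITICAL
    elif value >= warn:
        code = WARNING
    else:
        code = OK
    # Text after the pipe is performance data: label=value;warn;crit
    return code, f"{STATE_NAMES[code]}: {label} is {value} | {label}={value};{warn};{crit}"


# Hypothetical metric and thresholds, for illustration only.
exit_code, output = evaluate_metric("if_in_util", 95.0, 80, 90)
print(exit_code, output)
```

A wrapper script would print the output line and exit with the returned code, which is all Nagios needs from a plugin.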
Re: [Nagios-users] Name or service not known
Can you provide the relevant conf so we may understand more about your problem?

On Wednesday 08 April 2009 14:41:27 Thierry Lavallée wrote:

Hi,

I am getting the following about 50 times per day:

HTTP  CRITICAL  04-08-2009 09:37:29  0d 0h 1m 50s  1/4  Name or service not known
SSH   UNKNOWN   04-08-2009 09:34:39  0d 0h 4m 40s  1/4  Usage:check_ssh [-46] [-t timeout] [-r remote version] [-p port] host

I am somewhat at a loss here... Can anyone help with this?

--
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE

IMPORTANT . this email and the information in it may be confidential, legally privileged and/or protected by law. It is intended solely for the use of the person to whom it is addressed. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Please also delete all copies of this email and any attachments from your system. We cannot guarantee the security or confidentiality of email communications. We do not accept any liability for losses or damages that you may suffer as a result of your receipt of this email including but not limited to computer service or system failure, access delays or interruption, data non-delivery or mis-delivery, computer viruses or other harmful components. Copyright in this email and any attachments belong to Select Service Partner UK Limited. Should you communicate with anyone at Select Service Partner UK Limited by email, you consent to us monitoring and reading any such correspondence. Nothing in this email shall be taken or read as suggesting, proposing or relating to any agreement concerted practice or other practice that could infringe UK or EC competition legislation. Select Service Partner UK Limited is a company registered in England and Wales (company number 05687183) whose registered office is at 1 The Heights, Brooklands, Weybridge. Surrey.
KT13 0NY
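"Name or service not known" is commonly the resolver's wording when a host name cannot be looked up, so one thing worth ruling out is whether the address configured for the host resolves on the Nagios server. This is only one possible cause, not a diagnosis of the setup above. A minimal sketch, where check_resolves and the stand-in resolver are hypothetical names:

```python
import socket


def check_resolves(hostname, resolver=socket.getaddrinfo):
    """Return (exit_code, message) depending on whether hostname resolves.

    'resolver' defaults to socket.getaddrinfo; it is a parameter only so
    that the demo below can run without making a real DNS query.
    """
    if not hostname:
        return 3, "UNKNOWN: empty host address"
    try:
        results = resolver(hostname, None)
    except socket.gaierror as exc:
        return 2, f"CRITICAL: {hostname}: {exc.strerror or exc}"
    # Each result is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the address string.
    addresses = sorted({entry[4][0] for entry in results})
    return 0, f"OK: {hostname} resolves to {', '.join(addresses)}"


def _fake_failure(host, port):
    # Stand-in resolver for the demo: behaves like a failed lookup.
    raise socket.gaierror(-2, "Name or service not known")


status, message = check_resolves("no-such-host.example", resolver=_fake_failure)
print(status, message)
```

Calling `check_resolves` with only a host name uses the real resolver, which makes it easy to test the exact value configured in the host definition.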
Re: [Nagios-users] Name or service not known
or maybe you mean:

check_http

check-host-alive  $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp  $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk  $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load  $USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf  $USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w $ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs  $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap  $USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users  $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt  $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop  $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh  $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp  $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp  $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email  /usr/bin/printf %b * Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n | /bin/mail -s ** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ ** $CONTACTEMAIL$
notify-service-by-email  /usr/bin/printf %b * Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$ | /bin/mail -s ** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ ** $CONTACTEMAIL$
process-host-perfdata  /usr/bin/printf %b $LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n /usr/local/nagios/var/host-perfdata.out
process-service-perfdata  /usr/bin/printf %b $LASTSERVICECHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$SERVICESTATETYPE$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\n /usr/local/nagios/var/service-perfdata.out

check_SSH

check-host-alive  $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp  $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk  $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load  $USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf  $USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w $ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs  $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap  $USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users  $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt  $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop  $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh  $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp  $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp  $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email  /usr/bin/printf %b * Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n | /bin/mail -s ** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ ** $CONTACTEMAIL$
notify-service-by-email  /usr/bin/printf %b * Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$ | /bin/mail -s ** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ ** $CONTACTEMAIL$
process-host-perfdata  /usr/bin/printf %b $LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n /usr/local/nagios/var/host-perfdata.out
process-service-perfdata
Re: [Nagios-users] Name or service not known
Thierry,

The information you need to look into is in the Nagios configuration files, located in the etc directory of the Nagios installation directory.

Which version of Nagios are you running? Did you install from source or from packages? What distro are you using? How did you perform the initial installation?

Assaf

On Wednesday 08 April 2009 15:29:02 Thierry Lavallée wrote:

Thanks a lot for your reply Assaf, I am not sure which conf you mean, but here are a few things in the meantime. Maybe you could point this Nagios newbie to what you need? :/ thanks!

-- Thierry

--
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE
Re: [Nagios-users] Name or service not known
Thierry,

Please send all your replies to the list and not to individual members, so others may learn from the information gathered by other members.

Second: this is a command declaration in the commands file. Have you associated a service with the host? And even before that, did you install the nagios-plugins? The nagios-plugins are the actual check scripts executed to perform the checks, and without them you will not be able to use any of Nagios's capabilities.

Assaf

check_ssh $USER1$/check_ssh $ARG1$ $HOSTADDRESS$

--
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE
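To see why a command declaration alone is not enough, it can help to look at what the check_ssh line turns into once its macros are filled in from a host and service definition. The sketch below is a simplified illustration, not Nagios code: expand_macros is a hypothetical helper, the plugin directory and argument are assumed values, and treating missing macros as empty strings is a simplification.

```python
import re

MACRO_RE = re.compile(r"\$([A-Z0-9_]+)\$")


def expand_macros(command_line, macros):
    """Replace $NAME$ macros; macros without a value become empty strings."""
    return MACRO_RE.sub(lambda m: macros.get(m.group(1), ""), command_line)


definition = "$USER1$/check_ssh $ARG1$ $HOSTADDRESS$"
macros = {
    "USER1": "/usr/local/nagios/libexec",  # assumed plugin directory
    "ARG1": "-p 22",                        # hypothetical service argument
    "HOSTADDRESS": "192.0.2.10",            # documentation address
}

print(expand_macros(definition, macros))
# With no host address available the plugin is called without a host.
print(repr(expand_macros(definition, {"USER1": "/usr/local/nagios/libexec"})))
```

In the second case the plugin receives no host argument at all, which is the kind of call that makes a plugin print its usage text instead of a check result.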
Re: [Nagios-users] Adaptive Monitoring: Broken?
On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:

Here are the important stats:

Nagios Version: Version 3.1.0
Proficiency Level: Pretty damned high

While the first command works fine, and sets the service to an OK state, the next two (which I've tried in various combinations) show up in the Nagios logs as having been sent, but do nothing. The check that appears in the config files keeps running instead of my check_ok check.

Here's how it shows up in the logs:

[1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy-host;DNS;check_ok
[1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy-host;DNS;check_ok

I've noticed the message is different if I use an invalid command, so I'm relatively sure I'm using the right ones; they just don't do anything. Event handlers are enabled for these services, but even if they weren't the check command should change, right?

Am I doing something wrong here, or have I run into a bug?

I'm not using 3.x yet but just to provide some feedback, what you're doing looks reasonable from my reading of the documentation. I do see this in 3.1.0's commands.c though --

    /* SECURITY PATCH - disable these for the time being */
    switch(cmd){
    case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
    case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
    case CMD_CHANGE_HOST_EVENT_HANDLER:
    case CMD_CHANGE_SVC_EVENT_HANDLER:
    case CMD_CHANGE_HOST_CHECK_COMMAND:
    case CMD_CHANGE_SVC_CHECK_COMMAND:
        return ERROR;
    }

That's in the right section and my reading of the code is that it does exactly that; prevent changing of those values... Maybe it's something being worked on in the development branch?

-- Marc
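For anyone reproducing this, the log lines above correspond to external commands of the form "[timestamp] COMMAND;arg;arg". A minimal sketch of building such a line follows; format_external_command is a hypothetical helper, and writing the result to the external command file is left out.

```python
import time


def format_external_command(name, *args, timestamp=None):
    """Format one line for the Nagios external command file.

    The layout is "[<unix time>] <COMMAND>;<arg1>;<arg2>..." followed by
    a newline.
    """
    if timestamp is None:
        timestamp = int(time.time())
    fields = ";".join([name, *args])
    return f"[{timestamp}] {fields}\n"


line = format_external_command(
    "CHANGE_SVC_CHECK_COMMAND", "dummy-host", "DNS", "check_ok",
    timestamp=1239128528,
)
print(line, end="")
```

Note that with the switch statement quoted from commands.c in place, a 3.1.0 daemon would still log such a command and then return ERROR for it, which matches the behaviour described above.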
Re: [Nagios-users] Nagios and Cacti
And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers. Initially I had Nagios and Cacti both on one server, with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs).

We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts for all servers. Cacti does trending only, for all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Marco Tirado wrote:

Hello:

There are a couple of examples on the Nagios Exchange page of different approaches for integrating Nagios and Cacti. You should check that out. I believe the synchronization is going to cost you time and money; a better approach is to use nagios + pnp4nagios (this generates nice graphs) + check_snmp_int.pl (this for bandwidth tests). That way you have only one place to put your configuration. There are tons of other SNMP plugins you can use for other tests (CPU, memory, etc.).

//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney crist...@gmail.com wrote:

Hi all, I've been looking into making use of Cacti to act as an SNMP management tool which runs alongside my Nagios instance. Ideally, what I would like to do is have Cacti monitor various SNMP-exposed metrics on my hosts, and then have a service check in Nagios which parses Cacti's results (which I believe are RRD files) and send alerts etc.
Nagios itself will still be used for running directly checks for services running, errors in log files etc. Does this approach make sense? One issue that I can think of is the difficulty in keeping the config files of Nagios and Cacti synchronised. I was planning on using Lilac Platform to act as my Nagios config file management tool, but how that is kept in synch with Cacti is a problem. Has anyone ever set up an arrangement like this before? Cheers, Chris
Re: [Nagios-users] Nagios and Cacti
2009/4/8 Andrew Davis ncc...@gmail.com:

And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers. Initially I had Nagios and Cacti both on one server, with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts for all servers. Cacti does trending only, for all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.

That's useful to know Andrew, thanks.

Regarding the trending of network devices: is there any reason why this can't be done by Nagios? I intend to install PNP4Nagios to take care of graphing anyway, but I think it would be nice to have all my monitored resources under the one system (for notifications and ease of administration).

Is there some major advantage that Cacti provides for SNMP monitoring of network devices that cannot be achieved with Nagios and the various SNMP plug-ins available for it (e.g. these ones: http://nagios.manubulon.com)?

Cheers,
Chris
Re: [Nagios-users] Name or service not known
Thanks Assaf,

But I cannot get past step #1, "Relax - it's going to take some time." ;) hehe.

Really, I don't want to redo the whole install, as I am not THAT server techy and I try to rely on a service called Supreme Support that is supposed to make my life better. I need to point Supreme Support to where the problem is. Do you think you can point it out quickly from the config files I sent? Because most of it seems installed correctly, it looks like there are just loose ends.

Thanks again!
-- Thierry

2009/4/8 Assaf Flatto assaf.fla...@ssp-intl.com:
> In that case I suggest you start here: http://nagios.sourceforge.net/docs/3_0/beginners.html
> Read the documentation and most of your questions will be answered.
>
> Assaf
>
> On Wednesday 08 April 2009 16:00:35 you wrote:
>> Thanks Assaf. I did not do the installation, but the support who did the installation (Supreme Support) do not seem knowledgeable enough. I am attaching my config files (please tell me if it is not secure to send them like this). I am running 3.0.6. No idea about the distro; hoping you can still help me. Thanks!
>
> --
> Assaf Flatto
> SSP Ops Team
> Linux System Administrator
> 169 Euston Road, London, NW1 2AE
>
> IMPORTANT. This email and the information in it may be confidential, legally privileged and/or protected by law. It is intended solely for the use of the person to whom it is addressed. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Please also delete all copies of this email and any attachments from your system. We cannot guarantee the security or confidentiality of email communications. We do not accept any liability for losses or damages that you may suffer as a result of your receipt of this email including but not limited to computer service or system failure, access delays or interruption, data non-delivery or mis-delivery, computer viruses or other harmful components.
>
> Copyright in this email and any attachments belong to Select Service Partner UK Limited. Should you communicate with anyone at Select Service Partner UK Limited by email, you consent to us monitoring and reading any such correspondence. Nothing in this email shall be taken or read as suggesting, proposing or relating to any agreement concerted practice or other practice that could infringe UK or EC competition legislation. Select Service Partner UK Limited is a company registered in England and Wales (company number 05687183) whose registered office is at 1 The Heights, Brooklands, Weybridge. Surrey. KT13 0NY
[Nagios-users] problems with log rotation
Hi,

I am using Nagios 3.1 and have configured daily log rotation, but after several days I suspect there is a problem with it. I found several instances of nagios, the majority of them defunct, and they all started at 00:00 but on different days. The scheduler is going mad: a lot of checks are not launched, and I get a lot of these messages:

[1239201355] Warning: The check of service 'memoire disponible' on host 'www-tp' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'nombre total de process' on host 'www-tp' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'HTTP' on host 'yum' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'charge' on host 'yum' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'nombre d utilisateur' on host 'yum' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'ciscoswitch' on host 'Indicateurs' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'smtpsout' on host 'Indicateurs_detail' looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...

Does anyone know how to solve this problem?

Thanks in advance for any help.
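To see how widespread the orphaned checks are, the warnings quoted in this message can be tallied per host. The following is only a rough sketch, not anything from the original poster: the function name count_orphans_by_host is made up, it assumes the exact log line format shown above, and it assumes service names contain no apostrophes (it splits each line on single quotes).

```shell
#!/bin/sh
# Hypothetical helper: read a Nagios log on stdin and print "HOST COUNT" for
# every host that has orphaned-service warnings, sorted by host name.
count_orphans_by_host() {
    # Splitting on single quotes makes field 2 the service and field 4 the host.
    awk -F"'" '
        /check of service .* looks like it was orphaned/ { count[$4]++ }
        END { for (host in count) print host, count[host] }
    ' | sort
}
```

It would be fed the main log, for example count_orphans_by_host < /var/log/nagios/nagios.log (that path is an assumption; use whatever log_file points to in nagios.cfg). If every host shows up, the problem is more likely in the Nagios process itself than in individual plugins.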
Re: [Nagios-users] Name or service not known
At first glance it looks like they just installed it and never bothered to configure anything in it. Now, Nagios is a great tool, but it requires quite a bit of initial configuration and setup, which it looks like they didn't do.

Assaf

On Wednesday 08 April 2009 16:20:06 Thierry Lavallée wrote:
> Thanks Assaf,
>
> But I cannot get past step #1, "Relax - it's going to take some time." ;) hehe.
>
> Really, I don't want to redo the whole install, as I am not THAT server techy and I try to rely on a service called Supreme Support that is supposed to make my life better. I need to point Supreme Support to where the problem is. Do you think you can point it out quickly from the config files I sent? Because most of it seems installed correctly, it looks like there are just loose ends.
>
> Thanks again!

--
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE
Re: [Nagios-users] Nagios and Cacti
It depends on the intensity of your snmp usage. Cacti has a native daemon to do large scale snmp getting, and it does a great job of it. So if u have hundreds of devices, each with a lot of interfaces, u will probably like cacti. The user interface is also well done for graphing snmp data and thresholding on it using the threshold plugin.

= Daniel Feinsmith =
{sent from iPhone}

On Apr 8, 2009, at 8:15 AM, Christopher McAtackney crist...@gmail.com wrote:
> 2009/4/8 Andrew Davis ncc...@gmail.com:
>> And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers.
>>
>> Initially I had Nagios and Cacti both on one server with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4 GB of RAM, 2 x dual-core 2.x GHz CPUs).
>>
>> We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts of all servers. Cacti does trending only of all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.
>
> That's useful to know Andrew, thanks.
>
> Regarding the trending of network devices - is there any reason why this can't be done by Nagios? I intend to install PNP4Nagios to take care of graphing anyway, but I think it would be nice to have all my monitored resources under the one system (for notifications and ease of administration).
>
> Is there some major advantage that Cacti provides when it comes to SNMP monitoring of network devices that cannot be achieved with Nagios and the various SNMP plug-ins available for it (e.g. like these ones: http://nagios.manubulon.com)?
>
> Cheers,
> Chris
Re: [Nagios-users] Nagios and Cacti
If you move your mysql instance to another server, you can get much better performance on a nagios/cacti server. Check top while cacti is running a large install and you will see that mysql is hoarding CPU and memory resources, not leaving much for nagios.

= Daniel Feinsmith =
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis ncc...@gmail.com wrote:
> And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers.
>
> Initially I had Nagios and Cacti both on one server with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4 GB of RAM, 2 x dual-core 2.x GHz CPUs).
>
> We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts of all servers. Cacti does trending only of all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.
>
> A. Davis
> Email: ncc...@gmail.com
> There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
>
> Marco Tirado wrote:
>> Hello:
>>
>> There are a couple of examples on the Nagios Exchange page of different approaches for integrating nagios and cacti. You should check that out.
>>
>> I believe the synchronization is going to cost you time and money; a better approach is to use nagios + pnp4nagios (this generates nice graphs) + check_snmp_int.pl (this for bandwidth tests). That way you have only one place to put your configuration. There are tons of other snmp plugins you can use for other tests (CPU, memory, etc).
>>
>> //Marco
>>
>> On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney crist...@gmail.com wrote:
>>> Hi all,
>>>
>>> I've been looking into making use of Cacti to act as an SNMP management tool which runs alongside my Nagios instance. Ideally, what I would like to do is have Cacti monitor various SNMP-exposed metrics on my hosts, and then have a service check in Nagios which parses Cacti's results (which I believe are RRD files) and sends alerts etc. Nagios itself will still be used for running checks directly against running services, errors in log files, etc. Does this approach make sense?
>>>
>>> One issue that I can think of is the difficulty in keeping the config files of Nagios and Cacti synchronised. I was planning on using Lilac Platform to act as my Nagios config file management tool, but how that is kept in sync with Cacti is a problem. Has anyone ever set up an arrangement like this before?
>>>
>>> Cheers,
>>> Chris
Re: [Nagios-users] Nagios and Cacti
On Wed, Apr 8, 2009 at 11:52 AM, Daniel Emmanuel Feinsmith dan...@danielemmanuelfeinsmith.com wrote:
> It depends on the intensity of your snmp usage. Cacti has a native daemon to do large scale snmp getting, and it does a great job of it. So if u have hundreds of devices, each with a lot of interfaces, u will probably like cacti. The user interface is also well done for graphing snmp data and thresholding on it using the threshold plugin.

With parallel checks in Nagios 3, some configuration tuning and well-written SNMP checks, I'd argue that Nagios is as good a poller as cactid, if not better :). Our instance is not huge, but currently we do 7000+ SNMP-based checks in 3 minutes on a dual quad-core Linux-based server.

Before PNP I used to use Cacti and Nagios. I like Cacti, but with PNP around I would never go back to that combination again. Nagios + PNP really does simplify life for Nagios administrators and provides a lot of flexibility as far as how you scale your graphing as your node base grows.

- Max
Re: [Nagios-users] Nagios and Cacti
I agree. Initially I had Nagios doing all the trending. But with 400+ network devices, many of them with multiple 48-port blades, I found Cacti was easier to configure... it scaled a lot better. For a smaller network, you could easily do just Nagios. I've had no issues at all with Nagios + PNP for alerts and trending.

In fact, Nagios still watches my core network devices (but not all the ports of them... ie: Nagios watches that switch1 is up and available and trends its CPU and memory usage... however I use Cacti for trending the 6 blades, each with 48 ports, in switch1). This way, if switch1 fails or utilization is too high, Nagios tells me, but if a particular user is hogging all our bandwidth or having lots of packet loss, I find that via Cacti.

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Daniel Emmanuel Feinsmith wrote:
> It depends on the intensity of your snmp usage. Cacti has a native daemon to do large scale snmp getting, and it does a great job of it. So if u have hundreds of devices, each with a lot of interfaces, u will probably like cacti. The user interface is also well done for graphing snmp data and thresholding on it using the threshold plugin.
>
> = Daniel Feinsmith =
> {sent from iPhone}
>
> On Apr 8, 2009, at 8:15 AM, Christopher McAtackney crist...@gmail.com wrote:
>> 2009/4/8 Andrew Davis ncc...@gmail.com:
>>> And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers.
>>>
>>> Initially I had Nagios and Cacti both on one server with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4 GB of RAM, 2 x dual-core 2.x GHz CPUs).
>>>
>>> We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts of all servers. Cacti does trending only of all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.
>>
>> That's useful to know Andrew, thanks.
>>
>> Regarding the trending of network devices - is there any reason why this can't be done by Nagios? I intend to install PNP4Nagios to take care of graphing anyway, but I think it would be nice to have all my monitored resources under the one system (for notifications and ease of administration).
>>
>> Is there some major advantage that Cacti provides when it comes to SNMP monitoring of network devices that cannot be achieved with Nagios and the various SNMP plug-ins available for it (e.g. like these ones: http://nagios.manubulon.com)?
>>
>> Cheers,
>> Chris
Re: [Nagios-users] Nagios Warning Bug/Misconfiguration
Got the issue. Turns out someone installed the Cisco MIBs along with the net-snmp MIBs. Thank you very much for your help.

For anyone searching who finds this: check /usr/local/share/snmp/mibs and see if you have both some-snmp-mib.txt and some-snmp-mib.my.

Thanks again!
-=Tom Donnelly

Patrick Morris wrote:
> On Tue, 07 Apr 2009, Thomas Donnelly wrote:
>> Thanks for the quick reply! Ran from the command line and got:
>>
>> # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95: -c 75:
>> SNMP WARNING - 100 | SNMPv2-SMI::mib-2.33.1.2.4.0=100
>> # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95 -c 75:
>> SNMP WARNING - *100* | SNMPv2-SMI::mib-2.33.1.2.4.0=100
>>
>> So by intentionally triggering it again (remove :), it shows the *'s
>
> How about if you add a -v to get verbose output? Also, you may want to check the return code from the manual run on an OK result (for example, by running echo $? after your check_snmp command to make sure it matches what you see in the output).
>
> What happens in my case occasionally is that I install a screwed-up MIB for an unrelated service. It won't show any obvious errors, but it will cause check_snmp to return a warning result code regardless of whether the SNMP result falls within my thresholds. In effect, it's warning me that my MIBs are hosed, based on the fact that it got a non-OK result from snmpget (which is what check_snmp calls to do the actual SNMP getting).
>
>> Not really sure what they mean by:
>>
>> 1. Prevent check_snmp from loading the MIBs (default behaviour) by using numeric oids AND using the -m : option
>
> If you use a numeric OID rather than mib-2.33.1.2.4.0 and pass the -m : option, then check_snmp (and, by extension, snmpget) doesn't need to load the MIBs at all, so you don't get an error if you've got a bad MIB.
>
>> Patrick Morris wrote:
>>> What happens when you run it manually? This, maybe? http://www.nagios.org/faqs/viewfaq.php?faq_id=208
>>>
>>> On Tue, 07 Apr 2009, Thomas Donnelly wrote:
>>>> Hi all,
>>>>
>>>> I am having an issue with all of the devices I added showing warning all the time. It is a simple snmp check to see if the amps are above 160warn/180critical. They always say warning even though they are less than the specified 160. One thing to note is that once it hits the 160 mark it gets the * value *, in turn showing that it is actually in the warning range. I hope I have shown the necessary data below. Any/all help is greatly appreciated.
>>>>
>>>> # uname -a
>>>> FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue Oct 9 22:39:13 EST 2007 r...@server.example.net:/usr/obj/usr/src/sys/MONITOR i386
>>>>
>>>> Nagios Version 2.0b3
>>>>
>>>> # ./check_snmp -V
>>>> check_snmp (nagios-plugins 1.4.3) 1.58
>>>>
>>>> From the webui:
>>>>
>>>> APC-RR-R3-1.hou  check_rr_amp  WARNING  04-07-2009 13:00:17  8d 3h 17m 2s  10/10  SNMP WARNING - 90
>>>> APC-RR-R3-2.hou  check_rr_amp  WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*
>>>>
>>>> ^ note the * 160 * for the one that actually is in the warning range.
>>>>
>>>> From checkcommands.cfg:
>>>>
>>>> define command {
>>>>     command_name    check_rr_amp
>>>>     command_line    $USER1$/check_snmp -H $HOSTADDRESS$ -o mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
>>>> }
>>>>
>>>> From the hosts config file:
>>>>
>>>> define service{
>>>>     host_name               APC-RR-R1-1.hou
>>>>     service_description     check_rr_amp
>>>>     check_command           check_rr_amp!159!179
>>>>     max_check_attempts      10
>>>>     normal_check_interval   5
>>>>     retry_check_interval    3
>>>>     check_period            24x7
>>>>     notification_interval   30
>>>>     notification_period     24x7
>>>>     notification_options    w,c,r
>>>>     contact_groups          backup-admins
>>>> }
>>>>
>>>> Thanks!
>>>> -=Tom
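Tom's closing tip (look for the same MIB present as both a .txt and a .my file) can be sketched as a small shell function. This is an illustration only: the name find_duplicate_mibs is made up, and the default directory is simply the one mentioned in the message, which may differ on other systems.

```shell
#!/bin/sh
# Hypothetical helper: list MIB base names that exist as both NAME.txt and
# NAME.my in one directory (the situation described in the thread above).
find_duplicate_mibs() {
    dir="${1:-/usr/local/share/snmp/mibs}"
    for txt in "$dir"/*.txt; do
        # Skip the unexpanded pattern when the directory has no .txt files.
        [ -e "$txt" ] || continue
        base="${txt%.txt}"
        if [ -e "$base.my" ]; then
            basename "$base"
        fi
    done
}
```

Running find_duplicate_mibs with no argument checks the directory from the message; any name it prints is a candidate for the kind of conflict that made check_snmp return WARNING regardless of the thresholds.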
Re: [Nagios-users] Children unreachable on soft down?
On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
> So is this just something I'll have to live with? I don't seem to be getting much feedback on the subject. :(

Well, my response would be to fix the problem that's causing the outages in the first place or adjust the way you're monitoring the parents so that the plugin used recognizes when this temporary event is occurring.

What you're asking for is that nagios track that the child went from down-unreachable-down without an intermediate OK state and suppress notifications in that case. That would appear to be a code change and would be better discussed on nagios-devel, but I would encourage the check plugin approach first.

--
Marc
Re: [Nagios-users] Nagios and Cacti
> Is there some major advantage that Cacti provides when it comes to SNMP monitoring of network devices that cannot be achieved with Nagios and the various SNMP plug-ins available for it (e.g. like these ones: http://nagios.manubulon.com)?

Also, does anyone have some nagios config examples integrating PNP and these SNMP plugins? I have been trying to get an idea of how to create the commands.cfg and services.cfg using these parts. Any examples of host-based checks would be great. I have tried to read the relevant docs but have not found explicit nagios .cfg examples; if they exist, a gentle pointer would also be great.

Thank you,
Giovanni
Re: [Nagios-users] Children unreachable on soft down?
On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:
> On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
>> So is this just something I'll have to live with? I don't seem to be getting much feedback on the subject. :(
>
> Well, my response would be to fix the problem that's causing the outages in the first place or adjust the way you're monitoring the parents so that the plugin used recognizes when this temporary event is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I explained in one of my e-mails, they are an artifact of the connection type), so that leaves us with adjusting the monitoring. Now I thought that the recheck options were there exactly for this reason: to catch brief outages and not alert. And for the parent host that seems to be the case, but apparently that logic doesn't carry on to the child hosts. As such, somehow things would need to be adjusted so it never even sees the outages, even enough to go into a soft down state.

Anyone have any suggestions for how I can accomplish this? Adjusting the timeout or using, say, an ssh check rather than icmp won't do it - the packets are still lost, and the ssh check would still time out. Perhaps if I sent more pings at longer intervals (so that if it doesn't get a response the single check retries at 15 second intervals or so before returning a response), but then the check would start taking several seconds or more to complete, and that wouldn't be a good thing. Assuming nagios even allowed a check to run that long - doesn't it have a mechanism to kill a check that doesn't return in a given time frame? I'm a little stumped here as to how I can adjust things.

> What you're asking for is that nagios track that the child went from down-unreachable-down without an intermediate OK state and suppress notifications in that case. That would appear to be a code change and would be better discussed on nagios-devel, but I would encourage the check plugin approach first.

Ok. I know there is code in there that knows who it sent down messages to and doesn't send up messages to people that didn't get a down (primarily dealing with escalations), so I was hoping that maybe there would be something similar for this, i.e. seeing that the last notification sent was a down notification, and as such there is no need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Marc
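The idea floated in this message, having a single check retry at longer intervals before it reports a failure, could be sketched as a thin wrapper around an existing plugin. This is only an illustration under assumptions: retry_check is a made-up name, the attempt count and delay are placeholders, and nothing here comes from the Nagios distribution itself.

```shell
#!/bin/sh
# Hypothetical wrapper: run a check command up to ATTEMPTS times, waiting
# DELAY seconds between tries, and stop at the first attempt that exits 0.
# Only the output and exit status of the last attempt are reported.
# Usage: retry_check ATTEMPTS DELAY COMMAND [ARGS...]
retry_check() {
    attempts="$1"
    delay="$2"
    shift 2
    try=1
    status=3                                  # Nagios UNKNOWN if no attempt runs
    output="UNKNOWN - no attempts were made"
    while [ "$try" -le "$attempts" ]; do
        if output=$("$@" 2>&1); then
            status=0
            break
        else
            status=$?
        fi
        # Sleep only when another attempt remains.
        if [ "$try" -lt "$attempts" ]; then
            sleep "$delay"
        fi
        try=$((try + 1))
    done
    printf '%s\n' "$output"
    return "$status"
}
```

A parent host's check command could then call something like retry_check 3 15 ./check_ping -H 192.0.2.10 -w 100,20% -c 500,60% (the address and thresholds are placeholders). On the question of long-running checks: Nagios does kill checks that exceed host_check_timeout or service_check_timeout in nagios.cfg, so attempts multiplied by (check runtime plus delay) has to stay under that limit, and a slow host check also delays whatever else is waiting on its result.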
[Nagios-users] multiple parents
We have several hosts that have multiple parents. Will the child notify as down if only one of the parents is down? Or will the child suppress notifications because one of its parents is down?

-Lori
Re: [Nagios-users] Children unreachable on soft down?
I wonder if there is something you can do with notification escalations? I know you can control how the notifications are sent out, but I don't know if a state change from down to unreachable to down will cause the escalation to reset.

From: Israel Brewster [mailto:isr...@frontierflying.com]
Sent: Wednesday, April 08, 2009 2:32 PM
To: Marc Powell
Cc: nagios-users@lists.sourceforge.net Users
Subject: Re: [Nagios-users] Children unreachable on soft down?

On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:
> On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
>> So is this just something I'll have to live with? I don't seem to be getting much feedback on the subject. :(
>
> Well, my response would be to fix the problem that's causing the outages in the first place or adjust the way you're monitoring the parents so that the plugin used recognizes when this temporary event is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I explained in one of my e-mails, they are an artifact of the connection type), so that leaves us with adjusting the monitoring. Now I thought that the recheck options were there exactly for this reason: to catch brief outages and not alert. And for the parent host that seems to be the case, but apparently that logic doesn't carry over to the child hosts. As such, somehow things would need to be adjusted so it never even sees the outages, even enough to go into a soft down state.

Anyone have any suggestions for how I can accomplish this? Adjusting the timeout or using, say, an ssh check rather than icmp won't do it - the packets are still lost, and the ssh check would still time out. Perhaps if I sent more pings at longer intervals (so that if it doesn't get a response the single check retries at 15 second intervals or so before returning a response), but then the check would start taking several seconds or more to complete, and that wouldn't be a good thing. Assuming nagios even allowed a check to run that long - doesn't it have a mechanism to kill a check that doesn't return in a given time frame? I'm a little stumped here how I can adjust things.

> What you're asking for is that nagios track that the child went from down-unreachable-down without an intermediate OK state and suppress notifications in that case. That would appear to be a code change and would be better discussed on nagios-devel but I would encourage the check plugin approach first.

Ok. I know there is code in there that knows who it sent down messages to and doesn't send up messages to people who didn't get a down (primarily dealing with escalations), so I was hoping that maybe there would be something similar for this, i.e. seeing that the last notification sent was a down notification, and as such there is no need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> -- Marc
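The "more pings at longer intervals" idea discussed in this thread could look roughly like the sketch below. It is only an illustration, not something posted to the list: the function names, the attempt count and the use of Linux iputils `ping` flags are all assumptions. The total run time (attempts times delay) would also have to stay below whatever check timeout Nagios is configured with (host_check_timeout or service_check_timeout in nagios.cfg), otherwise Nagios would kill the check before it returns.

```python
import subprocess
import time

OK, CRITICAL = 0, 2  # standard Nagios plugin exit codes


def ping_once(host, timeout_s=2):
    """Send one ICMP echo. Assumes Linux iputils ping (-c count, -W timeout)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def check_with_retries(probe, attempts=4, delay_s=15, sleep=time.sleep):
    """Return (exit_code, message); report OK as soon as one probe succeeds."""
    for attempt in range(1, attempts + 1):
        if probe():
            return OK, f"OK - reply on attempt {attempt}/{attempts}"
        if attempt < attempts:
            sleep(delay_s)  # wait out a brief outage before trying again
    return CRITICAL, f"CRITICAL - no reply after {attempts} attempts"
```

A wrapper script would call `check_with_retries(lambda: ping_once(host))`, print the message and exit with the returned code. Because the plugin only reports a failure after all attempts fail, the parent would never enter a soft down state during a short outage, at the cost of a slower check.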
Re: [Nagios-users] Nagios and Cacti
I agree with Daniel's post below. We have Nagios and Cacti running on the same system; Nagios monitors 691 hosts and 1800 services while Cacti is pulling stats for about the same number of hosts, but something like 3200 data sources. They run on a dual Xeon 2.8 GHz box with only 2 GB of RAM (no swapping going on). Average load is about 1.5 and peaks at 3 about 3-4 times a day. The key is that mysql operations are on a dedicated box with 15k SCSI drives and RAID 10.

James Moseley

From: Daniel Emmanuel Feinsmith dan...@danielemmanuelfeinsmith.com
To: ncc...@gmail.com
Cc: Nagios Users Nagios-users@lists.sourceforge.net
Date: 04/08/2009 10:36 AM
Subject: Re: [Nagios-users] Nagios and Cacti

If you move your mysql instance to another server, you can get much better performance on a nagios/cacti server. Check top while cacti is running a large install and you will see that mysql is hoarding CPU and memory resources, not leaving much for nagios.

= Daniel Feinsmith =
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis ncc...@gmail.com wrote:

And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers. Initially I had Nagios and Cacti both on one server with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4 GB of RAM, 2 x dual core 2.x GHz CPUs).

We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts of all servers. Cacti does trending only of all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Marco Tirado wrote:

Hello: There are a couple of examples on the nagios exchange page of different approaches for integrating nagios and cacti. You should check that out. I believe the synchronization is going to cost you time and money; a better approach is to use nagios + pnp4nagios (this generates nice graphs) + check_snmp_int.pl (this for bandwidth tests). That way you have only one place to put your configuration. There are tons of other snmp plugins you can use for other tests (CPU, memory, etc.).

//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney crist...@gmail.com wrote:

Hi all, I've been looking into making use of Cacti to act as an SNMP management tool which runs alongside my Nagios instance. Ideally, what I would like to do is have Cacti monitor various SNMP-exposed metrics on my hosts, and then have a service check in Nagios which parses Cacti's results (which I believe are RRD files) and sends alerts etc. Nagios itself will still be used for directly running checks for running services, errors in log files etc. Does this approach make sense? One issue that I can think of is the difficulty in keeping the config files of Nagios and Cacti synchronised. I was planning on using Lilac Platform to act as my
Re: [Nagios-users] Adaptive Monitoring: Broken?
On Wed, 08 Apr 2009, Marc Powell wrote:
> On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
>> Here are the important stats:
>> Nagios Version: Version 3.1.0
>> Proficiency Level: Pretty damned high
>>
>> While the first command works fine, and sets the service to an OK state, the next two (which I've tried in various combinations) show up in the Nagios logs as having been sent, but do nothing. The check that appears in the config files keeps running instead of my check_ok check. Here's how it shows up in the logs:
>>
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy-host;DNS;check_ok
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy-host;DNS;check_ok
>>
>> I've noticed the message is different if I use an invalid command, so I'm relatively sure I'm using the right ones; they just don't do anything. Event handlers are enabled for these services, but even if they weren't the check command should change, right? Am I doing something wrong here, or have I run into a bug?
>
> I'm not using 3.x yet but just to provide some feedback, what you're doing looks reasonable from my reading of the documentation. I do see this in 3.1.0's commands.c though --
>
>     /* SECURITY PATCH - disable these for the time being */
>     switch(cmd){
>     case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
>     case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
>     case CMD_CHANGE_HOST_EVENT_HANDLER:
>     case CMD_CHANGE_SVC_EVENT_HANDLER:
>     case CMD_CHANGE_HOST_CHECK_COMMAND:
>     case CMD_CHANGE_SVC_CHECK_COMMAND:
>         return ERROR;
>     }
>
> That's in the right section and my reading of the code is that it does exactly that; prevent changing of those values... Maybe it's something being worked on in the development branch?

Thanks! I should have done some code-diving, because that goes pretty far toward explaining why those commands don't work for me as currently documented.
[Nagios-users] how to add Q1,Q2,Q3,Q4 reports
Any idea how I can add a few custom report periods to the report period drop-down list in the Nagios availability report, such as Q1 (first quarter), Q2 (second quarter), etc.?
Re: [Nagios-users] how to add Q1,Q2,Q3,Q4 reports
XYZ XYZ wrote:
> Any idea how do i add few custom reports to nagios report period drop down list in availability report like Q1(first quarter), Q2(second quarter)... etc.

Edit the source code and recompile?
Re: [Nagios-users] Adaptive Monitoring: Broken?
Marc Powell wrote:
> On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
>> Here are the important stats:
>> Nagios Version: Version 3.1.0
>> Proficiency Level: Pretty damned high
>>
>> While the first command works fine, and sets the service to an OK state, the next two (which I've tried in various combinations) show up in the Nagios logs as having been sent, but do nothing. The check that appears in the config files keeps running instead of my check_ok check. Here's how it shows up in the logs:
>>
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy-host;DNS;check_ok
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy-host;DNS;check_ok
>>
>> I've noticed the message is different if I use an invalid command, so I'm relatively sure I'm using the right ones; they just don't do anything. Event handlers are enabled for these services, but even if they weren't the check command should change, right? Am I doing something wrong here, or have I run into a bug?
>
> I'm not using 3.x yet but just to provide some feedback, what you're doing looks reasonable from my reading of the documentation. I do see this in 3.1.0's commands.c though --
>
>     /* SECURITY PATCH - disable these for the time being */
>     switch(cmd){
>     case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
>     case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
>     case CMD_CHANGE_HOST_EVENT_HANDLER:
>     case CMD_CHANGE_SVC_EVENT_HANDLER:
>     case CMD_CHANGE_HOST_CHECK_COMMAND:
>     case CMD_CHANGE_SVC_CHECK_COMMAND:
>         return ERROR;
>     }
>
> That's in the right section and my reading of the code is that it does exactly that; prevent changing of those values... Maybe it's something being worked on in the development branch?

It's not. That snippet comes from Nov 30 2008 as a measure to prevent CVE-2008-5027 (cmd.cgi authorization bypass vulnerability) and CVE-2008-5028 (cross-site request forgery) from becoming remote command execution vulnerabilities. Ethan added that snippet as an extra security measure. It's been in Nagios since 3.0.4.

Assuming both the patches I sent are applied, it's safe to remove that particular snippet and recompile Nagios.

I wrote about the two vulnerabilities here in case anyone needs to refresh their memory:
http://blogs.op5.org/blog4.php/2008/11/11/nagios-cmd-cgi-authorization-bypass-vuln
http://blogs.op5.org/blog4.php/2008/11/11/cross-site-request-forgery-vulnerability-6

The patches to prevent them are available here:
http://git.op5.org/git/?p=nagios.git;a=shortlog;h=refs/heads/security

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
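For readers following this thread, here is a rough sketch of how such an external command line is built and which commands the quoted commands.c snippet rejects. It is an illustration only: the helper names are hypothetical, and the command file path in the usage note is an assumption that has to match command_file in your own nagios.cfg.

```python
import time

# Commands for which the commands.c snippet quoted in this thread returns ERROR.
BLOCKED_BY_SECURITY_PATCH = {
    "CHANGE_GLOBAL_HOST_EVENT_HANDLER",
    "CHANGE_GLOBAL_SVC_EVENT_HANDLER",
    "CHANGE_HOST_EVENT_HANDLER",
    "CHANGE_SVC_EVENT_HANDLER",
    "CHANGE_HOST_CHECK_COMMAND",
    "CHANGE_SVC_CHECK_COMMAND",
}


def is_blocked(name):
    """True if a Nagios build that still contains the snippet ignores the command."""
    return name in BLOCKED_BY_SECURITY_PATCH


def format_external_command(name, args, now=None):
    """Build one '[timestamp] NAME;arg1;arg2' line for the command file."""
    timestamp = int(time.time() if now is None else now)
    return "[{}] {}\n".format(timestamp, ";".join([name, *args]))


def submit(command_file, name, args, now=None):
    """Write the command to the external command file (a named pipe)."""
    with open(command_file, "w") as pipe:
        pipe.write(format_external_command(name, args, now))
```

Usage might look like `submit("/usr/local/nagios/var/rw/nagios.cmd", "CHANGE_SVC_CHECK_COMMAND", ["dummy-host", "DNS", "check_ok"])`. On a build where `is_blocked()` applies, the command is still logged as received, which matches what Patrick saw, but it has no effect.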
Re: [Nagios-users] multiple parents
Lori Adams wrote:
> We have several hosts that have multiple parents. Will the child notify as down if only one of the parents is down? Or will the child suppress notifications because one of its parents is down?

All parents of a host have to be down for it to become unreachable. If you have configured the host (and your contacts) to send notifications on unreachable states, you may still get notifications for it, but not HOST DOWN ones.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
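The rule described above can be written down as a small model. This is a sketch of the decision only, not Nagios source code; the function name is hypothetical, and treating an UNREACHABLE parent the same as a DOWN one is an assumption on top of what the reply states.

```python
def child_state_when_check_fails(parent_states):
    """Classify a host whose own check just failed.

    parent_states: one of "UP", "DOWN" or "UNREACHABLE" per configured parent.
    Assumption: an UNREACHABLE parent blocks the path just like a DOWN one.
    """
    if parent_states and all(state != "UP" for state in parent_states):
        return "UNREACHABLE"  # no working path through any parent
    return "DOWN"  # at least one parent is up (or there are no parents)


# One of two parents down: the child is still reported as DOWN.
print(child_state_when_check_fails(["DOWN", "UP"]))
# Both parents down: the child becomes UNREACHABLE.
print(child_state_when_check_fails(["DOWN", "DOWN"]))
```

So with multiple parents, a single failed parent does not suppress HOST DOWN notifications for the child.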
[Nagios-users] Hostnames and regex
I recently changed a Nagios configuration to use a regex with hostnames but it hasn't been entirely satisfactory. I have e.g.

    define service{
        use                  generic-service
        host_name            host*
        service_description  Total Processes
        check_command        snmp_procs!120!150
        }

but for some hosts, I need to use different thresholds. I tried to use

    define service{
        use                  generic-service
        host_name            hostNN
        service_description  Total Processes
        check_command        snmp_procs!200!250
        }

but hostNN still alerts based on the host* thresholds. I tried placing the definition for hostNN both before and after the host* definition - it made no difference. From what I read of Nagios regexes, they're not full regexes, so it wouldn't be possible to write one which matched host* but didn't match hostNN. Is there a way of doing what I want, apart from the obvious one of renaming the hosts which I don't want to match the regex?

__
Kindest regards,
Niall O Broin
MakaluMedia Group | http://makalumedia.com
[Nagios-users] NSClient not providing results
Hi,

When I set NSClient++ to start automatically in Services, it's running but not providing any results. All I'm getting is "could not fetch information from server". Even when I run the check_nt plugin manually, I get the same result:

    [r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119 -p 12489 -v MEMUSE -w 80 -c 90
    could not fetch information from server

After searching and reading mailing lists I've tweaked certain things like disabling the firewall, enabling DEP, etc. Still the same result. The only way I could make NSClient++ respond is by running it on the command line with the -test option:

    c:\NSClient++>NSClient++.exe -test

When I run this, the checks go green:

    [r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119 -p 12489 -v MEMUSE -w 80 -c 90
    Memory usage: total:4308.71 Mb - used: 1606.41 Mb (37%) - free: 2702.30 Mb (63%) | 'Memory usage'=1606.41Mb;3446.97;3877.84;0.00;4308.71

Every time I reboot the machine, I need to start it from the command line again. Any solution or input? Please advise. Thanks.

--
Regards
Mkrish
[Nagios-users] Monitor netstat connection states using nagios.
Hi All,

I am using check_tcp to check the status of a particular port on the server. This is working well. I also need to monitor the listening port for connections in the SYN_RECV state. For example, I have an ldap service running on port 3890, so the command

    netstat -anp | grep 3890

shows

    tcp        0      0 0.0.0.0:3890          0.0.0.0:*           LISTEN       16029/java
    tcp        0      0 10.121.30.121:3890    10.121.6.1:8831     ESTABLISHED  16029/java
    tcp        0      0 10.121.30.121:3890    10.121.6.1:61052    ESTABLISHED  16029/java
    tcp      228      0 10.121.30.121:3890    10.121.6.1:49440    ESTABLISHED  16029/java
    tcp        0      0 10.121.30.121:3890    10.121.6.1:11664    SYN_RECV     16029/java

The established connections are fine, but we need to monitor the SYN_RECV status. If any such (SYN_RECV) connection appears, we should immediately get an alert from nagios. Is there any way to monitor such states with nagios or check_tcp? I have written a shell script to monitor such events, and I would like to integrate that script into nagios. Is that possible? Please help me.

Thanks
--
Shankar
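One way the check asked about here could be packaged as a plugin is sketched below. It is a minimal illustration, not an existing Nagios plugin: the function names and the threshold are hypothetical, and it assumes netstat output in the Linux column layout shown in the post (proto, Recv-Q, Send-Q, local address, foreign address, state).

```python
OK, CRITICAL = 0, 2  # standard Nagios plugin exit codes


def count_connections(netstat_output, port, state="SYN_RECV"):
    """Count TCP sockets whose local address ends in :port and match state."""
    suffix = f":{port}"
    count = 0
    for line in netstat_output.splitlines():
        fields = line.split()
        # Skip headers and anything that is not a TCP socket line.
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue
        if fields[3].endswith(suffix) and fields[5] == state:
            count += 1
    return count


def plugin_result(count, port, critical_at=1):
    """Map the count to a plugin exit code and a one-line status message."""
    status = "CRITICAL" if count >= critical_at else "OK"
    code = CRITICAL if count >= critical_at else OK
    return code, f"{status} - {count} SYN_RECV connection(s) on port {port}"


SAMPLE = """\
tcp 0 0 0.0.0.0:3890 0.0.0.0:* LISTEN 16029/java
tcp 0 0 10.121.30.121:3890 10.121.6.1:8831 ESTABLISHED 16029/java
tcp 0 0 10.121.30.121:3890 10.121.6.1:61052 ESTABLISHED 16029/java
tcp 228 0 10.121.30.121:3890 10.121.6.1:49440 ESTABLISHED 16029/java
tcp 0 0 10.121.30.121:3890 10.121.6.1:11664 SYN_RECV 16029/java
"""

print(plugin_result(count_connections(SAMPLE, 3890), 3890)[1])
```

A real plugin would feed the function the live output of netstat on the monitored server, print the message and exit with the returned code. Since the data is local to that server, it would typically be run there through an agent such as NRPE rather than from the Nagios host.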
[Nagios-users] Nagios for IRIX
Greetings.

I have IRIX 6.5 servers which I would like to monitor using Nagios 3.0. I am able to monitor other servers such as Linux and Windows, but I am not able to configure Nagios for the IRIX servers. Please guide me on how to install and configure the Nagios plug-ins on IRIX. Where can we get Nagios plug-ins for IRIX?

Thanks and Regards
Amol Bute