[Nagios-users] Scheduling Queue stucked a few minutes after restart
hello, I have a really odd issue running Nagios: a few minutes after starting the scheduling queue seems to freeze and no more active checks are performed. The queue remains stucked for hours until I have to manually restart Nagios. Passive checks are processed normally. I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The harware is an 8-core Xeon CPU with 16GB RAM. Nagios is monitoring about 1K hosts and 10K services. Reverting back the configuration to last known good configuration did not help, neither did rebooting the server and several Nagios restarts and reloads. Already tried fixes: - disabled all active hosts checks - increased ulimit for nagios user - disabled all event handlers - disabled all obsess stuff Any help or hint would be appreciated. nagios.cfg follows *** log_file=/nagios_fe/var/log/nagios3/nagios.log cfg_file=/etc/nagios3/commands.cfg cfg_dir=/etc/nagios-plugins/config cfg_dir=/nagios_fe/etc/cmon/nagios3 cfg_dir=/nagios_fe/etc/nagiosgrapher/nagios3 object_cache_file=/nagios_fe/var/cache/nagios3/objects.cache precached_object_file=/nagios_fe/var/lib/nagios3/objects.precache resource_file=/nagios_fe/etc/cmon/nagios3/macros.res status_file=/nagios_fe/var/cache/nagios3/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/nagios_fe/var/lib/nagios3/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/nagios_fe/var/run/nagios3/nagios3.pid temp_file=/nagios_fe/var/cache/nagios3/nagios.tmp temp_path=/tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/nagios_fe/var/log/nagios3/archives use_syslog=0 log_notifications=1 log_service_retries=0 log_host_retries=0 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=0 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/nagios_fe/var/lib/nagios3/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/nagios_fe/var/lib/nagios3/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=1 enable_event_handlers=0 process_performance_data=1 service_perfdata_file=/nagios_fe/var/lib/nagiosgrapher/ngraph.pipe service_perfdata_file_template=$HOSTNAME$\t$SERVICEDESC$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\t$TIMET$\n service_perfdata_file_mode=a service_perfdata_file_processing_interval=5 service_perfdata_file_processing_command=ngraph-process-service-perfdata-pipe obsess_over_services=0 obsess_over_hosts=0 translate_passive_host_checks=0 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=0 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=euro p1_file=/usr/lib/nagios3/p1.pl enable_embedded_perl=0 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^*|'?,()= illegal_macro_output_chars=`~$|' use_regexp_matching=0 use_true_regexp_matching=0 admin_email=r...@localhost admin_pager=pager...@localhost daemon_dumps_core=0 use_large_installation_tweaks=1 enable_environment_macros=0 debug_level=144 debug_verbosity=1 debug_file=/nagios_fe/var/log/nagios3/nagios.debug max_debug_file_size=20 *** -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net
[Nagios-users] Plugin: nagiosVMware
Hi I am trying to Configure the nagiosVMware plugin I found on https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware Generally it works but now I found that most of the time the CPU and MEM-checks give no result: [nag...@nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit -1, cpuload -1 on 0 cpus, memtot -1, memfree -1 esx19 ESX-OVERCMMT3 Memory overcommitment -1 esx19 ESX-CPU-LOAD3 CPU load average -1 on 0 CPUs esx19 ESX-MEMORY 3 Memory use -1 total -1 free ... mem/cpu took 8 seconds ... host esx19 took 8 seconds Processing host esx20.dtnet.de but sometimes it works: [nag...@nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit 0.00, cpuload 0.19 on 8 cpus, memtot 32766, memfree 11680 esx19 ESX-OVERCMMT0 Memory overcommitment 0% esx19 ESX-CPU-LOAD0 CPU load average 19% on 8 CPUs esx19 ESX-MEMORY 0 Memory use 64% (21086 of 32766 MB used) ... mem/cpu took 8 seconds ... host esx19 took 8 seconds This is the same effect with all ESX servers. All are ESX4.0 in the cmd file I get this error [2010-12-28 14:17:16.275 7324B90 warning 'App'] Closing Response processing in unexpected state: 3 Has anyone managed to run this without errors? s_teeter: are you on this list? (I did not find an email address) Regards Sebastian Ries -- DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart Tel: +49-711-849910-36 Fax: +49-711-849910-936 WEB: http://www.dtnet.de/ email: sebastian.r...@dtnet.de -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Scheduling Queue stucked a few minutes after restart
Maurizio Pinotti wrote: I have a really odd issue running Nagios: a few minutes after starting the scheduling queue seems to freeze and no more active checks are performed. The queue remains stucked for hours until I have to manually restart Nagios. I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The harware is I think that is a bug in that version of Nagios. I had the same problem. It got fixed, but I still go look at my service checks every morning to make sure. Also, I see where the server guys acknowledge problems and then forget about them, heh heh. There is a much newer version of Nagios available in lenny-backports. I would give it a shot if you can. http://packages.debian.org/source/lenny-backports/backports/nagios3 -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Host/service escalation notification
Hi. I want to create several levels of host and service escalation. Say 3 levels. In notification I want to know on which escalation level this particular notification occurred. Can't find any variable reflecting escalation level. Thanks a lot! Dmytro Leonenko -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring windows event viewer.
Toonz IT wrote: Is it possible to monitor specific event ids like disk error, fro windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring windows event viewer.
If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -Original Message- From: Chris Beattie [mailto:cbeat...@geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: Is it possible to monitor specific event ids like disk error, fro windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Host/service escalation notification
On Tue, Dec 28, 2010 at 04:19:21PM +0200, Дмитрий Леоненко wrote: I want to create several levels of host and service escalation. Say 3 levels. In notification I want to know on which escalation level this particular notification occurred. Can't find any variable reflecting escalation level. Escalation levels are connected to the notification number, and escalations can be kind of orthogonal. Do the Notification Number ($SERVICENOTIFICATIONNUMBER$ and/or $HOSTNOTIFICATIONNUMBER$) macros the job? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring windows event viewer.
Doesn't syslog-ng just consolidate the logs, it doesn't really monitor anything right? Dan -Original Message- From: Polifemo, Salvatore [mailto:polife...@conedsolutions.com] Sent: Tuesday, December 28, 2010 8:38 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -Original Message- From: Chris Beattie [mailto:cbeat...@geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: Is it possible to monitor specific event ids like disk error, fro windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring windows event viewer.
One use of syslog to set up rules and then take an action. We look for error then send out an email. Take a look at the syslog-ng forum. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -Original Message- From: Daniel Wittenberg [mailto:daniel.wittenberg.r...@statefarm.com] Sent: Tuesday, December 28, 2010 9:46 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Doesn't syslog-ng just consolidate the logs, it doesn't really monitor anything right? Dan -Original Message- From: Polifemo, Salvatore [mailto:polife...@conedsolutions.com] Sent: Tuesday, December 28, 2010 8:38 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -Original Message- From: Chris Beattie [mailto:cbeat...@geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: Is it possible to monitor specific event ids like disk error, fro windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption
[Nagios-users] Disapering popup windows
I have a 3.1.0 instnace which ahs been in service for a long time. Over the years, we have had some issues with the popup windows in the browser flashing up for a fraction of a second and disapering. I have always chrged this off to wierd browser behavior, however, I am now setting up a child instance, and it is at 3.2.0. Today I observed this ebhavior on the same browser, running on the smae machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in production. Does nayone have any thoughts as to what might have chnaged between these 2 versions that fixed this? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null