Hi list,

I'm trying to debug a problem where CPU usage (specifically system%) on my Nagios host increases over time, by about 0.5% an hour. Continuously watching the Nagios root process in "ps auxww" shows the process %CPU increasing while VSZ and RSS stay constant, so it doesn't look like a memory leak.
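A minimal sketch of how that measurement could be sampled per interval, assuming a Linux /proc filesystem. This is not the method used above: it reads /proc/<pid>/stat directly instead of calling ps, and the function names (parse_stat, cpu_percent, sample) are hypothetical. One reason to sample this way is that procps "ps" reports %CPU as total CPU time divided by process lifetime, so per-interval deltas show a trend sooner, and they separate user time from system time.

```python
import os
import time


def parse_stat(line):
    """Return (utime, stime, vsize_bytes, rss_pages) from a /proc/<pid>/stat line."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so split on the last ')' before splitting the fields.
    rest = line.rsplit(")", 1)[1].split()
    # rest[0] is field 3 (state), so field N of proc(5) is rest[N - 3].
    utime, stime = int(rest[11]), int(rest[12])        # fields 14, 15 (clock ticks)
    vsize, rss_pages = int(rest[20]), int(rest[21])    # fields 23, 24
    return utime, stime, vsize, rss_pages


def cpu_percent(prev_ticks, cur_ticks, interval_s, ticks_per_s):
    """CPU usage over one interval, as a percentage of a single core."""
    return 100.0 * (cur_ticks - prev_ticks) / ticks_per_s / interval_s


def sample(pid, interval_s=60.0, count=5):
    """Print user%, system%, VSZ and RSS for `pid` once per interval."""
    ticks_per_s = os.sysconf("SC_CLK_TCK")
    page_kb = os.sysconf("SC_PAGE_SIZE") // 1024
    path = "/proc/%d/stat" % pid
    with open(path) as f:
        prev = parse_stat(f.read())
    for _ in range(count):
        time.sleep(interval_s)
        with open(path) as f:
            cur = parse_stat(f.read())
        user = cpu_percent(prev[0], cur[0], interval_s, ticks_per_s)
        system = cpu_percent(prev[1], cur[1], interval_s, ticks_per_s)
        print("user=%.1f%% sys=%.1f%% vsz=%dKB rss=%dKB"
              % (user, system, cur[2] // 1024, cur[3] * page_kb))
        prev = cur
```

Calling sample(12345) with the PID of the Nagios root process (12345 is a placeholder) would print one line per minute. The figures cover only that process itself, not its child processes.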
If I completely stop and restart Nagios, the problem goes away. If left unchecked, after several days CPU hits 99% and service latencies skyrocket.

Through process of elimination, I think I've tracked it down to the Perl plugins. ePN is in use. I'm tracking 11,309 services on 1,364 hosts; 26% of those service checks are Perl (manubulon.com's check_snmp_mem, check_snmp_load) and the rest are C (check_icmp, check_snmp).

Is there any way I can analyze the Perl plugins for issues or see what's happening with the embedded Perl interpreter? Or does anyone have any other insight into the process CPU utilization?

I'm running Nagios 3.0.6. This happens on different CentOS kernels (2.6.18-92.1.10.el5PAE and 2.6.18-53.1.14.el5). Both systems have 8 GB memory and never hit swap. If memory serves, it's the default config except for use_large_installation_tweaks=1 and enable_environment_macros=0.

Kind regards,
bryan

_______________________________________________
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null