On 10/03/2012 05:58 PM, Frost, Mark {BIS} wrote: > Dan, > > As I understand it, the issue is less about Nagios and more about > npcd. Nagios merrily produces the perfdata files and then npcd > comes along and scoops them up, but as it's processing them it's > opening a lot of rrd files and inserting data into them.
That's what rrdcached is for. It's supposed to stash the perfdata in memory so writes don't have to happen so often. You defeat the purpose if you use a cache-timer shorter than your check-interval though, but if you have, say, a 30 minute cache time and a 5 minute check-interval, you basically cut I/O down to 1/6th of the normal. If you also hook up a decent-sized ssd, you get x21 throughput from before, so all in all you can get a rough increase of 12000% with both those options enabled. > So really > it's npcd that's the problem. Well, not really a problem, but > ultimately it's doing its thing and then Nagios gets less than a fair > share of the box's I/O. It's not that it's horrible right now, but > we're starting to notice it and I would tend to be concerned about > scaling problems. > > Honestly even with Nagios 3, it seems like Nagios' own I/O is > entirely manageable so far with strategic use of ram disk. It's > just putting Nagios and PNP4Nagios (plus Apache to serve up the graph > contents which I'm also not happen going on on the same server) on > the same boxes that I don't like. > With Nagios 4, set status_file=/dev/null and use livestatus instead and you're golden. Apart from the nagios.log, there's no other I/O going on from Nagios' side, so it won't get stuck waiting for pnp to get done. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! http://p.sf.net/sfu/newrelic-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null