Re: [Nagios-users] Nagiosgraph and load graphing

2010-04-08 Thread Tobias Klausmann
Hi! On Wed, 07 Apr 2010, Paras pradhan wrote: > How do I change it to match the output similar to what w and > top provides. i.e instead of 350m, 440m,200m graph it as 0.03, > 0.04, 0.02. This is not a matter of NagiosGraph itself, but of its rrdtool backend. To create the legend on the graph,
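
A note on the numbers in the question: as far as I know, rrdtool scales legend values with SI prefixes, so a legend entry of 350m reads as 350 milli, i.e. a load of 0.35. The sketch below only illustrates that conversion; `si_to_float` and its small prefix table are made up for this example and are not part of NagiosGraph or rrdtool.

```python
# Illustrative helper (hypothetical name), not part of NagiosGraph or rrdtool.
# Maps an SI prefix letter to its power of ten; only a few prefixes are covered.
SI_EXPONENTS = {"u": -6, "m": -3, "k": 3, "M": 6}


def si_to_float(label):
    """Turn a legend string such as '350m' into a plain float (about 0.35)."""
    text = label.strip()
    if text and text[-1] in SI_EXPONENTS:
        return float(text[:-1]) * 10.0 ** SI_EXPONENTS[text[-1]]
    return float(text)


if __name__ == "__main__":
    for legend in ("350m", "440m", "200m"):
        # Two decimals, the way w and top show load averages.
        print(legend, "->", f"{si_to_float(legend):.2f}")
```

This only shows how to read the existing labels; changing what the graph prints is a matter of the format string handed to rrdtool.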

Re: [Nagios-users] PNP swap template HOWTO

2009-05-06 Thread Tobias Klausmann
Hi! On Wed, 06 May 2009, Jim Avery wrote: > 2009/5/6 Tobias Klausmann : > > I did something else: I patched PNP so that it removes the > > check_nrpe! prefix if it's there, then does processing as usual. > > I've sent this (trivial, four-line) patch to the PNP

Re: [Nagios-users] PNP swap template HOWTO

2009-05-06 Thread Tobias Klausmann
Hi! On Wed, 06 May 2009, Jim Avery wrote: > It's not quite as simple as that though, because if you set up > a check_nrpe.php template which makes your swap graphs look > lovely, it might make all the other checks you run using > check_nrpe look awful! You might need to consider setting up a > se

Re: [Nagios-users] Service latency suddenly through the roof

2009-03-23 Thread Tobias Klausmann
Hi! On Mon, 23 Mar 2009, Deborah Martin wrote: > Should I expect latency to be a lot lower with the new version > of Nagios ? I'm currently looking at the logs produced so far > for the new version to see what the latency levels are like. Definitely. Nagios 3 does not schedule host checks diff

Re: [Nagios-users] cool Nagios + DNX tutorial

2008-11-09 Thread Tobias Klausmann
Hi! On Sun, 09 Nov 2008, Andreas Ericsson wrote: > Tobias Klausmann wrote: > > Apart from that I *really* like it, since it makes a distributed > > setup feasible (NSCA et al fall very short on that front). > > Try pnsca. It removes the fork()-bomb related performance probl

Re: [Nagios-users] cool Nagios + DNX tutorial

2008-11-08 Thread Tobias Klausmann
Hi! On Fri, 07 Nov 2008, Roger wrote: > My buddy Pat at Petta Tech just put up a great DNX + Nagios tutorial > > http://nagioswiki.com/wiki/index.php/Nagios_%2B_DNX > > He did a good job of documenting how he worked out the various kinks. He's > running about 5000 checks on 800 hosts, and the s

Re: [Nagios-users] How do I receive alerts for only some of services I am a contact for?

2008-07-04 Thread Tobias Klausmann
Hi! On Fri, 04 Jul 2008, Matthew Jurgens wrote: > I'm wondering if anyone can think of a better way to configure the > following scenario: > > Assume I have 2 services being monitored, service1 and service2. > User1 wants to be able to see both services through the CGI interface and > hence is

Re: [Nagios-users] Multiple interfaces, multiple parents

2008-05-26 Thread Tobias Klausmann
Hi! On Tue, 27 May 2008, Hugo van der Kooij wrote: >> Unfortunately, Nagios connects multiple parent hosts with >> logical AND, which means that the host only turns UNREACHABLE >> when *both* switches are gone. > > The funny thing with redundant paths is that they are in fact > redundant. So if
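
The behaviour described in the quoted text (a host with two parents only turns UNREACHABLE when both are gone) can be sketched as follows. This is a simplified model of the rule as stated in the thread, not Nagios code; the function name and the dictionary layout are assumptions made for the example.

```python
def state_after_failed_check(parents_up):
    """Classify a host whose own check has just failed.

    parents_up maps a parent's name to True if that parent is UP.
    Simplified model: the host is UNREACHABLE only if it has parents
    and every one of them is down; otherwise it is DOWN.
    """
    if parents_up and not any(parents_up.values()):
        return "UNREACHABLE"
    return "DOWN"


if __name__ == "__main__":
    # Hypothetical host attached to two switches.
    print(state_after_failed_check({"switch-a": False, "switch-b": False}))
    print(state_after_failed_check({"switch-a": True, "switch-b": False}))
```

With one switch still up, the model reports DOWN rather than UNREACHABLE, which is the case the original post is concerned about.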

[Nagios-users] Multiple interfaces, multiple parents

2008-05-26 Thread Tobias Klausmann
Hey everybody, I've hit a snag when configuring parents for hosts. First a little bit about our setup. Most of our hosts have only one connected ethernet interface (if you don't count the management cards). Still, we have quite a handful of hosts (over 100) that have two interfaces. Up until now

Re: [Nagios-users] Strange notification problem

2008-05-05 Thread Tobias Klausmann
Hi! On Mon, 05 May 2008, Ilya Meylikhov wrote: > I've solved the problem - the CRITICAL state output of some > services on this host had more than 160 symbols - gnokii was > unable to send an SMS that is more than 160 symbols. Btw maybe > anyone knows how to make gnokii send sms which contains mo
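
One workaround for the 160-symbol limit mentioned above is to shorten the plugin output before it reaches the SMS command. This is not what the poster asked for (sending longer messages), just a minimal sketch of the trimming approach; `fit_sms` is a hypothetical helper, and it counts Python characters rather than encoded SMS septets.

```python
SMS_LIMIT = 160  # limit quoted in the thread


def fit_sms(text, limit=SMS_LIMIT, marker="..."):
    """Collapse whitespace and cut the text so it fits in one SMS."""
    flat = " ".join(text.split())
    if len(flat) <= limit:
        return flat
    # Reserve room for the marker so the result is exactly `limit` long.
    return flat[: limit - len(marker)] + marker


if __name__ == "__main__":
    long_output = "CRITICAL - " + "disk /var is almost full; " * 20
    message = fit_sms(long_output)
    print(len(message), message)
```

In a notification command this would sit between the macro carrying the service output and the call that actually sends the message.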

[Nagios-users] Heads up: Comments might bite you; was Re: Possible bug in 3.0rc3

2008-02-29 Thread Tobias Klausmann
Hi! Actually, this was both my and Nagios' fault. You see, faced with a config block like this: # foo=bar\ foo=baz Nagios will see... nothing. The trailing \ in the first line joins up the second and then both disappear since it's now one long comment. Logic-wise I'd say it's 50/50 bug/feature
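
The effect described above can be reproduced with a toy reader that joins backslash-continued lines before it discards comments. This is only a model of the behaviour reported in the post, not the actual Nagios parser; `effective_lines` is a name invented for the example.

```python
def effective_lines(text):
    """Return the config lines that survive after continuation and comments.

    Toy model: lines ending in a backslash are glued to the next line
    first, and only then are comment lines (starting with '#') dropped.
    """
    joined = []
    pending = ""
    for raw in text.splitlines():
        if raw.endswith("\\"):
            pending += raw[:-1]  # drop the backslash, keep collecting
            continue
        joined.append(pending + raw)
        pending = ""
    if pending:
        joined.append(pending)
    return [ln for ln in joined if ln.strip() and not ln.lstrip().startswith("#")]


if __name__ == "__main__":
    with_backslash = "# foo=bar\\\nfoo=baz\n"
    without_backslash = "# foo=bar\nfoo=baz\n"
    print(effective_lines(with_backslash))
    print(effective_lines(without_backslash))
```

In this model the first input yields no settings at all, because foo=baz has become part of the comment, while the second keeps foo=baz.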

[Nagios-users] Possible bug in 3.0rc3

2008-02-28 Thread Tobias Klausmann
Hi! I think I may have found a bug in the latest rc. I'm not sure if it's my own fault, so I'll ask here, first. Apparently, this line: service_perfdata_file_template=$HOSTNAME$\t$SERVICEDESC$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\t$TIMET$ in my nagios.cfg is ignored. I can change it to whatever

[Nagios-users] Docs at least partially wrong

2008-02-07 Thread Tobias Klausmann
Hi! We've been using the $ADDRESSN$ macro in our notification commands while we were using 2.x. Since we upgraded, some notifications fail - since said macro does not seem to be expanded anymore. The docs are inconsistent in this regard. On one hand, http://nagios.sourceforge.net/docs/3_0/objec

Re: [Nagios-users] Nagiosgrapher limits?

2007-11-21 Thread Tobias Klausmann
Hi! On Wed, 21 Nov 2007, Palle L Jensen wrote: > Are there any limitations to Nagiosgrapher like how many > hosts/services it is capable of producing graphs for? > > As soon as I get to a certain limit of hosts/services (past > around 34 hosts and/or around 100 services), the graphs act > "w

Re: [Nagios-users] NRPE Service Dependencies

2007-08-25 Thread Tobias Klausmann
Hi! On Sat, 25 Aug 2007, Anand Capur wrote: > Is there a way to configure nagios, so if NRPE is down or not responding we > only get 1 notification per box, and not a notification for every service on > the box? We've set things up so that NRPE itself (we used a dummy check inside NRPE, but you

Re: [Nagios-users] snmp over internet "best practice"

2007-08-25 Thread Tobias Klausmann
Hi! On Fri, 24 Aug 2007, Russell Adams wrote: > The argument was always SNMP (inferring v1), versus NRPE. I've been an > advocate of using SNMP because there was little client software to > maintain. I prefer NRPE over SNMP (no matter what version) for two simple reasons: 1) Code complexity.

[Nagios-users] Distributed setups

2007-07-02 Thread Tobias Klausmann
Apologies for the other mail with the wrong setup. Too little coffee on my part. Hi! We're currently looking at creating a distributed setup using NSCA. One thing that I've found no mention of is how the host and service commands are forwarded. Even if the central machine does all the notifica

Re: [Nagios-users] check_amanda with nsca

2007-07-02 Thread Tobias Klausmann
Hi! We're currently looking at creating a distributed setup using NSCA. One thing that I've found no mention of is how the host and service commands are forwarded. Even if the central machine does all the notifications (as we're planning), completely dis/enabling service/host checks would have t

[Nagios-users] Notifications - best common practice

2007-05-31 Thread Tobias Klausmann
Hi! The mails by shacky and Janet Post got me to thinking about a thread regarding "best common practice" when it comes to user accounts. We currently have 370 hosts and 3740 services. They can be sorted into some 60 groups of related servers. In total, we have about 70 real people which manag

Re: [Nagios-users] front end tools for Nagios

2007-05-17 Thread Tobias Klausmann
Hi! On Thu, 17 May 2007, [EMAIL PROTECTED] wrote: > Hari Sekhon wrote: >> Hugo van der Kooij wrote: >> > On Wed, 16 May 2007, RR wrote: >> >> I'm relatively new to Nagios and am looking for cool front end tools >> >> to managing the config files. >> >> >> >> Of these, which ones do other users

Re: [Nagios-users] Monitor DB Server without outside IP Address

2007-03-01 Thread Tobias Klausmann
Hi! On Thu, 01 Mar 2007, Marc Powell wrote: > > From: [EMAIL PROTECTED] [mailto:nagios-users- > > [EMAIL PROTECTED] On Behalf Of Patrick Morris > > Sent: Thursday, March 01, 2007 4:07 PM > > To: James Pells > > Cc: nagios-users@lists.sourceforge.net > > Subject: Re: [Nagios-users] Monitor DB Serv

Re: [Nagios-users] cpu usage / memory usage

2007-02-13 Thread Tobias Klausmann
Hi! On Tue, 13 Feb 2007, Niels Hamaker wrote: > the check_mem plugin is a simple and effective plugin to check memory. You > can find it on nagiosexchange.org. > We generally don't check CPU usage, just load, but it depends on your > setup, the applications you're running, which of the two is the

Re: [Nagios-users] Nagios memory Leaks

2007-01-24 Thread Tobias Klausmann
Hi! On Wed, 24 Jan 2007, John Longland wrote: > I have been reading with interest about these memory leaks. > I see you mention 2.5 & 2.6. > Does this happen with 2.4 as well ?? I can't really tell: back when I used 2.4 I wasn't aware of this problem and hadn't added so many machines/services ye

Re: [Nagios-users] Memory leaks

2007-01-24 Thread Tobias Klausmann
Hi! On Wed, 24 Jan 2007, Andreas Ericsson wrote: > > Activating the embedded Perl interpreter and -cache will increase > > the amount of lost memory to about 5-6M per hour. In this case, > > however, sometimes the memory usage snaps back, i.e. some of the > > lost memory is collected. I've not ye

[Nagios-users] Memory leaks

2007-01-23 Thread Tobias Klausmann
Hi! (First off: if this should also go to nagios-devel, just yell at me.) Nagios 2.6 and 2.5 have memory leaks. They are not so big that your machine will be swapping within hours, but they degrade performance in other ways. First off, their approximate extent. 2.5 and 2.6 without perl cach

Re: [Nagios-users] Completely stumped

2007-01-22 Thread Tobias Klausmann
Hi! On Fri, 19 Jan 2007, Andreas Ericsson wrote: > Was this by any chance coupled with a big fat spike of memory usage > on the Nagios server? I assume you do monitor memory usage, right? I've checked it every now and then and found nothing unusual. While switching around 2.5/2.6 with and without

Re: [Nagios-users] Completely stumped

2007-01-19 Thread Tobias Klausmann
Hi! On Fri, 19 Jan 2007, Tobias Klausmann wrote: > As far as I can tell, backdating from my own packages (2.6 with > said patch) to distro packages (Gentoo, Nagios v2.5) fixed the > problem. The new machine has run close to 12 hours without even > remotely acting up. I've n

Re: [Nagios-users] Completely stumped

2007-01-19 Thread Tobias Klausmann
Hi! On Thu, 18 Jan 2007, Andreas Ericsson wrote: > > The *only* thing I've left to try is removing the multiuser patch > > we talked about at the end of last year. If that does it, at > > least I have an idea *where* in the code my problem lies. I'll > > try that route tonight. > > > > Which pa

[Nagios-users] Completely stumped

2007-01-18 Thread Tobias Klausmann
Hi! The other day, we got our beefier machine. I had hoped my latency problems (ever increasing check latencies) would go away or at least turn irrelevant with that. They didn't. More precisely: we have migrated to a four-core Opteron 2.2GHz with 2GBs of RAM and a quite fast I/O Subsystem. We h

Re: [Nagios-users] Performance issues, too

2007-01-09 Thread Tobias Klausmann
Hi! On Tue, 02 Jan 2007, Daniel Meyer wrote: > Program Running Time: 10d 21h 22m 42s > > So, for almost eleven days nagios runs smoothly now, no more > latency problems. I'll try it again with EPN (but still without > perlcache) now. I've finally gotten around to recompile Nagio

Re: [Nagios-users] Performance issues, too

2006-12-26 Thread Tobias Klausmann
Hi! On Mon, 25 Dec 2006, Robert Hajime Lanning wrote: > > I think the two issues are independent (or at most correlated). > > If switching off EPN/perlcache fixes the issues for me, too, I'd > > guess it's either the embedded Perl or the cache. Finding out > > which is a matter of simple experime

Re: [Nagios-users] Performance issues, too

2006-12-25 Thread Tobias Klausmann
Hi! On Mon, 25 Dec 2006, Robert Hajime Lanning wrote: > > > > Just rechecked. After 72 hours nagios still runs perfectly > > with an average service check latency of 0.3 seconds, max. > > 0.9 seconds. > > > > Memory usage is perfectly "flat" now, with epn and perlcache > > it went from 140 mb

Re: [Nagios-users] Performance issues, too

2006-12-21 Thread Tobias Klausmann
Hi! On Thu, 21 Dec 2006, Daniel Meyer wrote: > > I have the suspicion that our check latency might converge on 419 > > seconds - but I'd rather not test it, we'd be well beyond the > > 300s-interval most of our checks are designed for. > > Why do you think of exactly 419 seconds? > > And btw, i

Re: [Nagios-users] Performance issues, too

2006-12-21 Thread Tobias Klausmann
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: > >>> SERVICE SCHEDULING INFORMATION > >>> --- > >>> Total services: 2836 > >>> Total scheduled services: 2836 > >>> Service inter-check delay method: SMART > >>> Average service check int

Re: [Nagios-users] Performance issues, too

2006-12-21 Thread Tobias Klausmann
Hi! On Thu, 21 Dec 2006, Daniel Meyer wrote: > - it is not triggered by any other software on the server > (nagios and apache are the only things running there) ACK. > - it's not triggered by hourly, daily or weekly cronjobs With a lot of guessing and estimating, I can make a case for a slig

Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: > >>> --- > >>> Total services: 2836 > >>> Total scheduled services: 2836 > >>> Service inter-check delay method: SMART > >>> Average service check interval: 2225.56 sec > >> This is,

Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! On Tue, 19 Dec 2006, Daniel Meyer wrote: > >> You could lower this to 2 seconds. I've done so on any number of > >> installations and it has no negative impact whatsoever, but seems to > >> make Nagios a bit more responsive. > > > > I'll give that a try. > > I've tried that but had some fa

Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Tobias Klausmann
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: > > - How does the scheduling queue work? From the docs it seems the > > whole queue is held up as soon as a host check is necessary. > > As far as I know, Nagios parallelizes checks, so my question > > is if the current checking thread is h

Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: > Thanks for an excellently detailed problem report, missing only the > Nagios version and system type/version info. I've got some comments and > followup questions. See below. I'm running 2.6 now but I had the troubles with 2.5 initially. OS is

[Nagios-users] Questions about scheduling

2006-12-19 Thread Tobias Klausmann
Hi! I have a few questions about scheduling in Nagios. - How does the scheduling queue work? From the docs it seems the whole queue is held up as soon as a host check is necessary. As far as I know, Nagios parallelizes checks, so my question is if the current checking thread is held up on

[Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! Recently I have run into the very same performance issues as Daniel Meyer (or so it seems). However, I'm not quite sure about it. Here's the gist of it. Currently, service check latency slowly creeps up. As it is now, it starts out at a little over 1s and after about 12 hours it's in the ar

Re: [Nagios-users] Advanced permissions/user properties

2006-12-07 Thread Tobias Klausmann
Hi! On Tue, 05 Dec 2006, Tobias Klausmann wrote: > Thus, I'll probably patch NG to just ignore the perms. > > I'll post the patch here (if it's not too ugly ;)) See the attached file. Have fun. Regards, Tobias -- Never touch a burning system. --- NagiosGrapher.pm.

Re: [Nagios-users] Advanced permissions/user properties

2006-12-05 Thread Tobias Klausmann
Hi! On Mon, 06 Nov 2006, Tobias Klausmann wrote: > > For backwards compatibility, the default would be rwxn. > > > > So, the engineers would have: nrx, customer: nr and helpdesk r. > > > > Attached is an updated patch. > > I'll try to get a peek at it

Re: [Nagios-users] How do distributed setups work? (longish)

2006-11-23 Thread Tobias Klausmann
Hi! First off, thanks for your quick reply. On Wed, 22 Nov 2006, Patrick Morris wrote: > > 1) Documentation for NSCA is - mildly put - lacking. As far > > as I can tell, send-NSCA expects data tab-separated on stdin. > > It would've been nice to actually see an example for getting > > host and s

[Nagios-users] How do distributed setups work? (longish)

2006-11-22 Thread Tobias Klausmann
Hi all, I'm having a conceptual/logical/mindset problem which I hope you can help me with. It's a bit long, but the question/problem I have is complex, so please bear with me. What I dream of: I have a central machine which is the interface to the users. Using the/a web interface, the users can

Re: [Nagios-users] check_cciss plugin for monitoring RAID arrays on HP servers

2006-11-17 Thread Tobias Klausmann
Hi! On Fri, 17 Nov 2006, Yogesh Hasabnis wrote: > Yes, I had used arrayprobe from the command line. But being a layman, I was > not sure how to define a checkcommand using arrayprobe. Anyway, I will also > give it a try. We use this: nrpe.conf: command[check_array]=/usr/bin/sudo /usr/bin/arrayp

Re: [Nagios-users] check_cciss plugin for monitoring RAID arrays on HP servers

2006-11-17 Thread Tobias Klausmann
Hi! On Fri, 17 Nov 2006, Sim wrote: > > It works for both IDA and CCISS devices and already returns the > > retvals the way Nagios wants them. > > Hi! > > Can you post an example of the output? > > Have you tried all cases (Rebuilding, Inter.Recovery, Fail, etc.)? Ok state looks like this

Re: [Nagios-users] check_cciss plugin for monitoring RAID arrays on HP servers

2006-11-17 Thread Tobias Klausmann
Hi! On Fri, 17 Nov 2006, Thomas Hager wrote: > > ./check_cciss-1.5: line 90: ./utils.sh: No such file or directory > the plugin calls utils.sh (which comes with the nagios-plugins package) > and needs it in the same directory you installed check_cciss. so, you > got three choices: > > a) copy che

Re: [Nagios-users] Advanced permissions/user properties

2006-11-06 Thread Tobias Klausmann
Hi! First off: thanks for all your work, I didn't quite expect so much (and such constructive/worthwhile) feedback. On Sun, 05 Nov 2006, Alex Burger wrote: > How about: > > r: View in web interface > > x: Submit commands for this host/service > > w: Not really needed yet. Maybe some of the o

Re: [Nagios-users] Advanced permissions/user properties

2006-11-03 Thread Tobias Klausmann
Hi! On Thu, 02 Nov 2006, Alex Burger wrote: > I have expanded on the Altinity patch by adding a 'can_submit_commands' > and 'can_submit_commands_strict' option to contact groups. The > limitation of having a can_submit_commands option on the user is that > it's an all or nothing option. A us

Re: [Nagios-users] Advanced permissions/user properties

2006-10-31 Thread Tobias Klausmann
Hi! On Tue, 31 Oct 2006, Az wrote: > > The altinity people have created a patch for the "view some, > > change none" scenario[0]. Unfortunately, what I'd need is a > > mechanism for the "view some, change a few" scenario I outlined > > above. > Is that to say that "view _all_, change some" wouldn

[Nagios-users] Advanced permissions/user properties

2006-10-31 Thread Tobias Klausmann
Hi! I've got a problem that I don't know how best to solve in Nagios. I think other people have run into the same problem (I know that someone has run into a /similar/ problem). I'm running 2.5 on a mid-sized installation (~300 hosts, ~2500 services). Thing is, our projects/(host|service)groups vary

[Nagios-users] Debugging plugins

2006-10-25 Thread Tobias Klausmann
Hi! While debugging it would sometimes be nice to be able to run a plugin/check from the commandline in exactly the same fashion as Nagios does. What I'm after is a way of running (for example): simulate -c /etc/nagios/nagios.conf -p 'check_http!80!http://somewhere.com' The macros (like %HOSTA
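
As a rough idea of what such a tool would have to do: a check_command value such as check_http!80!http://somewhere.com is split on "!" into the command name and the arguments that fill $ARG1$, $ARG2$ and so on in the command's command_line. The sketch below does only that substitution. The function name, the sample command_line and the macro values are assumptions for illustration; a real tool would read them from the Nagios configuration.

```python
import re


def expand_check_command(check_command, command_line, macros=None):
    """Split 'name!arg1!arg2' and substitute $MACRO$ placeholders.

    Returns (command name, expanded command line). Macros without a
    value expand to an empty string in this sketch.
    """
    name, *args = check_command.split("!")
    values = dict(macros or {})
    for index, arg in enumerate(args, start=1):
        values[f"ARG{index}"] = arg
    expanded = re.sub(
        r"\$([A-Z0-9_]+)\$",
        lambda match: values.get(match.group(1), ""),
        command_line,
    )
    return name, expanded


if __name__ == "__main__":
    # Hypothetical command definition and macro values.
    template = "$USER1$/check_http -H $HOSTADDRESS$ -p $ARG1$ -u $ARG2$"
    name, line = expand_check_command(
        "check_http!80!http://somewhere.com",
        template,
        {"USER1": "/usr/lib/nagios/plugins", "HOSTADDRESS": "192.0.2.10"},
    )
    print(name)
    print(line)
```

The expanded line could then be run by hand to compare its output with what Nagios reports for the same check.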

Re: [Nagios-users] timeouts and performance info

2006-08-30 Thread Tobias Klausmann
Hi! On Wed, 30 Aug 2006, Marc Powell wrote: > > Active Service Checks: > > <= 1 minute:81 (4.6%) > > <= 5 minutes: 1719 (97.4%) > > <= 15 minutes: 1727 (97.9%) > > <= 1 hour: 1727 (97.9%) > > Since program start:1727 (97.9%) > > This seems mostly normal for a 5 minute

[Nagios-users] timeouts and performance info

2006-08-30 Thread Tobias Klausmann
Hi! I have the following values in my nagios.cfg: service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 As far as I know, those values are in seconds. What I wonder is why I still have Service and Host Checks that take l

Re: [Nagios-users] Using SNMP as an alternative to NRPE

2006-07-13 Thread Tobias Klausmann
Hi! On Thu, 13 Jul 2006, Thomas Sluyter wrote: > Why is it that we insist on using NRPE for this? Of course it's very > practical that there's such a thing as the NRPE daemon and the > check_nrpe command. It does indeed make things easier for a lot of > people who lack deep technical insigh

[Nagios-users] Threads and counting them

2006-05-31 Thread Tobias Klausmann
Hi! I'm monitoring several processes that do not fork but use threads. Unfortunately, check_procs cannot be coerced into counting threads individually. While I could hack a shell script together using ps a -L, I'd rather use a stock plugin for this. Is there any way to make check_procs count the thread

Re: [Nagios-users] Alerts - Verbal notification - via phone

2006-05-11 Thread Tobias Klausmann
Hi! On Thu, 11 May 2006, Stringham, Steven wrote: > Has anybody out there configured Nagios to alert via telephone? > I don't mean like a SMS message or the like. I figure I am more > likely to answer my home phone ringing at 0 dark hundred than > my cell phone's little email beeps. Also, that wo