Re: [Nagios-users] Check result used as an argument for the next check
2009/7/8 Marc-André Doll : > Is there a way to use the result of a check (the return value or the > output for example) as an argument for the next check. I need to keep an > history of the checks of one particular service in order to compare the > previous check result with the current one. > > I thought to use the $SERVICESTATEID$ and $SERVICEOUTPUT$ macros but the > documentation says that would make no sense because they are not > initialized when a check is running. > > Marc-André Doll I use $SERVICEPERFDATA$ all the time in my service check commands and it works fine, I imagine you can use $SERVICEOUTPUT$ too. It isn't difficult to test, just write a little check command to spit the unixtime stamp and whatever you pass to it back out, and feed it the $SERVICEOUTPUT$ as an arguement. .r' -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] differing notifications
2009/5/20 Neil Dombrowski : > I haven't been able to find an answer to this conundrum looking at the > docs, faq, or googling. I'd love to hear any suggestions. > > > We have two contact groups; group "adminmail" sends an email to our > mailboxes and "adminphone" sends email to our phones. The idea is when a > "warning" notification happens, adminmail gets notified, and when a > "critical" notification happens adminphone gets notified. > > As a test I set up two services that check disk. For the first one I set: > notification_options w,r > contact_group adminmail > > For the second service I set: > notification_options c > contact_group adminphone > > This works, but it's kludgy. Is there a way to define one contact_group > for a warning, and a different contact group for critical? We are > currently running v1.2(yes, old), but will be upgrading eventually. I'd > like to know if there's a solution I can implement now; if we need to > upgrade to get this functionality I'd like to know as well. > > Thanks, > Neil > > http://nagios.sourceforge.net/docs/1_0/xodtemplate.html#contact Just setup two contacts, one for adminmail that gets warnings, the other for adminphone that gets crits. Add them to the same contract_group, then add that contact group to your service check. This style/syntax hasn't changed really, save for in nagios 3.x you don't need the contact group if you don't want it. .r' -- Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] The icinga on the cake ...
2009/5/8 Sean Carolan : > This is, IMHO, one of the greatest problems with open source > development. Too much bickering, splintering and forking dilutes the > value of the product. Yeah, because out of all of the commercial closed source development projects I've worked on, none of them have ever had bickering, splintering, forking or anything else horrible go on with them. .r' -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] What are the reasons to use NRPE?
2009/4/27 Sean Carolan : >> I just wonder what the reasons are to use NRPE in favour of checks >> over ssh with ssh keys. To me, NRPE seems just one more piece of >> software that may potentioally be broken and more hassle with >> firewalls. Why do people still use it when ssh checks are just as >> simple and (in my opinion) far more secure? > > I may be in the minority here but I also prefer using ssh with keys > for our remote nagios checks. We have a medium sized network, around > 220 hosts and about 1550 active checks. The reports of performance > degradation are greatly exaggerated, IMHO. Our average check time is > less than one second which is more than fast enough for our needs. By > using ssh we avoided having to install NRPE on several different > versions of Red Hat, and as you mentioned ssh is much more secure > especially if you do it right (eg, password protected key, perhaps > with keychain for loading into ssh-agent after you boot the server > up). > I agree with using SSH. If you search back through recent lists there was some talk of using ssh connection master which will make SSH greatly faster than NRPE. As for security...not so much. Both (can) use the same encryption algorithms. What really differs is SSH uses asymmetric encryption to setup the shared keys whereas with NRPE you give it the shared key (hence NRPE being faster, for the most part). A shared key VS a certificate I'm not feeling a real win with either side. In one case I need to grab a config file, in the other I need to grab a ssh-key file... The default to setup NRPE with explicit commands vs just allowing a user to run arbitrary over SSH, NRPE comes out ahead here. But its not really any harder to setup SSH so each key corresponds to a particular check. And you can setup NRPE to allow you to execute any command, which defeats the initial security you gain by limiting which commands it can run. As for having passwords on your keys and loading up a ssh-agent...pffft. If I can read your private key, I can read your ssh-agent socket and the unencrypted SSH key stored in RAM from said ssh-agent process. The latter might actually be easier if we want to go down the road of silly security (think firewire and DMA) since the agent process is always going to be in RAM... .r' -- Register Now & Save for Velocity, the Web Performance & Operations Conference from O'Reilly Media. Velocity features a full day of expert-led, hands-on workshops and two days of sessions from industry leaders in dedicated Performance & Operations tracks. Use code vel09scf and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to upgrade nagios Version 2.12 ?
2009/4/1 J. Bakshi : > Dear list, > > Hope you all are well. > > Nagios 3.x has been released :-) I'm running nagios Version 2.12 and > like to upgrade it to 3.x > Is there any script which can convert the nagios 2.12 configuration to > 3.X ? > > I did the configuration manually and it was a huge work, that's why I'm > looking for a converter program this time. > > Thanks Really not much has changed in terms of your objective configration files. There are some new features you might want to take advantage of, but you're under no real obligation to use them. What really changed are a couple of new lines in nagios.cfg. Have you tried running your existing configs with a nagios 3.0.x binary yet? If you do that it'll spit out whats missing, which will be a couple of paths that need to be added (tmp, check result off the top of my head). .r' -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Additional Info in email?
2009/4/1 Andrew Davis : > Perhaps I need to ask a clarification question... in my checkcommands.cfg, > should I change $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$ or should I add the > $LONGSERVICEOUTPUT$ macro to the notify-service-by-email command? I'm > thinking that some tests will write some data to $SERVICEOUTPUT$, but others > can include additional info with $LONGSERVICEOUTPUT$, so rather than change > $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$, I should add it? Perhaps something > like this: > > Before: > ~\n\nAdditional Info:\n\n$SERVICEOUTPUT$ > > After: > ~\n\nInfo:\n\n$SERVICEOUTPUT$\n\nAdditional Info:\n\n$LONGSERVICEOUTPUT$ > > A. Davis > Email: ncc...@gmail.com > So, one of the great things about Nagios, IMHO, is it's documentation. Check out: http://nagios.sourceforge.net/docs/3_0/macrolist.html#serviceoutput In particular: $SERVICEOUTPUT$ The first line of text output from the last service check (i.e. "Ping OK"). $LONGSERVICEOUTPUT$ The full text output (aside from the first line) from the last service check. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE vs. check_by_ssh
2009/3/24 Christopher McAtackney : > Hi all, > > I was wondering if someone could give a brief overview of the pros / > cons of using NRPE to monitor my remote hosts versus using the > check_by_ssh command? > > I'm aware that check_by_ssh increases the CPU overhead, but I'm not > clear on the level of impact here - does this increase the load on the > monitoring machine in direction relation to the number of hosts being > monitored? For example, if I was using check_by_ssh to monitor, say, > 2000 services spread across 200 hosts, would I experience significant > slowdown on my monitoring machine? > > Cheers for any info, > > Chris > SSH is going to slow it down on both sides of the communication. SSH does quite a bit more in terms of setting up the connection which involves using asymmetric encryption to setup a shared secret for symmetric encryption and verifying keys for the asymmetric part, verifying access, allocating a session. Whereas NRPE even with encryption just does a simple pre-shared secret for the symmetric encryption, much faster even if using the same encryption algorithm One thing you could do with SSH to speed it up (and I would argue make it faster than NRPE depending on the stability of your network)) would be to use ControlMaster. ControlMaster is a SSH v2 feature, where you create a connection and can open up multiple sessions with that ControlMaster for other SSH processes. This saves you not only the key-exchange heavy lifting but also you're not opening up a new socket on the remote host. In order to really make it worth it you'd have to spawn a process that was continuously connected. I wrote an ugly check_by_ssh that would spawn a ControlMaster if one didn't exist and use it if it did. Reduced the load/latency quite a bit for SSH checks. Though if I had to do it again I'd used 'ControlMaster auto' (man 5 ssh_config) and create a separate check that was responsible for maintaining the ControlMaster, then you could use the stock check_by_ssh without any modifications. That all being said, you might want to think about a distributed setup anyhow, if nothing more for redundancy. 200 servers and 2,000 checks is alot of responsibility for a singleton, you could break it 50/50 between two servers that could take over for the other one if it fails. .r' -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts are Critical
2009/3/23 : > Marc, > > it also is giving me the following error: > > Return code of 127 is out of bounds - plugin may be missing > > Sorry for not mentioning that before. Thanks, > > Andy > yeah, that's probably the problem - nagios can't find / can't run the plugin. Make sure that the plugins are installed, nagios can run them, and nagios knows where they are. presumably you already installed the plugins through the RPMForge repository, so they should be executable by the nagios user. As for telling nagios where they are, you should have a commands file that defines commands. It'll use one the $USER$, most likely $USER1$. Edit resources.cfg and point $USER1$ (or whatever) to the correct location of your nagios plugins, which will be where you installed them. Probably in /usr/lib/nagios/libexec on centos if memory servers me right. .r' -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Empty Graphs
2009/3/13 Eric Chatham > Hello, > > > > Can someone help me with this? None of my graphs have any information > being displayed in them. I’ve tried re-reading the INSTALL instructions > thinking I may’ve missed a step. It all seems to check out. What > information do you need from me. I tried setting the debug level to (4) and > insert_debug to 4 but nothing shows up in the log with any INSERT > information. Please help. Thanks. Just an FYI, I’m a really new to PERL > and Linux altogether. > > > > > > *Eric Chatham* > So exactly what are you using for graphing? Nagios by default doesn't do any sort of graphing by itself. .r' -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Layer 2 connectivity awareness?
2009/3/11 Victor Lee > In the course of my learning (just starting out) how to work with Nagios I > have found that it isn't aware of layer 2 connections. In other words, if I > have 2 switches with a trunk between them Nagios doesn't seem to provide me > a way to either note that manually on the map nor does it seem to be aware > of that type of connectivity. The same applies to servers, there doesn't > seem to be a way to identify which switch a server is connected to. Now I > admit this might be due to my limited knowledge of the product so I ask the > members here. Am I correct in this assessment? > > This isn't really what nagios does.. You might want to looks at something like opsview if you want a picture of your network like that. Also, fwiw, I think what you're really asking for is a layer 1 map if you want to know what host is plugged into what port - layer2 would be like what host is on a particular broadcast domain. I think most people give up on the nagios map pretty quickly, or at the most setup a few partent/child hosts and leave it at that. .r' -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Different notifications depending on warning?
2009/2/16 Israel Brewster > I don't know if this is possible, but is there any way to make nagios > send different alerts (or, rather not send alerts) depending on what > exactly a warning state is? My understanding is that nagios only looks > at the return code, not the status text, so this shouldn't be > possible, but I figured it couldn't hurt to ask. > In the past I have written my own notification program to do similar things, ie instead of having nagios send email directly, pass it off to some other program that looks at whatever conditions you want and makes some decisions based on that. In this case you could just write a "printer notification" program that doesn't send email when the service output says low on paper. .r' -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Acknowledgement Escalations
2009/1/21 Mathieu Gagné > Hi, > > Here is the situation: > Somebody acknowledges a problem and forget about it. > > How would you implement an acknowledgement escalation? > > Or how would you detect such situation where a host/service is > down/critical for too long while being acknowledged? > > -- > Mathieu > > Mmmm, there are a couple of technology things you could do for this, but the root of this problem is people, not computers. You need to work our a process where people aren't ack'ing things just so they can fall back asleep. I personally suggest having nagios create a ticket with whatever ticketing system you use (you use one right?!) so you can track that issue. That and having a 24x7 NOC helps :) Otherwise, write something that takes a look at the status file and find services that are in a non-okay state but acknowledged and have been for however long. I wrote a simple nagios CFG parser that would be able to handle it that's under the GPL at my former company for their Oracle Monitoring: https://code.bluegecko.net/wiki/Monocle. There is another one that will probably work somehwere on cpan. I would probably write that program to un-acknowledge things as well as alarming. If it just alarmed, someone might acknowledge it and do nothing about it. (since that's the problem you're having) You can do the un-acknowldeging though the nagios cmd file: http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=116 http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=117 You can get the location of the command file from the macro $COMMANDFILE$. Cheers, .r' -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Looking for a Nagios answering service
2009/1/7 Baron Schwartz > * filter out obvious spam > mmm, I think you might be designing for failure with that one. .r' -- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null