Re: [Nagios-users] Odd Problem with check_yum on one server...
On 2013-09-13 19:46, Sean Alderman wrote:
> I apologize... I should have pasted that, but I thought the fact that I could execute the check_yum script as the nrpe user on the host with the problem would have implied that it functions as expected. Yum is located where we would expect it to be on a standard CentOS machine.

Then it seems like you're running nrpe in a chroot jail or a limited container, where /usr/bin/yum doesn't exist.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_udp error message
On 2013-09-03 12:33, Marco Borsani wrote:
> Hi all
>
> I need to control many UDP ports. I run the command:
>
>   ./check_udp -H IP_ADDRESS -p 88
>
> I receive following error message (with state UNKNOWN):
>
>   With UDP checks, a send/expect string must be specified
>
> Can anyone help me to solve it? Those parameters should be optional..

UDP is a connection-less protocol. The expect string could indeed be optional, and we could just expect to get something at all back when we send something, but without sending anything we won't even touch the network, so the remote host has no idea that we're trying to talk to it and we won't know if the port is up. That's why send, at least, is not optional.

--
Andreas Ericsson andreas.erics...@op5.se
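The point made in the reply can be illustrated with a minimal Python sketch. This is not the check_udp source: the function name `udp_probe`, the payload and the timeout are assumptions chosen for illustration. It shows that a UDP probe only learns something after it has sent a datagram, and that "any reply at all" is the weakest useful expectation.

```python
import socket


def udp_probe(host, port, payload, timeout=2.0):
    """Send payload to host:port over UDP.

    Returns the reply bytes, or None if nothing came back in time
    (or the OS reported the port as unreachable).
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        # Without this send, no packet ever leaves the machine, so the
        # remote side cannot possibly answer.
        sock.sendto(payload, (host, port))
        try:
            data, _addr = sock.recvfrom(4096)
        except (socket.timeout, ConnectionError):
            return None
        return data
```

With the plugin itself, the error message asks for both strings, so the invocation would look something like `./check_udp -H IP_ADDRESS -p 88 -s 'payload' -e 'reply'`. The payload that makes a given service answer depends on the protocol spoken on that port, so both values here are placeholders.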
Re: [Nagios-users] Nagios 4 beta plan/status/roadmap?
On 2013-08-14 17:54, Frost, Mark {BIS} wrote:
> Unless I missed a message somewhere, Nagios 4 is still in beta. Is there an expected time when it will become a regular release? It's been out for a bit now. Does it seem as if there will be a beta2 or is this effectively the release candidate?

There is a beta2 already, and a beta3 is likely to be tagged pretty soon.

> I have seen some new patches flowing in (mostly feature stuff, I think) and I wasn't sure if those were eventually to be included with Nagios 4 or not - possibly in a later release.

No major features have come in since the conference last year. Only bugfixes, performance improvements and some code cleanups.

> I've had good luck with the pre-beta versions of Nagios 4 and am contemplating timing for a move to Nagios 4 (and Merlin).

Happy to hear it :)

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Nagios check latency
On 2013-08-13 05:49, Muhamad Faiz wrote:
> Hi guys,
>
> My Nagios setup is having high service check latency. The check_nagios_latency result is 240 seconds!
>
>   ./check_nagios_latency.sh -w 30 -c 60
>   CRITICAL: Latency at 240.183 seconds!

Change to Nagios 4 and your latency will most likely go away.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] [Nagios-devel] Patches for inclusion to Nagios 4
On 2013-08-08 15:17, Ton Voon wrote:
> On 7 Aug 2013, at 08:51, Andreas Ericsson wrote:
>> On 2013-08-06 19:16, Ton Voon wrote:
>>> Hi!
>>>
>>> We've published a list of patches for Nagios 4: http://www.opsview.com/whats-new/blog/opsview-patches-nagios-4
>>>
>>> We'd be happy if you could review if these are acceptable for future inclusion or if anyone else finds them useful.
>>
>> I'd like to get patches with commit messages and proper author and signed-off-by info. Since we're using git for Nagios now, it'd go a long way in making sure everyone gets credit for the work they've done. The patches also need to apply cleanly to the latest master.
>>
>> You may want to clone the official Nagios repo, apply your patches on top of it and then send me a pull request for github or some such.
>>
>>   git clone git://git.code.sf.net/p/nagios/nagios nagios-core
>>
>> should get you the very latest. If you apply your patches on top of 'master' and make sure to always do
>>
>>   git pull --rebase
>>
>> when you want to get the latest and greatest you'll quickly see which patches either have been applied or which no longer *can* be applied. Then you can create a separate repository on github or some such and push the changes there.
>
> OK, we'll convert them into git changes based off master.
>
> However I'd like an assurance that the changes will be merged before we promise to convert. Style changes are fine, won't take much time and will not require retesting, but if we need to refactor object changes or make larger changes to logic, this will require retesting on our side. I'd like a reassurance that the time invested will result in a merge upstream, otherwise we're just wasting time for all of us.
>
> For instance, your assistance in the environment macros per-command was greatly appreciated and we've coded to the design agreed in the email conversations, but it hasn't been merged yet.

It has, but it hasn't. The original patch no longer applies, due to extensive remodeling of the worker code. I still have it hanging around though, and I'm fairly inclined towards using it for services and hosts as well, so that environment macros can be set on a service level as well as on a command level. I've also been investigating the chances of doing this without resorting to setenv() (in essence, building a one-off block of the load-time environment macros, and then extending that whenever we hit an environment variable). But yea, I've got that one already.

> So I'd like to reverse the question and ask which changes are most likely to be accepted, based on the amount of changes required and we'll work through that list in order.
>
> Thread-safe calls

It's harmless, and I'd apply it straight away if I had a sensible commit message for it, but I don't know why you need this so I can't really write one.

While we're on the topic: please write commit messages in imperative form and present tense, as if you're giving orders to the code for how it should change. Also give a short rationale for why it should change that way.

The rules about not commenting out code you want to get rid of still apply though. It just looks terribly hackish when patches meant for upstream contain things like that.

> Slice services within hosts

So long as it's configurable from cgi.cfg and the default stays the same as it is today I'll apply it immediately. It has no impact on loadable modules or other headaches, so that's a no-brainer, really.

> Check command by time period

I feel this is somewhat lacking in efficiency and flexibility, and a much cleaner solution would be to add a filtering functionality to NERD so that checkresults can be shipped off to a third-party addon which can transform checkresults and plugin output as much as it likes. Failing that (which would be enormously cool but also a lot of work), I think it would most likely be best off as a separate module, with custom variables or separate configuration to support it.

Supporting patches to run events when a timeperiod becomes active or inactive would still be welcome, obviously.

> Escalation via notification levels

This is best off written as a module, using custom variables to configure it. If core support is required to block notifications to a particular contact, then such patches will naturally be accepted. The normal NEBCALLBACK_CANCEL return code signalling should work just fine for that.

> Synchronising state data

Pretty invasive for quite a small benefit, and with enormous complexities to deal with to make this work properly. How do you handle reading the synced file while checks are in-flight and awaiting being returned to the mothership? They may have been spawned as a result of a bad checkresult on one node, but if the sync status returns them to OK and the in-flight result sets them as bad again, would that mean starting over on the retry-attempts? I see nothing in the patch that handles situations like that, so I'm forced to believe you haven't considered it. Considering the fact that full
Re: [Nagios-users] [Nagios-devel] Patches for inclusion to Nagios 4
On 2013-08-06 19:16, Ton Voon wrote:
> Hi!
>
> We've published a list of patches for Nagios 4: http://www.opsview.com/whats-new/blog/opsview-patches-nagios-4
>
> We'd be happy if you could review if these are acceptable for future inclusion or if anyone else finds them useful.

I'd like to get patches with commit messages and proper author and signed-off-by info. Since we're using git for Nagios now, it'd go a long way in making sure everyone gets credit for the work they've done. The patches also need to apply cleanly to the latest master.

You may want to clone the official Nagios repo, apply your patches on top of it and then send me a pull request for github or some such.

  git clone git://git.code.sf.net/p/nagios/nagios nagios-core

should get you the very latest. If you apply your patches on top of 'master' and make sure to always do

  git pull --rebase

when you want to get the latest and greatest you'll quickly see which patches either have been applied or which no longer *can* be applied. Then you can create a separate repository on github or some such and push the changes there.

On a first inspection though;

* Don't comment out code. Bringing back dead code is what the VCS is for, and keeping it around is just plain dumb. If it shouldn't be in there, just remove it. In the same vein; Don't add commented-out code.
* Avoid C++ comments. I know they're supported in C99, which I'm rooting for as the least version supported, but it's against the current style.
* Don't mix spaces and tabs for indentation, unless it's continuation-indentation of a multi-line statement or condition. Stick to the style in the file you're editing, and *look* at the patch before you send it somewhere.
* Avoid comments saying things like "Opsview specific foobar" if you want to have the patches included. If you *don't* want those patches included, don't send them to me or point me to where they are. It takes up a lot of time to remove crap like that, and I have no interest in cleaning up after anyone else. I'm messy enough as it is on my own.
* Don't augment objects (such as hosts, services, commands) with new variables. Doing so means Nagios 4.1, and I can't take patches like that until Nagios 4 is at least released as stable. All objects have an 'id' field which means you can look up any extra data you want in O(1) time, provided you just initialize an array of size num_objects.$desired_object_type_in_plural before we enter the event loop.
* Make patches the most scalable you can. For the check_time_period thing, you'd be far better off adding code to detect timeperiod changes, notice which timeperiods are used to change commands and make a one-off swap for all the affected commands as the desired timeperiod comes either in or out of effect.
* Don't build on deprecated technology, such as external files for commands and/or check results.

--
Andreas Ericsson andreas.erics...@op5.se
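The 'id' lookup suggested in the list above can be sketched as follows. This is a Python illustration of the idea only, not Nagios code: `Service`, `build_side_table` and `lookup` are hypothetical names, and the ids are assumed to be dense indexes 0..N-1 assigned when the configuration is loaded.

```python
from dataclasses import dataclass


@dataclass
class Service:
    """Hypothetical stand-in for a core object; it is never modified."""
    id: int            # assumed dense index, 0..num_objects-1
    description: str


def build_side_table(objects, default=None):
    """Allocate one slot per object, done once before the event loop starts."""
    return [default] * len(objects)


def lookup(table, obj):
    """O(1): the object's id is the index into the side table."""
    return table[obj.id]


services = [Service(0, "PING"), Service(1, "HTTP"), Service(2, "SSH")]
extra = build_side_table(services)

# Attach add-on data to the HTTP service without adding a field to Service.
extra[services[1].id] = {"sla": "gold"}

print(lookup(extra, services[1]))   # {'sla': 'gold'}
print(lookup(extra, services[0]))   # None
```

The design choice is that the extra data lives outside the object, so the object layout (and therefore the module ABI) stays unchanged.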
Re: [Nagios-users] Nagios Upgrade
On 2013-08-03 02:27, Kaushal Shriyan wrote:
> Hi,
>
> I am running Nagios Version 3.4.4 on CentOS Linux Version 6.4. As per http://www.nagios.org/download/core/thanks?t=1375489385 do I need to update it to 3.5.0 or upgrade to the 4.x series? Please help me understand the difference between 3.x and the new beta 4.x version. Any dates planned for the stable release of the 4.x version?

Nagios 4 has improved performance by a factor of 7, as well as much better interfaces for extending it further in the future.

Nagios 3.5 is mainly 3.4 with bugfixes, although one such bugfix caused some internal APIs to be changed incompatibly, so we had to bump the minor number for it.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Nagios Upgrade
On 2013-08-05 14:30, Kaushal Shriyan wrote:
> On Mon, Aug 5, 2013 at 2:20 PM, Andreas Ericsson a...@op5.se wrote:
>> On 2013-08-03 02:27, Kaushal Shriyan wrote:
>>> Hi,
>>>
>>> I am running Nagios Version 3.4.4 on CentOS Linux Version 6.4. As per http://www.nagios.org/download/core/thanks?t=1375489385 do I need to update it to 3.5.0 or upgrade to the 4.x series? Please help me understand the difference between 3.x and the new beta 4.x version. Any dates planned for the stable release of the 4.x version?
>>
>> Nagios 4 has improved performance by a factor of 7, as well as much better interfaces for extending it further in the future.
>
> Hi Andreas Ericsson,
>
> I am not sure I understand "Nagios 4 has improved performance *by a factor of 7*" compared with Nagios 3. Does it mean a performance improvement of 70%? Correct me if I am understanding it completely wrong.

700%, in terms of the number of jobs it can spawn, according to a volunteer tester who reported in a few days ago. It's actually a bit inaccurate to talk about percentages, since most algorithms used in Nagios 4 are several orders of magnitude faster than the ones in Nagios 3, and that means they'll scale a lot better, but for very small installations you won't notice much difference except a slightly lower load.

--
Andreas Ericsson andreas.erics...@op5.se
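The "factor of 7" versus "70%" question comes down to simple arithmetic, worked through below as a small Python sketch. The baseline of 1000 checks per minute is a made-up number for illustration; the thread gives only the factor, not absolute throughput.

```python
def speedup_as_percent(factor):
    """Express an N-times speedup as (percent of baseline, percent increase)."""
    percent_of_baseline = factor * 100
    percent_increase = (factor - 1) * 100
    return percent_of_baseline, percent_increase


# Hypothetical baseline: 1000 checks per minute on the old version.
baseline = 1000
improved = baseline * 7           # "a factor of 7"

print(improved)                   # 7000
print(speedup_as_percent(7))      # (700, 600)
```

So a factor of 7 means the new throughput is 700% of the old one (a 600% increase), whereas a 70% improvement would correspond to a factor of only 1.7.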
Re: [Nagios-users] how to start nagios with non nagios user
On 2013-07-31 14:49, , Roy wrote:
> Hi guys,
>
> I want to start nagios with non root, nagios user. How do I do that.

Well, ask yourself if there are any mechanisms or programs that will let you start a program as a different user than root on your system, and then put that noggin' of yours to good use and see what you come up with.

If you don't know of any such mechanisms or programs, you really need general unix administration help rather than help with this specific problem.

Good luck.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Nagios navbar search enhancement (Was: Questions about Nagios quick search)
On 2013-06-04 15:43, Jonas Meurer wrote:
> Hey,
>
> I just prepared a patch against git master (commit 758a64). I hope that it helps. Don't hesitate to ask if you've any questions. Also feel free to rename the config options if you don't like the names.
>
> The patch is attached.

Applied. Thanks. I made the new options default to ON in the code and renamed them navbar_search_addresses and navbar_search_aliases instead of the navbar_search_for... since we're actually searching for hosts by searching the hosts' configured properties.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] nagios backdoor
On 06/06/2013 10:46 PM, William Leibzon wrote:
> Sounds like they got through some sort of security hole in apache and accessed the database on the server, probably as the apache/www user and not root. It is unclear from the information given whether this apache backdoor had anything to do with the nagios cgi or not.
>
> BTW the description of how it happened is rather interesting. I remember 6 or 7 years ago, when I was still following security more closely, people were talking about the possibility of this (hacking with only in-memory application replacement) on a certain forum that shall remain unnamed. I have never seen or heard of this being done at any company I consult for though.

It's not particularly difficult. All exploits work by modifying executable code in memory to make a program do what they want. If one can get root access that way, it's possible to freeze a process and replace it entirely.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Questions about Nagios quick search
Would you care to forward-port the patch so it applies to Nagios 4 as well? Otherwise it's likely to get dropped on the floor I'm afraid.

On 05/29/2013 06:17 PM, Jonas Meurer wrote:
> Hello,
>
> On 2013-03-01 09:05, Andreas Ericsson wrote:
>> On 02/28/2013 10:49 PM, Jonas Meurer wrote:
>>> On 20.02.2013 16:13, Jonas Meurer wrote:
>>>> Hello,
>>>
>>> Hey again,
>>>
>>>> we're using Nagios as monitoring system for several hundred systems. While navigating through hosts and services, recently two questions regarding the quick search (in navigation bar) were raised:
>>>>
>>>> 1/ Why doesn't nagios search for host aliases as well? Is it possible to enable alias searching? We're using rather short values for host_name, and tend to add information like server position to the alias. Thus searching for host_name and alias would be awesome for us.
>>
>> Today it's not possible to enable alias searching. Patches welcome. If you create one, please use some format that makes it possible to add searching on other fields as well, such as alias~regex or some such.
>>
>>>> 2/ When searching for IP addresses, only the first match is returned. In some cases (e.g. NRPE port forwarding through a firewall), several hosts have the same IP address. For these cases it's rather irritating that only the first matching host is returned.
>>
>> Tru dat. Patches welcome. You'll want to find and remove the correct break statement, I guess. Other than that it shouldn't be much trouble.
>
> I finally managed to prepare a patch that fixes both shortcomings. It adds two new configuration options to configure the behavior of the navigation bar search: search for hostname only, or also for addresses, or also for aliases.
>
> I reported the patch as a feature request at http://tracker.nagios.org/view.php?id=459
>
> Kind regards,
> jonas

--
Andreas Ericsson andreas.erics...@op5.se
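The search behavior discussed in this thread can be sketched in Python. This is an illustration of the logic only, not the CGI code from the patch: the `Host` record, the `search_hosts` function and its two flags are hypothetical, and substring matching is an assumption (the thread does not say how the real search compares strings).

```python
from dataclasses import dataclass


@dataclass
class Host:
    """Hypothetical record, not the Nagios host struct."""
    name: str
    alias: str
    address: str


def search_hosts(hosts, query, search_aliases=True, search_addresses=True):
    """Return every host whose name (and optionally alias/address) contains query."""
    q = query.lower()
    matches = []
    for host in hosts:
        fields = [host.name]
        if search_aliases:
            fields.append(host.alias)
        if search_addresses:
            fields.append(host.address)
        if any(q in field.lower() for field in fields):
            # Keep scanning instead of breaking out on the first hit,
            # so hosts that share an address are all returned.
            matches.append(host)
    return matches


hosts = [
    Host("web1", "rack 12, web frontend", "192.0.2.10"),
    Host("db1", "rack 12, database", "192.0.2.10"),   # same forwarded address
    Host("mail1", "rack 7, mail relay", "192.0.2.25"),
]

print([h.name for h in search_hosts(hosts, "192.0.2.10")])   # ['web1', 'db1']
print([h.name for h in search_hosts(hosts, "rack 7")])       # ['mail1']
```

Collecting matches into a list is what "remove the correct break statement" amounts to: the loop no longer stops at the first host with a matching address.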
Re: [Nagios-users] Nagios v3.5.0 transitioning immediately to a HARD state upon host problem
>   notification_options d,r
>   active_checks_enabled 1
>   passive_checks_enabled 1
>   notifications_enabled 1
>   check_freshness 0
>   check_period 24x7
>   notification_period 24x7
>   check_command check-host-alive
>   contact_groups linux-infrastructure-admins
> }
>
> This is causing some real problems:
>
> 1) If a single host polling cycle has a blip, it notifies IMMEDIATELY.
> 2) Because it notifies immediately, it ignores host dependencies. So, when a WAN link goes down for example, it fires off notifications for *all* hosts at that site as fast as it can, when it should be retrying, and then walking the dependency tree.
>
> I do have translate_passive_host_checks=1 on the centralized monitor, but the way I understand it, that shouldn't affect a state going from SOFT to HARD. Am I misinterpreting this?
>
> Another variable - I'm using NConf for the configuration management, and it does some templating tricks to help with the distributed monitoring setup. But, all it does is generate config files, and I don't see any evidence in the configs as to why this would be happening.
>
> Any help would be greatly appreciated!

Set passive_host_checks_are_soft=1 in nagios.cfg on your master server and things should start working as intended.

--
Andreas Ericsson andreas.erics...@op5.se
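Why that one setting matters can be shown with a deliberately simplified model in Python. This is not the Nagios state machine: `state_type` is a hypothetical function that only captures the documented default (a passive host check result puts the host straight into a HARD state unless passive_host_checks_are_soft is enabled) and the usual retry rule for non-OK results.

```python
def state_type(passive, passive_are_soft, attempt, max_attempts):
    """Return 'HARD' or 'SOFT' for a non-OK host result (simplified model).

    passive          -- the result arrived as a passive check
    passive_are_soft -- models passive_host_checks_are_soft in nagios.cfg
    attempt          -- current check attempt, starting at 1
    max_attempts     -- models max_check_attempts for the host
    """
    if passive and not passive_are_soft:
        # Default behavior: no retries, so notifications fire at once.
        return "HARD"
    return "HARD" if attempt >= max_attempts else "SOFT"


# Passive result on the master, default settings: HARD on the first blip.
print(state_type(True, False, 1, 3))   # HARD
# Same result with the option enabled: SOFT until the retries are used up.
print(state_type(True, True, 1, 3))    # SOFT
print(state_type(True, True, 3, 3))    # HARD
```

In this model the immediate notifications described in the question correspond to the first case, and enabling the option moves the master to the second and third.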
Re: [Nagios-users] FW: Nagios3 switch.cfg
On 2013-05-15 23:25, Joel Roberts wrote:
> Typo in the email post, I do have host_name in the .cfg file, not host-name. Still returning error.

You also have "Define" instead of "define". Copy-typing is pretty much frowned upon for reasons like that.

While at it, you may want to make sure you're not using windowsy line-endings. I know some versions of Nagios complain about that, but since you haven't specified the exact version of Nagios you're using it's impossible to say if that affects you.

One thing strikes me as odd though; Nagios has never had an error message that goes "Could not add property". Are you sure you're using Nagios and not some bastard version of it? Or perhaps you just manually copied it by re-typing the error message from memory. Please don't do that. It takes longer for you and makes those who can answer your questions doubt your competence.

Now; If you attach your switch_nagios2.cfg file, I'm sure we can solve the issue quite quickly. If you insist on copy-typing and keep getting things wrong, you'll be on your own.

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] FW: Nagios3 switch.cfg
On 05/16/2013 04:30 PM, Joel Roberts wrote:
> Attached is the .cfg file. The Define was done by the email client, as you can see in the .cfg file, define host is entered correctly, as is host-name.
>
> The Nagios installation is on Debian Linux 6.0.7 Squeeze. It was installed from the packages included with the distro. The version of Nagios is 3.2.1

Well, that particular config file should work just fine. Try upgrading from sources and see if that works better. It passes with Nagios 4 at least.

--
Andreas Ericsson andreas.erics...@op5.se
[Nagios-users] svn - git conversion
As of today, the migration from subversion to git has been completed. The new git repository is still hosted at sourceforge. You can clone the repository using the following command:

  git clone git://git.code.sf.net/p/nagios/nagioscore

and then browse the code locally in your own nagioscore folder.

The svn repository will be decommissioned without further warning, so anyone who likes to stick to the bleeding edge of Nagios core should migrate to git ASAP.

Cheers

--
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] servicegroup overview not restricted for htaccess users
On 2013-05-06 10:42, Jonas Meurer wrote:
> Hello,
>
> I fear that I discovered a security issue in Nagios 3.4.4 status.cgi: all htaccess users, even if not listed in any authorized_for_* config option, have full access to the service group overview, summary and grid:
>
> /nagios/cgi-bin/status.cgi?servicegroup=all&style=overview
> /nagios/cgi-bin/status.cgi?servicegroup=all&style=summary
> /nagios/cgi-bin/status.cgi?servicegroup=all&style=grid
>
> I hope that this is not intended. Is this issue known?

It's a bit short on info. Servicegroups should be visible if the user is a contact for any service in the group. If a user who has no auth options and is not a contact for any service can see all servicegroups, then yes, that's potentially a security issue.

-- Andreas Ericsson
Re: [Nagios-users] High Availabilty with Nagios
On 2013-05-09 11:19, Steve Shipway wrote:
> Does anyone have an HA setup for Nagios that works?
>
> I'm thinking of creating a NEB module that will link two Nagios setups, and replicate over all status changes, config changes, downtime, comments, etc. and then set the 'standby' Nagios to have checks/notifications disabled when in standby mode, and enabled when in active mode. Then put the two behind a failover load balancer (F5, Foundry or apache reverse proxy).
>
> However this would be too much work if someone else has already found an equivalent solution. I've looked at Merlin but it doesn't seem to do what I'm after (and the documentation is practically nonexistent, much the same as the NEB API documentation, in fact). Mod_gearman lets me have redundant checks and replicate *active* checks, but not commands, downtime or passive checks.

Merlin would do exactly that if you set up one of the nodes as a poller with all hosts assigned to it. When the poller goes down, the master will by default take over checks for it.

Merlin is actually pretty well documented, but as textfiles that you have to read the oldschool way. If there's anything you find lacking from the HOWTO document or the README, please let me know and I'll amend it.

> Does anyone out there have a workable way to get an active/standby or active/active Nagios setup? Would be interested in hearing all ideas...

Well, we have about 800 of them.

-- Andreas Ericsson
Re: [Nagios-users] High Availabilty with Nagios
On 2013-05-09 11:50, Supporto Tecnico - Crazy Network wrote:
> I would be interested too. I'm actually using merlind for this right now, but I would like to avoid, for example, double notifications if a server goes down. I do want both Nagios instances set to notify, since if one is down (for any reason) the other one should be able to check and notify, and vice versa.

Double notifications are a bug, unless you send passive checkresults to both masters, in which case it's by design.

Usually people want to solve passive checks by arranging a single target ip or hostname to send to and then add peered nodes at that tier as necessary, so as to not have to send checkresults to multiple nodes from all the monitored machines.

-- Andreas Ericsson
Re: [Nagios-users] Nagios Not Running 0 in availability report?
On 2013-05-02 13:15, joe.pr...@vaisala.com wrote:
> Back in February our nagios host was shut down for almost 40 minutes. From my alert log:
>
> [02-06-2013 13:39:54] Nagios 3.4.4 starting... (PID=3687)
> [02-06-2013 13:01:17] Caught SIGTERM, shutting down...
>
> However, looking at availability reports for this time period (e.g. 1st Feb - 28th Feb) the row "Nagios Not Running" is set to 0d 0h 0m 0s 0.000%. In fact the entire "Undetermined" block is set to 0%.
>
> Is this a bug? Am I doing something wrong?

Are you using the "guess states during program downtime" thingie? If so, 0% undetermined makes perfect sense.

-- Andreas Ericsson
Re: [Nagios-users] Configuration changes not taking effect after restart of Nagios
On 04/24/2013 12:38 PM, Deborah Martin wrote:
> Hi,
>
> I'm using Nagios 3.4.4 on SLES 11 SP2. This has been running for around two months.
>
> Recently I needed to change the configuration for a service escalation to remove a contact group that would get sms alerts. I restarted Nagios and everything ran fine.
>
> However, it's been reported that the team that shouldn't be getting the sms alerts is still getting them.
>
> When I look at the Configuration for service escalations, I can now see two entries for the same escalation: the new one, with the sms contact group removed, and one with the old config, which has the sms contact group included.
>
> But when I look at the actual .cfg file for the service escalation definition, there is no mention of the sms group anymore.
>
> I've checked and re-checked, and still I can't see why this has happened.

There are three separate sensible explanations for that.

* You have, at some time in the past, created an objects.precache file and Nagios is being restarted with a request to use the precached file.
* Something regenerates the configuration behind your back, overwriting your changes.
* A separate Nagios process is running, triggering alerts for the faulty team while the restarted Nagios ticks on just fine.

-- Andreas Ericsson
Re: [Nagios-users] NagiosTV and merlind
On 04/16/2013 04:58 PM, Supporto Tecnico - Crazy Network wrote:
> Update: since the latest Ninja actually uses Livestatus to get its information, I learned today that merlin is no longer needed at all...
>
> So now the question would be: what would I need to edit in nagiosTV (and how, if anyone wants to spend some time on it) to let it use Livestatus?

Short answer: a lot of things.

Long answer: if it's abstracted out into neat functions like get_hosts() or some such, then not so much. If sql is sprinkled all over the place, you'll most likely never get it done without forking it completely.

-- Andreas Ericsson
Re: [Nagios-users] Passive checks not processing performance data
On 04/15/2013 03:29 AM, Grant Maxwell wrote:
> Could anyone please help me out here?

Try upgrading to the latest stable. That's pretty much always the first resort, since it's unlikely bugfixes will be backported to 3.2.3 anyway. 3.3.1 (I think) was running happily for quite a long time and is probably still the most used Nagios version, so perhaps you can get it working using that.

Failing that, try enabling debug logging and see where it goes wrong. That usually tilts a developer's head in the right direction with regard to remembering what one has fixed or which areas of code one has poked around in.

-- Andreas Ericsson
[Nagios-users] Revamp of lib/worker.c
I know it's fairly late in the dev cycle, but the current status of lib/worker.c simply isn't maintainable enough. Among the several problems with it are:

* spawn_worker() makes assumptions on how the worker manager wants to store the jobs it has sent to the worker.
* More types than necessary are made public, which makes patching and minor feature addition close to impossible.
* There is no common prefix used for functions and types, making it hard to remember which functions are public and which are private.
* It's hard to test since it ties in pretty hard with the Nagios code.

I'll be doing quite an extensive revamp over the next few days in order to address these issues. This means base/workers.c will also sustain heavy surgery.

If you're writing or have written modules or special-purpose workers that rely on any of the code about to be chainsaw-operated on, please let me know asap, and I'll do what I can to make sure your stuff keeps working. Be quick though, 'cause this needs to be done sharpish.

Thanks, and sorry for crossposting.

-- Andreas Ericsson
Re: [Nagios-users] Nagios and SELinx
On 03/13/2013 12:01 AM, Stephen H. Dawson wrote:
> Can Nagios run under SELinux?

Yes it can, but the requirements to do so are close to permissive, since there's a plethora of programs (plugins) that run under the Nagios umbrella. In order for it to be possible, Nagios needs permissions to:

* create any number of outgoing network sockets
* create incoming network sockets (as some plugins work by setting up a listener and then sending a request)
* create raw sockets (for ping)
* execute suid root programs (for ping)
* create, modify and write files, pipes and sockets on the local fs
* connect to local sockets (for local database checks)
* fork() and run without a tty
* probably a bunch of other things

It's quite a daunting task to get everything right with regards to selinux, which is why I guess no one's done it yet.

-- Andreas Ericsson
Re: [Nagios-users] Questions about Nagios quick search
On 02/28/2013 10:49 PM, Jonas Meurer wrote:
> On 20.02.2013 16:13, Jonas Meurer wrote:
>> Hello,
>
> Hey again,
>
>> we're using Nagios as monitoring system for several hundred systems. While navigating through hosts and services, two questions regarding the quick search (in the navigation bar) recently came up:
>>
>> 1/ Why doesn't nagios search for host aliases as well? Is it possible to enable alias searching? We're using rather short values for host_name, and tend to add information like server position to the alias. Thus searching for host_name and alias would be awesome for us.

Today it's not possible to enable alias searching. Patches welcome. If you create one, please use some format that makes it possible to add searching on other fields as well, such as alias~regex or some such.

>> 2/ When searching for IP addresses, only the first match is returned. In some cases (e.g. NRPE port forwarding through a firewall), several hosts have the same IP address. For these cases it's rather irritating that only the first matching host is returned.

Tru dat. Patches welcome. You'll want to find and remove the correct break statement, I guess. Other than that it shouldn't be much trouble.

-- Andreas Ericsson
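
To make the "alias~regex" idea a little more concrete, here is a rough Python sketch of what such a search could look like. It is not how the Nagios CGIs work today; the host records, the search_hosts() name and the fallback to host_name are all assumptions made for illustration. It also returns every match rather than stopping at the first one, which is the behaviour asked for in the second question.

```python
import re

# Hypothetical host records; the keys mirror Nagios host directives.
HOSTS = [
    {"host_name": "web1", "alias": "Rack 3, web frontend", "address": "10.0.0.5"},
    {"host_name": "db1", "alias": "Rack 3, database", "address": "10.0.0.5"},
    {"host_name": "mail1", "alias": "Rack 7, mail relay", "address": "10.0.0.9"},
]

def search_hosts(query, hosts, default_field="host_name"):
    """Return every host matching 'field~regex', or 'regex' on the default field."""
    field, sep, pattern = query.partition("~")
    if not sep:
        # No '~' in the query: treat it as a plain search on the default field.
        field, pattern = default_field, query
    regex = re.compile(pattern, re.IGNORECASE)
    # Collect all matches instead of breaking out after the first one.
    return [h for h in hosts if regex.search(h.get(field, ""))]
```

One limitation of this sketch: a regex that itself contains "~" would be split at the wrong place, so a real patch would need an escaping rule.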
Re: [Nagios-users] Define Multiple Event Handlers
On 02/21/2013 09:16 PM, Travis Runyard wrote:
> I'm trying to define multiple event handlers for a service but am getting this error during the preflight check:
>
> Error: Event handler command 'update-nagex,reset-proc' specified in service
>
> I've tried two different ways to define it in the service:
>
> event_handler update-nagex,reset-proc
>
> and
>
> event_handler update-nagex
> event_handler reset-proc
>
> When I define them on separate lines, Nagios seems to only pick up the last event_handler defined, which is reset-proc.
>
> Any ideas?

Create a wrapper script that runs both handlers with their proper arguments.

-- Andreas Ericsson
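
A wrapper along those lines could be sketched in Python as follows. The handler paths are placeholders, since the thread does not say which scripts sit behind update-nagex and reset-proc, and passing the same arguments to both handlers is an assumption; adjust per handler if they expect different macros.

```python
import subprocess
import sys

# Hypothetical handler paths; substitute the real scripts behind the
# update-nagex and reset-proc command definitions.
HANDLERS = [
    ["/usr/local/nagios/libexec/eventhandlers/update-nagex"],
    ["/usr/local/nagios/libexec/eventhandlers/reset-proc"],
]

def run_handlers(handlers, extra_args):
    """Run each handler in order; return the highest exit code seen."""
    worst = 0
    for handler in handlers:
        try:
            code = subprocess.call(handler + list(extra_args))
        except OSError as exc:
            # Missing or non-executable handler: report it and keep going.
            print(f"cannot run {handler[0]}: {exc}", file=sys.stderr)
            code = 127
        worst = max(worst, code)
    return worst

def main(argv):
    # When installed as a script, finish with: sys.exit(main(sys.argv[1:]))
    return run_handlers(HANDLERS, argv)
```

The service would then name a single event_handler command whose command_line points at this wrapper and passes whatever macros the two original handlers need.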
Re: [Nagios-users] Mass delete of host/service downtimes.
On 02/21/2013 12:16 PM, Páll Guðjón Sigurðsson wrote:
> The posts inspired me to write downtime support in the pynag command-line utility. It works both via livestatus and by the more traditional method of writing directly to the nagios command pipe.

You should avoid writing external commands to livestatus. They get passed to Nagios from a separate thread and are susceptible to races. This is true both in Nagios 3 and Nagios 4.

-- Andreas Ericsson
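
For reference, writing straight to the command pipe only takes a few lines. The sketch below is not pynag's implementation; the function names are made up, and the command file path is the usual default for source installs, which may well differ on a packaged system. The "[timestamp] COMMAND;args" line format is the standard external command format.

```python
import time

# Usual default for source installs; packaged installs often use another path.
COMMAND_FILE = "/usr/local/nagios/var/rw/nagios.cmd"

def format_command(name, *args, now=None):
    """Build one external command line: '[timestamp] NAME;arg1;arg2'."""
    timestamp = int(time.time() if now is None else now)
    fields = ";".join([name] + [str(a) for a in args])
    return f"[{timestamp}] {fields}\n"

def submit(line, command_file=COMMAND_FILE):
    """Write one command line to the Nagios command pipe (a FIFO)."""
    with open(command_file, "w") as pipe:
        pipe.write(line)
```

Deleting the downtime with id 42 would then be submit(format_command("DEL_HOST_DOWNTIME", 42)).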
Re: [Nagios-users] distributable downtime delete commands?
On 02/12/2013 05:24 AM, Páll Guðjón Sigurðsson wrote:
> Thanks for the sed Andreas,
>
> If anyone on the list is interested, I compared include/common.h from nagios-3.4.4 with the documentation at http://www.nagios.org/developerinfo/externalcommands/ and discovered the following discrepancies:
>
> External commands that are documented but not found in the source code:
>
> DISABLE_SERVICE_FLAP_DETECTION

This is in reality 'DISABLE_SVC_FLAP_DETECTION'.

> RESTART_PROGRAM
> SHUTDOWN_PROGRAM

These are 'RESTART_PROCESS' and 'SHUTDOWN_PROCESS', respectively.

> External commands that appear undocumented:
>
> NONE

This is an error marker for internal use only and shouldn't be documented at all.

> DELAY_HOST_SVC_NOTIFICATIONS

This is currently unimplemented.

> CANCEL_HOST_DOWNTIME
> CANCEL_SVC_DOWNTIME
> CANCEL_ACTIVE_HOST_DOWNTIME
> CANCEL_PENDING_HOST_DOWNTIME
> CANCEL_ACTIVE_SVC_DOWNTIME
> CANCEL_PENDING_SVC_DOWNTIME
> CANCEL_ACTIVE_HOST_SVC_DOWNTIME
> CANCEL_PENDING_HOST_SVC_DOWNTIME

These are unimplemented, as documented by comments in include/common.h.

A command overhaul in time for 4.1 would be a good idea. Using a hash-table to look up the command and its handler would be a very good idea indeed. Then we'd do the insert to hash-table thing as

  add_command(char *name, char *description, int (*handler)(char *cmd, char *args))

or some such and generate the help-list from that, while allowing nebs to access the same command parsing routines as the rest of Nagios. Though to be honest, I'd much rather just extend the query handler thing and let commands take key/value vectors via the socket. That way we can support multiline values properly and get rid of nonsensical attributes.

> FLUSH_PENDING_COMMANDS

This has no meaning anymore, and if I guess correctly it never had: it would be impossible to submit a command that flushes all pending commands, as any commands one wants to flush are either already parsed and processed when we get to this command, or haven't been picked up by the command queue yet (in Nagios 3, that is; in Nagios 4 it has absolutely zero meaning). I'll remove this.

> CHANGE_HOST_NOTIFICATION_TIMEPERIOD

This is undocumented for real, although its meaning should be fairly self-explanatory.

> DEL_DOWNTIME_BY_HOST_NAME
> DEL_DOWNTIME_BY_HOSTGROUP_NAME
> DEL_DOWNTIME_BY_START_TIME_COMMENT

These are the new 'distributable' downtime delete commands, which I suppose are undocumented.

> CUSTOM_COMMAND

This is currently unused, to the best of my knowledge. Its intended use is for eventbroker modules that want to accept external commands. There can't really be any documentation for it, apart from whatever the module itself decides to accept as input. Since the query handler got implemented (which allows for feedback from the targeted handler), this should be considered obsolete.

> I totally agree with Andreas that it would be nice if there was a handler that could print commands and a short description, and in a utopian world, the documentation would be generated from that so there is no need to maintain it manually.

Well, it still needs to be maintained manually, since the description and the command template aren't going to write themselves. It would just be a lot more up to date if the documentation was kept inline with the code.

-- Andreas Ericsson
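
The hash-table idea can be illustrated with a small Python sketch, where a dict stands in for the hash table. This only mirrors the add_command(name, description, handler) shape floated in the message; dispatch(), help_list() and the restart_process handler are hypothetical names, and nothing here reflects actual Nagios code.

```python
# Registry mapping command name -> (description, handler).
COMMANDS = {}

def add_command(name, description, handler):
    """Register a command; handler(cmd, args) returns 0 on success."""
    if name in COMMANDS:
        raise ValueError(f"duplicate command: {name}")
    COMMANDS[name] = (description, handler)

def dispatch(line):
    """Split 'NAME;arg1;arg2' and call the registered handler."""
    name, _, args = line.partition(";")
    entry = COMMANDS.get(name)
    if entry is None:
        return -1  # unknown command
    _, handler = entry
    return handler(name, args)

def help_list():
    """Generate the command listing from the registry itself."""
    return [f"{name}: {desc}" for name, (desc, _) in sorted(COMMANDS.items())]

# Hypothetical handler, for illustration only.
def restart_process(cmd, args):
    return 0

add_command("RESTART_PROCESS", "Restart the Nagios process", restart_process)
```

Because the help listing is built from the same table that dispatch() uses, a command cannot be registered without at least a description, which is the point about keeping documentation inline with the code.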
Re: [Nagios-users] distributable downtime delete commands?
On 02/08/2013 04:32 AM, Páll Guðjón Sigurðsson wrote:
> Thanks a lot Paul,
>
> We have been achieving the same goals with pynag scripts and the DEL_SVC_DOWNTIME command.
>
> Is there anyone on the list who might take it into their hands to keep the documentation up to date, maybe even write a routine in nagios to ask for every supported external command, so that external scripts know what external commands are supported?

It wouldn't be hard to build a command query-handler in Nagios that can list the available commands, along with a short description saying what each one does. The same could be used with macros and other things we already have pre-compiled lists of.

I've been meaning to write one that can handle external commands from users and reply with a result code to let people know how it actually went parsing, understanding and applying it anyway, so it'll get done eventually. Right now I'm working on something different though.

> The pynag team depends on the documentation to generate library functions, but we have found a few glitches in both the external commands docs and the nagios.cfg docs. Anyone willing to help us get it right deserves a beer.

Use the code instead. The external commands are all listed in include/common.h, so

  sed -n 's/^#define CMD_\([A-Z_]*\).*/\1/p'

will list all of them (although some of them are disabled internally, so you'll have to keep a list of those).

-- Andreas Ericsson
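
For a tool like pynag that is already written in Python, the same extraction can be done without shelling out to sed. The sketch below uses the same pattern as the sed expression; the SAMPLE text is a made-up excerpt in the style of include/common.h, with placeholder numeric values, and list_commands() is a hypothetical name.

```python
import re

# Same pattern as the sed expression: capture what follows "#define CMD_".
CMD_DEFINE = re.compile(r"^#define CMD_([A-Z_]*).*", re.MULTILINE)

def list_commands(header_text):
    """Return the command names defined in the given header text."""
    return CMD_DEFINE.findall(header_text)

# Made-up excerpt for illustration; the numeric values are placeholders.
SAMPLE = """\
#define CMD_NONE 0
#define CMD_ADD_HOST_COMMENT 1
/* #define CMD_COMMENTED_OUT 3 */
#define OTHER_THING 5
"""
```

As with the sed version, commands that are defined but disabled internally still show up, so a separate exclusion list is needed.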
Re: [Nagios-users] Embedded Perl in Nagios 4
On 01/18/2013 12:29 PM, Matthew Jurgens wrote:
> Hi Andreas
>
> op5 sponsor the Check WMI Plus plugin (http://edcint.co.nz/checkwmiplus/?q=donations_and_sponsorship). It's a large perl plugin where the lack of embedded perl would approx double the amount of CPU that it uses.

Actually, it's a large perl shim around a C binary called wmic. Most of the time it uses is spent in network latency because there's no way the shim can get the compiled binary to do all its work for it in a single session. check_wmi is one of the (very many) plugins where embedded perl wouldn't improve things in any way whatsoever, and the only way to get real performance out of it is to rewrite the wmic binary so that it can handle communication with the perl plugin, or so that wmic itself is turned into a plugin.

Where embedded perl *does* matter is for perl plugins that utilize a large set of perl libraries. For those cases, embedded perl can cut execution time of plugins down to as little as 25% of the original cpu and I/O cost. However, the new worker model is still up to 12 times more efficient than the old one, so at best you're looking at a tiebreak for running perl plugins.

> There are people using it so much (multiple tens of thousands of invocations per day) that they already notice the amount of CPU that it consumes.

I'm sure they do, and patches to make language-specific workers work fine out of the box are of course welcome, along with the code to make such workers fly.

> I'm hoping that op5 can assist with a solution to the Nagios 4 problem of no embedded Perl.

It's not a problem that Nagios 4 has no embedded perl. It used to be a problem that Nagios 3 with embedded perl enabled leaked so much memory that the system became unusable after a day or two of running. This is (potentially) a performance issue caused by a single plugin being heavily used in spite of it being crap code that just wraps a second program.

> Why has it been taken out anyway?

For all the reasons stated above.

> There are a lot of plugins out there that use Perl.

And for the most part it's still faster to run checks with Nagios 4 without embedded perl than it is to run checks with Nagios 3 with embedded perl compiled in and enabled. The only exception is where large and heavy plugins written in perl are extensively and almost exclusively used, and the perl loadtime dominates the time it takes for the plugin to run. That's not the case with check_wmi.

For a workaround, you may want to look into mod_gearman. It's one of the unofficially blessed eventbroker modules that are known to work well with Nagios 4, and as a happy coincidence it still supports embedded perl. I know for a fact that Nagios 4 with mod_gearman works very well, and if you put your perl-based checks in a single service group and let mod_gearman handle only those, you'll have a system that can easily handle tens of thousands of checks without breaking a sweat.

As a final note: embedded perl *may* yet come back to Nagios. If it does, it will be as a multi-stage rocket that goes like this:

1. Make Nagios recognize different script languages.
2. Make Nagios' worker process manager capable of distributing checks based on scripting language.
3. Build a special-purpose worker that supports perl caching and whatnot and get it included in Nagios.

Given that it's done properly and the code is of good quality, I'd be more than happy to take patches that implement script recognition and a perl-worker. Note that the worker will have to be able to handle the case when plugins can't be executed by it since they don't conform to whatever rules the embedded perl layer chucks up.

It will, however, *never* be in the plain core worker, which is meant to be as stable and simple as possible.

-- Andreas Ericsson
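
Stages 1 and 2 of that plan could look roughly like the Python sketch below. It assumes script languages are recognized from the shebang line, which is only one possible approach, and the worker names and function names are invented for illustration; Nagios 4 ships no such registry.

```python
import os

# Hypothetical worker names, for illustration only.
WORKERS = {"perl": "perl-worker"}
DEFAULT_WORKER = "core-worker"

def language_from_shebang(first_line):
    """Guess the interpreter from a plugin's first line; None if there is no shebang."""
    if not first_line.startswith("#!"):
        return None
    parts = first_line[2:].split()
    if not parts:
        return None
    interpreter = os.path.basename(parts[0])
    if interpreter == "env" and len(parts) > 1:
        # Handle the "#!/usr/bin/env perl" form.
        interpreter = os.path.basename(parts[1])
    return interpreter

def pick_worker(first_line):
    """Route a check to a language-specific worker, else to the core worker."""
    return WORKERS.get(language_from_shebang(first_line), DEFAULT_WORKER)

def read_first_line(path):
    """Read the first line of a plugin file, tolerating binary content."""
    with open(path, "rb") as fh:
        return fh.readline(256).decode("ascii", "replace")
```

Anything that is not recognized, including compiled plugins, falls through to the core worker, which keeps that worker free of language-specific logic. A real perl worker would additionally need a way to hand a plugin back when it cannot run it.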
Re: [Nagios-users] Nagios Update Check Is Not Running
On 01/15/2013 10:42 PM, Jon Adcock wrote:
> I noticed that the main.php page update_available blue flag never appears for me when a new version of Nagios is available. I do have check_for_updates=1, but I see the following at the top of the Nagios status.dat file:
>
> info {
>     created=1358284500
>     version=3.4.1
>     last_update_check=1340002591
>     update_available=1
>     last_version=3.3.1
>     new_version=3.4.1
> }
>
> What that tells me is that the Nagios function (that checks to see if there is an update available) hasn't run since 6/18/2012 6:56:31 GMT (1340002591). I am seeing similar indications on my other Nagios servers.

Yes. This is my fault. I'm one of the core maintainers, but I also take care of maintaining the nagios package for op5. Pretty much the only thing we change in our version is to disable update checks with a hardcoded return statement in the function that issues the actual check, since our customers get their updates from us in the form of RPM packages. Once upon a time a patch that had that change spilled over to the Nagios repository, so the update check was broken sometime between 3.3.1 and 3.4.4. I believe it's fixed in 3.4.4 though, with kudos to Eric Stanley for fixing the problem.

> 1) Is there a way to force Nagios to CHECK NOW?

Not that I know of, and it wouldn't work with the hardcoded "avoid update checks at all cost" patch in place anyway.

> 2) Does the update_check log its results somewhere?

Only in the status_file.

> 3) Do you have any recommendations on how I should approach this problem?

Upgrade to the latest version manually and then the update checks should start working again. I'm really sorry for the inconvenience.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
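For anyone wanting to check their own status file the same way, here is a small Python sketch that reads the info block and turns last_update_check into a readable UTC time. It assumes the block has one key=value pair per line, as in the status.dat excerpt quoted in this thread; the function names are made up for the example.

```python
from datetime import datetime, timezone


def parse_info_block(text):
    """Return the key=value pairs inside the first `info { ... }` block."""
    info = {}
    inside = False
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("info {"):
            inside = True
        elif inside and line == "}":
            break
        elif inside and "=" in line:
            key, value = line.split("=", 1)
            info[key] = value
    return info


def last_update_check_utc(info):
    """Format the last_update_check epoch timestamp as a UTC string."""
    stamp = int(info["last_update_check"])
    return datetime.fromtimestamp(stamp, tz=timezone.utc).strftime("%Y-%m-%d %H:%M:%S")


def stale_days(info):
    """Whole days between the last update check and the file's creation."""
    return (int(info["created"]) - int(info["last_update_check"])) // 86400


# The values reported in the original question.
SAMPLE = """info {
    created=1358284500
    version=3.4.1
    last_update_check=1340002591
    update_available=1
    last_version=3.3.1
    new_version=3.4.1
    }
"""
```

For the sample above, `last_update_check_utc(parse_info_block(SAMPLE))` gives `2012-06-18 06:56:31`, matching the date worked out in the question.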
Re: [Nagios-users] Embedded Perl in Nagios 4
On 12/18/2012 09:29 AM, Samuel Kidman wrote:
> Hello
>
> I just had a look at Ethan's slide decks on Nagios 4 from the 2012 Nagios Conference, and found out that embedded perl won't be in Nagios 4. I'm running a distributed Nagios deployment that makes extensive use of perl plugins, and the performance of my Nagios servers is beginning to struggle a bit as a result. I was working on making my plugins work with embedded perl, but since this is now not an option I was wondering what others will be doing to optimise perl plugins without embedded perl?

For starters, you should just upgrade, and it's entirely possible that the performance issues go away completely. Nagios 4 has awesome check scaling.

The second thing to do would be to look into running mod_gearman with workers living on the same server as the master Nagios process. mod_gearman still has embedded perl support (although it really only makes a difference for large plugins).

The third thing to do would be to inspect your most expensive plugins (expensive in terms of Perl loadtime multiplied by the number of times the plugin is used for any given time interval) and see if rewriting them in a different language makes a huge difference. We did that for the snmp interface checks when one of our large customers wanted to monitor some 6 services. Perl simply wasn't fast enough. Embedding it meant we couldn't fork() fast enough (embedding languages has its own overhead too), and it still leaked memory, so we rewrote them in C and we cut system load by more than 85%.

The fourth thing to do would be to either sponsor a developer or buy development time to build a special-purpose Nagios worker that handles perl checks and that has a perl interpreter and a cache embedded. It's not certain that would be better than just running the perl interpreter directly. Without a cache, embedded perl is completely useless, since it still has to bytecompile all the modules, and with it, it can grow to consume all memory on the system, no matter how carefully you write your plugins. Tradeoffs, compromises and possible leaks no matter what you do if you start down that route. It might be awesome though, so I guess it could be worth a shot.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
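The "most expensive plugins" measure from the third point (load time multiplied by how often the plugin runs in a given interval) is easy to tabulate. A minimal sketch in Python follows; the plugin names and numbers are invented purely for illustration, and how you measure load time is left open.

```python
def rank_by_cost(plugins):
    """Rank plugins by load time multiplied by runs per interval.

    `plugins` is an iterable of (name, load_seconds, runs_per_interval)
    tuples. Returns (name, cost) pairs, most expensive first.
    """
    costs = [(name, load_seconds * runs) for name, load_seconds, runs in plugins]
    return sorted(costs, key=lambda item: item[1], reverse=True)


# Invented example data: a heavy plugin that runs rarely can still cost
# less than a light one that runs constantly.
EXAMPLE = [
    ("check_heavy.pl", 1.0, 10),
    ("check_medium.pl", 0.3, 100),
    ("check_light.pl", 0.05, 2000),
]

for name, cost in rank_by_cost(EXAMPLE):
    print("%-16s %8.1f s of interpreter load per interval" % (name, cost))
```

The plugins at the top of such a list are the ones where a rewrite in another language is most likely to pay off.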
Re: [Nagios-users] service checks running too often
On 12/14/2012 05:13 AM, Mark Keisler wrote:
> I think I found the issue. If I happen to send a reload (HUP) to nagios while a service check is in progress (fairly easy since my service check is rather long lived), the reloaded nagios doesn't seem to know about that service check, and so I'll end up with another being scheduled as well as the original on its schedule. Create a dummy service check that just sleeps for 30 seconds or something, issue a reload while it is running, and see if your nagios instance will start another sequence of service checks.

This should be pretty easily fixed by just adding a check reaping event before initializing the event queue and skipping all checks that have already been scheduled.

I'll have to add a check for it in 4.x. Since we keep workers between reloads, the same thing can easily happen there. That means we'll reschedule all checks like normal when we're starting, but if a check result comes in when a new check is already scheduled, we'll remove the old event and reschedule a new one according to the retry interval.

I'd suggest doing something similar in the 3.4.x branch, but I'm not sure I can commit to that one without doing a new svn clone, and that takes at least a day.

Mark, would that be acceptable to you?

Oh, and good catch :)

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
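The rescheduling rule described for 4.x (a late check result replaces any event already queued for that service) can be pictured with a toy scheduler. This is a sketch in Python under simplifying assumptions, not the Nagios event queue: the class, its methods and the use of plain numbers for time are all hypothetical.

```python
class ToyScheduler:
    """Keeps at most one pending check event per service."""

    def __init__(self):
        self.events = {}  # service name -> time of next scheduled check

    def schedule(self, service, when):
        """Schedule (or overwrite) the next check for a service."""
        self.events[service] = when

    def handle_result(self, service, now, interval):
        """A check result arrived, possibly from before a reload.

        Drop whatever event is already queued for the service and
        schedule a fresh one `interval` seconds from now. Returns the
        time of the event that was removed, or None if there was none.
        """
        old = self.events.pop(service, None)
        self.events[service] = now + interval
        return old
```

Because the result handler removes the old event before adding the new one, a check that was still running across a reload cannot leave two parallel sequences of checks behind.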
Re: [Nagios-users] service checks running too often
On 12/14/2012 04:19 PM, Mark Keisler wrote:
> What you propose sounds acceptable. In the meantime I need to be careful about reloading nagios :). Once I get it in that state, I have to disable use_retained_scheduling_info and then do a full restart.

I've actually checked Nagios 4 now, and it appears we don't do this there. I didn't test it all that thoroughly (and I probably should), but it's Friday and I'm two beers past my best-before-thinking hour, so I'll just refrain from trying it further today.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] refresh and stay on current page
On 12/14/2012 04:34 PM, Leonardo Bacha Abrantes wrote:
> Hello guys!
>
> I'm using nagios 3.4.1 and when I press F5 to refresh the page, nagios goes to the home page. How can I configure it to stay on the current page when F5 is pressed?

If you're using Firefox, you can then go to chrome://settings and set the variable pixie_dust_my_frames and it will magically do the right thing. It's possible it's only available through some plugin though.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] servicedependecy is not working
On 12/04/2012 03:52 PM, Leonardo Bacha Abrantes wrote:
> Hey Andreas,
>
> yes, that is. However, if I run a passive check of nrpe, nagios does not send alerts. I'm testing it to confirm.

So the notification for '/var partition' only goes out when NRPE is in PENDING state?

> One doubt: /var was checked and it failed. When that happens, will nagios automatically re-check the nrpe plugin or only check the current status?

Only if active checks are enabled for the check and it can run it at the time it wants to run the dependency check.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 12/04/2012 02:29 PM, Supporto Tecnico - Crazy Network wrote:
> Here from object.cache:
>
> define servicegroup {
>     servicegroup_name ftp-server
>     alias FTP Server
> }
>
> define service {
>     host_name Server.SysAdminDiary.it
>     service_description Server FTP
>     check_period 24x7
>     check_command check_ftp
>     contact_groups sysadmindiary.it
>     notification_period 24x7
>     initial_state o
>     hourly_value 1
>     check_interval 5.00
>     retry_interval 2.00
>     max_check_attempts 3
>     is_volatile 0
>     parallelize_check 1
>     active_checks_enabled 1
>     passive_checks_enabled 1
>     obsess 1
>     event_handler_enabled 1
>     low_flap_threshold 0.00
>     high_flap_threshold 0.00
>     flap_detection_enabled 1
>     flap_detection_options a
>     freshness_threshold 0
>     check_freshness 0
>     notification_options r,c
>     notifications_enabled 1
>     notification_interval 60.00
>     first_notification_delay 0.00
>     stalking_options n
>     process_perf_data 1
>     icon_image ftp.png
>     retain_status_information 1
>     retain_nonstatus_information 1
> }
>
> define contact {
>     contact_name andrea.iannucci
>     alias Andrea Iannucci
>     service_notification_period 24x7
>     host_notification_period 24x7
>     service_notification_options r,c
>     host_notification_options r,d,u
>     service_notification_commands notify-service-by-email,notify-service-by-sms
>     host_notification_commands notify-host-by-email,notify-host-by-sms
>     email supp...@crazynetwork.it
>     pager 00393388552885
>     minimum_value 1
>     host_notifications_enabled 1
>     service_notifications_enabled 1
>     can_submit_commands 1
>     retain_status_information 1
>     retain_nonstatus_information 1
> }
>
> Apparently in the contact definition the contact_groups disappears and it comes out as an empty line.

In objects.cache the contacts are always written to the contactgroup's members variable. If it's available there, Nagios knows about it. If not, then there's something seriously wrong.

> I'm preparing the box to privately send login data so you can check by yourself.

Much appreciated.

> Many thanks.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
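Checking the contactgroup's members variable by hand gets tedious on a large objects.cache, so here is a rough Python sketch that does the lookup. It assumes the usual on-disk layout of one directive per line inside `define <type> { ... }` blocks; the function names and the sample data are made up for the example and this is not a full parser of the Nagios object format.

```python
def parse_objects(text):
    """Parse `define <type> { ... }` blocks into (type, directives) pairs."""
    objects = []
    current_type, current = None, None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith("define ") and line.endswith("{"):
            current_type = line[len("define "):-1].strip()
            current = {}
        elif line == "}" and current is not None:
            objects.append((current_type, current))
            current_type, current = None, None
        elif current is not None:
            parts = line.split(None, 1)  # directive name, then its value
            current[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
    return objects


def group_members(objects, group_name):
    """Return the member list of a contactgroup, or None if it is missing."""
    for obj_type, fields in objects:
        if obj_type == "contactgroup" and fields.get("contactgroup_name") == group_name:
            members = fields.get("members", "")
            return [m.strip() for m in members.split(",") if m.strip()]
    return None


# Illustrative data: a group as it should look, and one with no members.
HEALTHY = """define contactgroup {
    contactgroup_name sysadmindiary.it
    alias SysAdminDiary.it Staff
    members andrea.iannucci
    }
"""
BROKEN = """define contactgroup {
    contactgroup_name sysadmindiary.it
    alias SysAdminDiary.it Staff
    }
"""
```

An empty list for a group that should have members is the "something seriously wrong" case: the group exists, but Nagios does not know who belongs to it.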
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 12/05/2012 02:28 PM, Supporto Tecnico - Crazy Network wrote:
> define contactgroup {
>     contactgroup_name sysadmindiary.it
>     alias SysAdminDiary.it Staff
> }
>
> The only contactgroup I can see in object.cache.
>
> I did send you a private mail with login data for the box.

You did, and I'm perplexed. The problem arises when precaching objects (which is discouraged in Nagios 4, since it actually makes config loading slower rather than faster for almost all cases). The objects.precache file has you as a member of the contactgroup, but the objects.cache file does not.

If I don't precache objects, it doesn't matter how I specify that you should be a member of the group. You still get notifications. If I do use the precache file, you don't get notifications no matter how I specify it.

Now that I've noticed this discrepancy, it appears there are other issues with group membership parsing as well, and they only appear when reading the precached object file. I'm working on a patch as we speak and it appears I've got things fixed. I'll just run some more tests and then send it out muy pronto.

Thanks for letting me sneak a peek at your server. This one would most likely have baffled me for quite some time otherwise. This should sort out many of the "Nagios doesn't send notifications" issues that have cropped up here and there and that I've been unable to reproduce.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 12/05/2012 04:12 PM, Supporto Tecnico - Crazy Network wrote:
> Nice to know it is not a problem with the way I wrote my config. Let me know when I can retry the install and when I can reset the testbox provided.

You can test it now.

> For now, thanks :)

You're welcome, and thank you too :)

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 12/05/2012 04:54 PM, Supporto Tecnico - Crazy Network wrote:
> Ok, seems it is working! Recompiled, restarted nagios and...
>
> [1354722485.871976] [032.1] [pid=22807] Service notification will NOT be escalated.
> [1354722485.871983] [032.1] [pid=22807] Adding normal contacts for service to notification list.
> [1354722485.871990] [032.0] [pid=22807] No contacts were found for notification purposes. No notification was sent out.
> [1354722682.351821] [032.0] [pid=25265] ** Service Notification Attempt ** Host: 'Server.SysAdminDiary.it', Service: 'Server FTP', Type: 0, Options: 0, Current State: 2, Last Notification: Thu Jan 1 01:00:00 1970
> [1354722682.351891] [032.0] [pid=25265] Notification viability test passed.
> [1354722682.351903] [032.1] [pid=25265] Current notification number: 1 (incremented)
> [1354722682.351914] [032.1] [pid=25265] Service notification will NOT be escalated.
> [1354722682.351925] [032.1] [pid=25265] Adding normal contacts for service to notification list.
> [1354722682.352187] [032.0] [pid=25265] 1 contacts were notified. Next possible notification time: Wed Dec 5 17:51:22 2012
> [1354722682.352203] [032.0] [pid=25265] 1 contacts were notified.
>
> And emails are arriving now! Thanks a lot Andreas

Excellent. Thanks for testing so promptly :)

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 12/03/2012 04:41 PM, Supporto Tecnico - Crazy Network wrote:
> Hi,
>
> I re-installed nagios again (from the git repo) using:
>
> git clone git://github.com/ageric/nagios.git
>
> But I still get the same error at debug level 32:
>
> [1354549057.568949] [032.0] [pid=4151] ** Service Notification Attempt ** Host: 'Server.SysAdminDiary.it', Service: 'Server FTP', Type: 0, Options: 0, Current State: 2, Last Notification: Thu Jan 1 01:00:00 1970
> [1354549057.569017] [032.0] [pid=4151] Notification viability test passed.
> [1354549057.569029] [032.1] [pid=4151] Current notification number: 1 (incremented)
> [1354549057.569040] [032.1] [pid=4151] Service notification will NOT be escalated.
> [1354549057.569049] [032.1] [pid=4151] Adding normal contacts for service to notification list.
> [1354549057.569056] [032.0] [pid=4151] No contacts were found for notification purposes. No notification was sent out.
>
> Here is the contacts file:
>
> define contact{
>     name cn-contacts
>     service_notification_period 24x7
>     host_notification_period 24x7
>     service_notification_options c,r
>     host_notification_options d,u,r
>     service_notification_commands notify-service-by-email
>     host_notification_commands notify-host-by-email
>     register 0
> }
>
> define contact{
>     contact_name andrea.iannucci
>     use cn-contacts
>     alias Andrea Iannucci
>     email supp...@crazynetwork.it
>     contact_groups sysadmindiary.it
>     service_notification_commands notify-service-by-email,notify-service-by-sms
>     host_notification_commands notify-host-by-email,notify-host-by-sms
>     pager 00391234567890
> }
>
> define contactgroup{
>     contactgroup_name sysadmindiary.it
>     alias SysAdminDiary.it Staff
> }
>
> Here is the service definition:
>
> define service{
>     name cn-ftp-service
>     active_checks_enabled 1
>     passive_checks_enabled 1
>     parallelize_check 1
>     obsess_over_service 1
>     check_freshness 0
>     notifications_enabled 1
>     event_handler_enabled 1
>     flap_detection_enabled 1
>     process_perf_data 1
>     retain_status_information 1
>     retain_nonstatus_information 1
>     is_volatile 0
>     servicegroups ftp-server
>     service_description Server FTP
>     check_command check_ftp
>     icon_image ftp.png
>     check_period 24x7
>     max_check_attempts 3
>     normal_check_interval 5
>     retry_check_interval 2
>     contact_groups sysadmindiary.it
>     notification_options c,r
>     notification_interval 60
>     notification_period 24x7
>     register 0
> }
>
> define servicegroup {
>     servicegroup_name ftp-server
>     alias FTP Server
> }
>
> Here is the service call in the server host file:
>
> define service{
>     use cn-ftp-service
>     host_name Server.SysAdminDiary.it
> }
>
> Any suggestions/hints?

So the contactgroup is inherited from a template. That might be a clue. Can you verify if the contactgroup is present on the proper services in your objects.cache file?

Also, if you're not keeping secrets in the configuration, it would help if I could get my hands on it so I can see the problem for myself. If template inheritance is broken for contacts and/or contactgroups, that's a pretty serious issue.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] Service scheduled for one week later....
On 12/03/2012 11:57 PM, Rodrigo Gesswein wrote:
> Dear All:
>
> I have a curious question about 'timeperiod' and 'service'. My 'cfg' files look like:
>
> define timeperiod{
>     timeperiod_name DayTime
>     alias DayTime
>     monday 08:00-19:00
>     tuesday 08:00-19:00
>     wednesday 08:00-19:00
>     thursday 08:00-19:00
>     friday 08:00-19:00
>     saturday 08:00-19:00
>     sunday 08:00-19:00
> }
>
> define timeperiod{
>     timeperiod_name NightTime
>     alias NightTime
>     monday 19:01-07:59
>     tuesday 19:01-07:59
>     wednesday 19:01-07:59
>     thursday 19:01-07:59
>     friday 19:01-07:59
>     saturday 19:01-07:59
>     sunday 19:01-07:59
> }

This is wonky. A range whose start time is later than its end time wraps past midnight. Make it '00:00-07:59,19:01-24:00' instead.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
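The rewrite suggested in the reply is mechanical: any range that would cross midnight is split into a part before the end time and a part after the start time, both within the same day. A small Python sketch of that transformation follows; the function names are invented for the example, and it only handles a single HH:MM-HH:MM range.

```python
def to_minutes(hhmm):
    """Convert 'HH:MM' to minutes since midnight."""
    hours, minutes = hhmm.split(":")
    return int(hours) * 60 + int(minutes)


def normalize_range(timerange):
    """Rewrite 'HH:MM-HH:MM' so that it never crosses midnight.

    Ranges that already run forward are returned unchanged. A range
    such as 19:01-07:59 is split into the early-morning part and the
    evening part of the same day.
    """
    start, end = timerange.split("-")
    if to_minutes(start) < to_minutes(end):
        return timerange
    return "00:00-{},{}-24:00".format(end, start)


print(normalize_range("19:01-07:59"))  # 00:00-07:59,19:01-24:00
print(normalize_range("08:00-19:00"))  # 08:00-19:00
```

Applied to each weekday line of the NightTime period, this yields exactly the value given in the reply.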
Re: [Nagios-users] Nagios 3.99.95 contact_groups
On 11/29/2012 07:17 PM, Supporto Tecnico - Crazy Network wrote:
> Thanks a lot :)
>
> Just two questions:
>
> 1 - Where can I find the patch you mentioned?

In git, soon.

> 2 - Will I get the latest (patched) version with git clone from the repository?

In a (very) short while, yes.

> Tomorrow I'll grep the latest git repo and try to recompile and test.

Since today was tomorrow yesterday that might be hard, unless you're working late. It'll be something you can try on Monday though.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
Re: [Nagios-users] circular parent/child chain
On 11/19/2012 07:57 PM, Lucy Pelzer wrote:
> Hi List,
>
> thanks for your fast replies, but I think you missed my point. I know that I could configure one parent and alert about spanning tree or do a lot of other stuff, but this would not help me at all in the following situations.
>
> Let's say that I have a lot of important infrastructure behind switch-4 and switch-5 and that the normal STP path is to route through switch-2 for switch-5 and through switch-3 for switch-4. Let's also suppose that there is not much important infrastructure at switch-2. Now suppose switch-2 fails at the same time as some really important infrastructure behind switch-5. I will only get informed that switch-2 failed, because switch-5 is a child of switch-2, but I will not be informed about the more important stuff at switch-5. Now I will spend a lot of time replacing switch-2 before I even find out that there is something really important going on at switch-5, only because I could not give it a second parent (switch-4). Maybe the other problem would have been fixed in minutes if I had known about it. I think it is a bad idea to have such parent/child chains. And please keep in mind that this is just a simple example and that a network can be much bigger than just 5 switches.
>
> The other point is that I will never see the real network infrastructure in the statusmap. As far as I can read the documentation, I should also be able to build up my whole infrastructure like it is in the statusmap. Or what is it for at all?

Parent/child relations are primarily to block notifications for stuff that is only down because some earlier stage in the traffic shoveling is down.

> I would be really pleased if somebody could answer my question.
>
> - Why is it not possible in nagios to be parent and child of another host at the same time?
> - Why is it a deadlock and where is the problem?

Because if A is a parent of B and B is a parent of A we'd never run out of parents to check when trying to determine the root cause of the problem.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
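To see why "never run out of parents" is the problem, it helps to look at the parent relations as a directed graph and search it for a loop. The following Python sketch does that with a depth-first search; it is an illustration of the idea only, not the check Nagios itself performs, and the function name and data layout are assumptions.

```python
def find_cycle(parents):
    """Return a list of hosts forming a parent/child loop, or None.

    `parents` maps each host name to the list of its parent hosts.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, finished
    state = {}

    def visit(host, path):
        state[host] = GREY
        path.append(host)
        for parent in parents.get(host, []):
            parent_state = state.get(parent, WHITE)
            if parent_state == GREY:
                # We walked back into our own path: that is a loop.
                return path[path.index(parent):] + [parent]
            if parent_state == WHITE:
                found = visit(parent, path)
                if found:
                    return found
        path.pop()
        state[host] = BLACK
        return None

    for host in parents:
        if state.get(host, WHITE) == WHITE:
            cycle = visit(host, [])
            if cycle:
                return cycle
    return None


# switch-2 and switch-3 listed as each other's parents: rejected.
MESHED = {
    "switch-1": [],
    "switch-2": ["switch-1", "switch-3"],
    "switch-3": ["switch-1", "switch-2"],
}
print(find_cycle(MESHED))
```

Any walk from a host up through its parents has to end somewhere for root-cause analysis to finish; a configuration where `find_cycle` returns a loop offers no such end.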
Re: [Nagios-users] circular parent/child chain
On 11/17/2012 06:08 PM, Lucy Pelzer wrote:
> Hi List,
>
> I am in charge of a big network infrastructure and at the moment we are evaluating solutions for our nagios monitoring with configuration via monarch. I really would like to create the new solution with nagios and puppet. There is only one big problem. In our infrastructure there are a lot of switches connected together like this:
>
>        | switch-1 |
>         /        \
>        /          \
>       /            \
> | switch-2 |-| switch-3 |
>      |             |
>      |             |
> | switch-5 |-| switch-4 |
>
> Every time I try to configure these switch connections in nagios with more than one parent, where the hosts are parent and child at the same time, I get the following error:
>
> ERROR: The host 'switch-2' is part of a circular parent/child chain!
> ...
>
> I know by now that this is because nagios has to be hierarchical, but I do not understand why. Maybe somebody could tell me exactly why I can't configure this in nagios, or better, could tell me how I can do it. I only found the explanation that this could lead to a deadlock. Maybe I am wrong, but is this not a likely network infrastructure? After all, it is no problem to build something like this with STP running in your network.

It is, but parent-child relations concern network data flows, and not necessarily the cables connecting various pieces of equipment. In your case switch5 should have switch2 as parent, switch4 should have switch3 as parent, and switch2 and switch3 should both have switch1 as parent.

Think of how the traffic goes. It will never travel like this:

switch1 - switch2 - switch3 - switch4

but it will travel like this:

switch1 - switch3 - switch4

(assuming you have a somewhat sane network, that is).

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se
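With the single-parent layout suggested in the reply, finding the likely root cause of an outage is a simple walk up the chain. The Python sketch below illustrates this under simplifying assumptions (one parent per host, a known set of down hosts); it is not Nagios's actual reachability logic, and the names are hypothetical.

```python
# Parent layout from the reply: traffic flows 1 -> 2 -> 5 and 1 -> 3 -> 4.
PARENT_OF = {
    "switch-1": None,
    "switch-2": "switch-1",
    "switch-3": "switch-1",
    "switch-4": "switch-3",
    "switch-5": "switch-2",
}


def root_cause(host, parent_of, down):
    """Return the topmost host in an unbroken chain of down ancestors.

    If the host's own parent is up, the host itself is the root cause.
    The loop only terminates because the parent relation has no cycles.
    """
    cause = host
    current = parent_of.get(host)
    while current is not None and current in down:
        cause = current
        current = parent_of.get(current)
    return cause


print(root_cause("switch-5", PARENT_OF, {"switch-5", "switch-2"}))  # switch-2
print(root_cause("switch-4", PARENT_OF, {"switch-4"}))              # switch-4
```

In the first call switch-5 is only unreachable because switch-2 is down, which is the situation where notifications for the child are held back.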
Re: [Nagios-users] Notifications question
notifications_enabled1 contact_groups WISESRV-critical stalking_options w,c register 0 } define service { name generic-service is_volatile 0 max_check_attempts 5 check_interval 60 retry_interval15 active_checks_enabled 1 passive_checks_enabled 1 check_period none parallelize_check 1 obsess_over_service 1 check_freshness 0 event_handler_enabled1 flap_detection_enabled1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 notification_interval 60 notification_period none notification_options w,u,c,r notifications_enabled1 contact_groupsadmins stalking_options o,w,u,c failure_prediction_enabled 1 register 0 } Can you please point me in the right direction? Am I misreading something? And sorry for the long post. I should have used some web pasting tool. Just remembered. :) Best regards, Bruno Martins -- Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_sfd2d_oct ___ Nagios-users mailing list Nagios-users@lists.sourceforge.netmailto:Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- LogMeIn Central: Instant, anywhere, Remote PC access and management. Stay in control, update software, and manage PCs from one command center Diagnose problems and improve visibility into emerging IT issues Automate, monitor and manage. Do more in less time with Central http://p.sf.net/sfu/logmein12331_d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] Chage/add colors for alerts display
On 10/27/2012 09:53 AM, Network Operation Center FMC Luxemburg wrote:
> Hi Everybody
>
> Is it possible to have more colors than 'red', 'green' and 'orange' for displaying the alerts?
>
> The principle in the plugins would be:
> - The variable STATE_WARNING = 1 gives an orange color for the display
> - Is it possible to create a new variable (i.e. STATE_SOMETHING) set by a particular event, and associated with a blue color for example?

No, it's not. It would be possible to assign a particular color to denote that something is amiss due to a particular dependency or some such, but adding a new exit code isn't very likely to happen.

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Passive-only master still pinging
On 10/23/2012 11:00 PM, Mike Lindsey wrote:
> I've got a passive-only master that is configured to never execute checks. Yet it's still performing ping checks for some hosts at some times. This is mostly just annoying, but when it decides to ping hosts that it doesn't have a route to, pagers go off.
>
> I've got 30k services in this config, so debug isn't really an easy option. Seeing this on 3.3.1.
>
> Any ideas?

If the master accepts passive checks, it's impossible to disable host checks completely, since some dependency checks are done without any care to whether or not modules say "I've got this. You go back to sleep."

The only way I know of to completely block out host checks from a master is to use Merlin as pollers, leave all the checks to pollers and set takeover = no for all pollers while avoiding sending passive checks to the master. That means the master will never be the point of origin for any problems in the network, and dependency checks never happen.

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] solutions for off-server PNP4Nagios perfdata processing?
On 10/03/2012 04:33 PM, Frost, Mark {BIS} wrote:
> Hello. Has anyone come up with solutions for processing Nagios performance data on a server other than a Nagios server?
>
> We've been processing perfdata results on our Nagios server(s) for a while now and increasingly it's just eating up too much I/O to make me comfortable. Yes, we do use rrdcached and yes, I realize that shuffling data around on different disk spindles and controllers would help, but in today's world where companies don't like building any kind of physical server let alone one with all that additional hardware, that's not entirely an option for us.
>
> I realize that once the perfdata files are on the dedicated graphing server(s), processing them into RRD files there should be a no-brainer. My problem is figuring out how to get them there without say, using a NAS device. (If I/O's a problem locally, I don't want to shuffle that I/O to an even slower network device).
>
> It would be ideal if somehow there was a process that I could just send that data to and have it picked up remotely. Like if maybe Merlin had a special kind of peer that just received a stream of perfdata or something.
>
> Anything else I could imagine would be some kind of home-grown solution like say pumping events into a messaging system from the Nagios server(s) and then letting the graphing server pick them up from the message queue(s). I could also imagine some kind of fancy-pants module in Nagios 4 that did something like this, maybe.
>
> Any thoughts would be appreciated.

Using the NERD radio you will be able to stream your checkresults off-site (once the macro-processing patches are done, that is). It's Nagios 4 only, but I know you've been looking at that already so perhaps that's not a big issue. The "will be" is a bit of a showstopper though, but I have to have it done by next Wednesday.

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] solutions for off-server PNP4Nagios perfdata processing?
On 10/03/2012 05:58 PM, Frost, Mark {BIS} wrote:
> Dan,
>
> As I understand it, the issue is less about Nagios and more about npcd. Nagios merrily produces the perfdata files and then npcd comes along and scoops them up, but as it's processing them it's opening a lot of rrd files and inserting data into them.

That's what rrdcached is for. It's supposed to stash the perfdata in memory so writes don't have to happen so often. You defeat the purpose if you use a cache timer shorter than your check interval, but if you have, say, a 30 minute cache time and a 5 minute check interval, you basically cut I/O down to 1/6th of the normal. If you also hook up a decent-sized SSD, you get 21 times the previous throughput, so all in all you can get a rough increase of 12000% with both those options enabled.

> So really it's npcd that's the problem. Well, not really a problem, but ultimately it's doing its thing and then Nagios gets less than a fair share of the box's I/O. It's not that it's horrible right now, but we're starting to notice it and I would tend to be concerned about scaling problems.
>
> Honestly even with Nagios 3, it seems like Nagios' own I/O is entirely manageable so far with strategic use of ram disk. It's just putting Nagios and PNP4Nagios (plus Apache to serve up the graph contents, which I'm also not happy about going on on the same server) on the same boxes that I don't like.

With Nagios 4, set status_file=/dev/null and use livestatus instead and you're golden. Apart from the nagios.log, there's no other I/O going on from Nagios' side, so it won't get stuck waiting for pnp to get done.

-- 
Andreas Ericsson andreas.erics...@op5.se
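The rough I/O estimate in the reply above can be reproduced with a few lines of shell. This is only a sketch of the arithmetic, assuming the two effects (fewer flushes through rrdcached, faster disk) simply multiply; the function name `io_gain_percent` is made up for illustration.

```shell
#!/bin/sh
# Rough throughput multiplier, in percent of the original, from the figures
# quoted in the thread. Assumption: the two gains are independent and multiply.
io_gain_percent() {
    cache_minutes=$1      # rrdcached cache time
    interval_minutes=$2   # check interval
    ssd_factor=$3         # claimed SSD speed-up factor
    echo $(( cache_minutes / interval_minutes * ssd_factor * 100 ))
}

# 30 minute cache, 5 minute interval, 21x from the SSD
io_gain_percent 30 5 21
```

With the numbers from the thread this gives a factor of 6 x 21 = 126, which is in the same ballpark as the "rough 12000%" figure.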
Re: [Nagios-users] nagios writing escalation rules multiple times to objects.cache
On 10/01/2012 09:37 PM, Chris Baldwin wrote:
> Short version: I have an ever-growing Nagios install for monitoring a bunch of linux hosts (currently 99 hosts, 2322 services; I plan on adding 115 more hosts, 1500+ services). I've noticed something odd with my escalation rules - they're being repeated multiple times in my objects.cache file. This has started to affect performance for parts of my nagios install, to the point where it's painfully slow to use the web interface. My google-fu is weak today, so I was hoping someone here could point me in the right direction.
>
> Longer version: I have 4 escalation rules:
> - Our helpdesk gets notification #1 for critical issues.
> - Our on-call person gets notifications 1 - 12 @ 5 minute intervals 24x7.
> - The relevant IT-group leader(s) get notifications 5-12 @ 5 minute intervals during on call periods.
> - Our CIO gets notification 12 - infinity at 60 minute intervals during on call periods.
>
> We use puppet to control our environment, and it's amazing for deploying servers and adding them to nagios. Once I'm able to bring in other aspects of our environment under puppet control (firewall, sudo, yum repos), it will be trivial to set up a server from scratch and monitor it.
>
> In order to create a new set of escalation rules, we use a custom class on the puppet server and a small bit of code to be executed from the client-side (of puppet) to make this work. An example:
>
>     # Escalate to the_boss. He, in turn, will call people. I imagine this
>     # to be along the lines of Hulk nudging Thor playfully in The
>     # Avengers. And sending him flying through a few bulkheads.
>     nagios::server::escalations { 'Boss-critical':
>         contact_groups        => 'the_boss',
>         escalation_options    => 'c,r',
>         escalation_period     => 'oncall_hours',
>         first_notification    => 12,
>         last_notification     => 0,
>         notification_interval => 60,
>         servicegroup_name     => 'Disk,Ping,HTTP,Load,MySQL,Ping,Procs,SSH,Swap,Users,Zombie',
>     }
>
> I know this portion works correctly - it's producing my desired result, which is 1 file per (set) of escalation rules specified. I have 1722 escalation cfg files. The cfg files look something like this:
>
>     define serviceescalation{
>         contact_groups the_boss
>         escalation_options c,r
>         escalation_period oncall_hours
>         first_notification 12
>         host_name my.hostname.xyz
>         last_notification 0
>         notification_interval 60
>         #service_description Disk,Ping,HTTP,Load,MySQL,Ping,Procs,SSH,Swap,Users,Zombie
>         servicegroup_name Disk,Ping,HTTP,Load,MySQL,Ping,Procs,SSH,Swap,Users,Zombie
>     }

So you're assigning it to a host_name along with a set of servicegroups. I'm not entirely sure that makes 100% sense, since servicegroup members already have a host_name. It might work better with Nagios 4, but I'm not sure. If it doesn't, I'll fix it so 'service_description' is required when 'host_name' or 'hostgroup_name' is set, as I don't see how one makes sense without the other.

> My questions to you guys:
> - Am I crazy to think that it's reading every rule once for *each* server?

It seems as if it's reading the rule once for each host mentioned in host_name and then assigning it to each member of the servicegroups listed, so if you have identical escalations assigned to the same set of servicegroups then this is really how you're configuring your Nagios.

Nagios 4 has provisions to compare slave objects and avoid adding multiple ones, which would hide a potential bug in your config. It's currently only used for dependencies, but making it work with escalations too would be the final fallback to fix this. However, I urge you to look over your configuration first to make sure you don't really have multiple escalations assigned to the same set of servicegroups.

> I tried using the precache, it didn't help. Both files were created by my nagios install.

That's not surprising, as precaching and caching use the exact same code.

-- 
Andreas Ericsson andreas.erics...@op5.se
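One way to follow the advice above and look for escalations that really are defined more than once is to count identical blocks in objects.cache. The following is a minimal sketch, not something from the thread: the function name `dup_escalations` is hypothetical, it assumes the usual one-directive-per-line layout of objects.cache, and it treats two escalations as identical when host_name, service_description and contact_groups match (notification numbers and periods are ignored).

```shell
#!/bin/sh
# Print "count host;service;contact_groups" for every service escalation
# that appears more than once in the given object cache file.
dup_escalations() {
    awk '
        /^define serviceescalation/ { in_block = 1; host = ""; svc = ""; grp = ""; next }
        in_block && $1 == "host_name"      { host = $2; next }
        in_block && $1 == "contact_groups" { grp = $2; next }
        in_block && $1 == "service_description" {
            # descriptions may contain spaces, so keep the rest of the line
            $1 = ""; sub(/^[ \t]+/, ""); svc = $0; next
        }
        in_block && /^[ \t]*[}]/ { seen[host ";" svc ";" grp]++; in_block = 0 }
        END { for (k in seen) if (seen[k] > 1) print seen[k], k }
    ' "$1" | sort
}

# Usage (path is an example only):
#   dup_escalations /usr/local/nagios/var/objects.cache
```

An empty result would suggest the repetition comes from how host_name and servicegroup_name expand rather than from duplicate definitions.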
Re: [Nagios-users] modify service macros
On 10/02/2012 11:38 AM, Marco Borsani wrote:
> Hi all
>
> I would like to modify $SERVICEDESC$ before Nagios uses it to send a notification. Due to internal procedure I can't modify the "service description" inside our configuration file.
>
> For example $SERVICEDESC$ is "a_b_check_tcp_port_1234" and I would change it to "tcp_port_1234" ("just" cut N chars at the beginning of the string).
>
> How can I do this?

Wrap notifications in a script of your own and cut as much as you like from whatever you like. There's no sensible way to do it from inside Nagios, since that will have ramifications for modules and everything else.

Notifications are (normally) not so frequent that wrapping them in an extra layer of execution will matter.

-- 
Andreas Ericsson andreas.erics...@op5.se
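A minimal sketch of the wrapper idea from the reply above, assuming the notification command passes $SERVICEDESC$ to the script as an argument. The helper name `strip_prefix` and the prefix length of 10 are illustrative and not part of the thread.

```shell
#!/bin/sh
# Hypothetical helper: drop the first N characters of a service description.
strip_prefix() {
    desc=$1
    n=$2
    # cut counts characters from 1, so keep everything from position n + 1
    printf '%s\n' "$desc" | cut -c "$((n + 1))-"
}

# Example: "a_b_check_" is 10 characters long
strip_prefix "a_b_check_tcp_port_1234" 10
```

A wrapper script would call this on its argument and then hand the shortened text to whatever program actually sends the notification.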
Re: [Nagios-users] host object declarations
On 10/02/2012 02:52 PM, Marki wrote:
> Hi there,
>
> I'd like to further discuss http://tracker.nagios.org/view.php?id=177 which is about host object declarations.
>
> You suggest using host_name as something that resolves. However, we don't have a (DNS) hostname for each device.

I'm suggesting what now? The only thing I'm saying in that bugreport is that Nagios does not and will not complain when the address fields of hosts are not unique.

> Also, directive description (Nagios documentation) says: This directive is used to define a short name used to identify the host. The description for the address directive actually needs something useful, again, see http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host
>
> You say one should switch to using them differently, i.e.
>   host_name = ip address or DNS hostname
>   alias = description
> because Nagios4 will do it that way (and Icinga already does).

Nagios 4 won't do that. The one and only change in Nagios 4 is that address will be set to the host_name variable if no address field is set. That means people who want to use dns names for their hosts don't have to write that info again in the address field.

> I guess I will then try to update my config that way:
> - use a DNS name (if one exists) in the host_name field, otherwise use the IP as host_name if there is no reverse lookup for it,
> - in that case I use a symbolic name in the display_name field,
> - and optionally a description in the alias field.
>
> Furthermore for devices that have no IP but should show as different hosts, define a virtual hostname with the IP address of the device's management station (that may be duplicate).

You can't have two hosts with the same host_name. You really don't need to change anything at all to make your configuration work the same way it always has.

> Anyway I'd really like to know what everyone thinks about this, and how you do it in a sensible way.

host_name is the short symbolic name that has to be unique but can be anything you want.

address is the field that (usually) has to hold an ip address or something that's resolvable as a dns hostname.

display_name and alias are equally useless and are only meant to provide some sort of help when receiving notifications or when viewing the UI.

You've been overthinking it and seriously misunderstood something. The way you think Nagios 4 will be working isn't correct.

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] host object declarations
On 10/02/2012 05:20 PM, Marki wrote:
>> The only thing I'm saying in that bugreport is that Nagios does not and will not complain when the address fields of hosts are not unique.
>
> In fact you also said: "that suggests that either your namestandard (used for the host_name field) sucks, or that you or your co-workers are simply confused when it comes to configuring Nagios."
>
> So let's talk best practices. In practice it did occur that when someone copy/pasted a bunch of service definitions (yeah I know you should use templates, call that confusion if you like) just forgot to update the address field and the newly created checks (or so they believed) did nothing more than check the old stuff once more.

So use the "require unique address" scriptlet I provided in the ticket. Nagios won't protect you against such config buglets, because for many users it's a very useful trick to do.

> According to what you suggest, it is wisest to indeed use DNS names where you can in the host_name field. (and only use the address field in special circumstances) (in Nagios4)

Or just not fsck up when adding hosts, perhaps?

> If you define your nomenclature like that, possible confusion may be eliminated ;-)

Could be.

> BTW I have all sorts of things in place that
> - check for duplicate addresses

So this should've been caught then.

> - check for syntax errors all over the place

Which Nagios already does for you, so that seems moot.

> - check if the plugin specified in command_line actually exists thus preventing stupid error 127 or alike

You don't get that anymore. With Nagios 4 you get

  (No output on stdout) stderr: execvp() failed. errno is 2: No such file or directory

> PS. As a sidenote: I once created a more or less functional version of a script allowing to change service names. It's not trivial but can be done. Does anyone know of something existing and working that allows to change service and also hostnames?

Well, python, perl, ruby, php, sed, awk, whathaveyou can do that with little or no hassle, so long as you know how to parse compound blocks into key/value vectors (which is trivial). Unless you're talking about modifying it at runtime, in which case it's a little trickier and requires an eventbroker module.

>     for f in $(grep command_line $NAGIOS/etc/checkcommands.cfg | awk '{print $2}' |\
>         grep USER1 | sed 's/\$USER1\$\///'); do
>         if [ ! -f $NAGIOS/libexec/$f ]; then
>             echo "[KO] $f does not exist"
>             ERR=$(($ERR + 1))
>         fi
>     done
>     [ $ERR -eq 0 ] && echo "[OK] all commands seem to exist"

This seems pretty error-prone. I've been thinking of adding something similar to the Nagios binary, to make sure all (used) commands have existing binaries and warn if some of them don't. It would be quite trivial, but I haven't gotten around to it yet as not that many people suffer from it for very long. Besides, with the special-purpose workers on the way it's quite likely we'll just trigger false positives by the dozen.

>     # make sure to always update both statements!
>     if [ $(grep -r address $NAGIOS/etc/hosts* | grep -v '~' | grep -v svn |\
>         grep -v '.bak' | cut -d: -f2 | tr -d ' \t' | awk -F'address' '{print $2}' |\
>         sort | uniq -c | sort -n | grep -v ' 1 ' | wc -l) -ne 0 ]; then
>         echo "[INFO] IP address defined more than once:"
>         grep -r address $NAGIOS/etc/hosts* | grep -v '~' | grep -v svn |\
>             grep -v '.bak' | cut -d: -f2 | tr -d ' \t' |\
>             awk -F'address' '{print $2}' | sort | uniq -c | sort -n | grep -v ' 1 '
>     else
>         echo "[OK] Dup IP check ok"
>     fi

That's just insanely tricky. Precache the objects and you can use

    grep address $precache | sort -k 2 | uniq -c | sort -n | grep -v ' 1 '

instead to list all the duplicate addresses.

-- 
Andreas Ericsson andreas.erics...@op5.se
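The duplicate-address check discussed above can also be written as a single awk pass over the precached object file. This is a sketch under assumptions, not the scriptlet from the ticket: `dup_addresses` is a hypothetical name, and it assumes each directive sits on its own line as `address <value>`, which is how the precached file is normally laid out.

```shell
#!/bin/sh
# List every address that is used by more than one host definition,
# prefixed by the number of times it occurs.
dup_addresses() {
    awk '$1 == "address" { count[$2]++ }
         END { for (a in count) if (count[a] > 1) print count[a], a }' "$1" | sort -n
}

# Usage (path is an example only):
#   dup_addresses /usr/local/nagios/var/objects.precache
```

Matching on the first field avoids picking up other lines that merely contain the word "address", which a plain grep would include.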
Re: [Nagios-users] Running the same plugin once vs. multiple times
On 10/02/2012 11:02 PM, Tech Support wrote:
> All;
>
> I wrote a plugin to collect data for a particular service. I invoke the plugin with whatever parameter I want to return information on. In fact, it collects information on about 30 different params at once, then returns the data that I specified. For example, ./get_sysinfo -p param1 or ./get_sysinfo -p param2, etc.
>
> It seems to me to be a waste of resources to call the same script 30 times. Is it possible to call the plugin once and return all the data (including the performance data) at the same time?
>
> The benefit that I have calling the same script 30 different times is that Nagiosgraph creates a separate graph for each parameter and I don't want to lose that ability.
>
> Any insight at all would be greatly appreciated.

Take a look at check_mk. It's doable but is a bit convoluted unless you write good infrastructure for it once and then reuse it over and over.

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Using contacts in host definition -- Bug ?
On 09/26/2012 09:25 AM, Frost, Mark {BIS} wrote:
> I believe this is a feature introduced in 3.3 or thereabouts. I've always found it very irritating and wish there was some way to turn off inheriting host contacts/contactgroups to services as it's never what I want.

It's a half misfeature. The intention was (and is) that services with no contactgroups OR contacts should inherit the ones from the host, but it was coded up so that if the service had contactgroups (but not contacts) it would inherit contacts (but not contactgroups) from the host. The same applied when the service had contacts but no contactgroups and the host had contactgroups.

In 4.0 this is fixed so only services with neither contacts nor contactgroups inherit them from the host.

-- 
Andreas Ericsson andreas.erics...@op5.se
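The difference between the two behaviours described above can be written down as a small decision function. This is only a model of the rules as stated in the reply, not Nagios code; the function name `inherited_fields` and its arguments are made up for illustration.

```shell
#!/bin/sh
# Print which contact fields a service picks up from its host.
#   $1 = "3" or "4"  (behaviour of Nagios 3.x or 4.0 as described in the thread)
#   $2 = the service's contacts (empty string if none)
#   $3 = the service's contact_groups (empty string if none)
inherited_fields() {
    version=$1
    svc_contacts=$2
    svc_groups=$3
    result=""
    if [ "$version" = "4" ]; then
        # 4.0: inherit only when the service defines neither field
        if [ -z "$svc_contacts" ] && [ -z "$svc_groups" ]; then
            result="contacts contact_groups"
        fi
    else
        # 3.x as described: each empty field is filled from the host separately
        if [ -z "$svc_contacts" ]; then
            result="contacts"
        fi
        if [ -z "$svc_groups" ]; then
            result="${result:+$result }contact_groups"
        fi
    fi
    echo "${result:-none}"
}

# A service with contact_groups but no contacts:
inherited_fields 3 "" "admins"
inherited_fields 4 "" "admins"
```

The two example calls show the case complained about in the thread: under the 3.x rule the service still picks up the host's contacts, under the 4.0 rule it inherits nothing.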
Re: [Nagios-users] Using contacts in host definition -- Bug ?
On 09/27/2012 02:37 PM, Frost, Mark {BIS} wrote:
> -----Original Message-----
> From: Andreas Ericsson [mailto:a...@op5.se]
> Sent: Thursday, September 27, 2012 3:08 PM
> To: Nagios Users List
> Cc: Frost, Mark {BIS}
> Subject: Re: [Nagios-users] Using contacts in host definition -- Bug ?
>
> On 09/26/2012 09:25 AM, Frost, Mark {BIS} wrote:
>> I believe this is a feature introduced in 3.3 or thereabouts. I've always found it very irritating and wish there was some way to turn off inheriting host contacts/contactgroups to services as it's never what I want.
>
> It's a half misfeature. The intention was (and is) that services with no contactgroups OR contacts should inherit the ones from the host, but it was coded up so that if the service had contactgroups (but not contacts) it would inherit contacts (but not contactgroups) from the host. The same applied when the service had contacts but no contactgroups and the host had contactgroups.
>
> In 4.0 this is fixed so only services with neither contacts nor contactgroups inherit them from the host.
>
> I still see that as a misfeature. I would rather have the preflight check tell me there's an error because I forgot to define contacts for the service than for it to assume that what I want is anything to be inherited from a host definition.
>
> I'd be OK if that were the default behavior and was configurable, but I'd be the first to disable that in nagios.cfg if it were a configurable preference.
>
> Yeah, I know, patches gleefully accepted :-).

Well, disabling object inheritance completely is pretty simple. Doing it on a per-variable basis is less so. I'm on the fence with this. On the one hand I really don't care what the default is. On the other hand I really don't want bugreports from people who're relying on that feature and suddenly realize it's gone missing.

As a workaround for your particular case, you could specify a catch-all service template which other service templates reference, where you set contactgroups and contacts to something that doesn't exist. That way you'll get your error and I can dodge bugreports ;)

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] Livestatus with Nagios 4
On 09/24/2012 10:30 AM, Mathias Kettner wrote:
> Hi Andreas,
>
> thanks for your work! We're quite busy so we had no time yet to even try out Nagios 4. Do you think it would be easily possible to have a common code base for N3 and N4 for Livestatus (kind of #ifdef)?

Since most Nagios headers are included from a single place, it shouldn't be that hard. I'm not sure I'll have time to look into it before the conference (as I haven't written my presentations yet), but I'll look into it. Nagios 3 will be default in that case, and Nagios 4 will have to be enabled explicitly. My autoconf-fu is pretty weak, but I think I could manage that.

> Also I've seen that you removed some columns that seem to not exist in N4. We'd need at least dummy columns here so that code relying on those columns will not break.

Good idea. I'll re-add them. Thanks for the feedback :)

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] [Nagios-devel] Nagios 4: Call for testing
On 09/15/2012 06:57 PM, Brian Seklecki wrote:
> On 9/14/2012 4:23 PM, Daniel Wittenberg wrote:
>> First build of the RPM's are below. So far we've been using them during testing but not in any kind of production environment so let me know if any issues/changes.
>>
>> https://github.com/dwittenberg2008/nagios/tree/master/rpms
>>
>> Dan
>
> We should get FreeBSD Ports, OpenBSD ports, and NetBSD pkgsrc makefiles prepped as well.
>
> Once it's in NetBSD pkgsrc, people on non-RHEL platforms can get exposure. That's where you'll run into all of your threading, preprocessor, and autoconf Makefile errors.

Threading has been removed, partially for this reason. It won't be an issue anymore.

> These will require a snapshot/tag tarball of SVN be published to a URL.
>
> Also, is nagios-3.99.95.tar.gz derived from a trunk/ svn co? What rev #?

Yes it is.
  git commit 7824598c1f9d4a6900253409523277c28433ffba
  svn revision @2216

It's positively ancient already though. I'm working several hours per day on Nagios 4 and I really can't chuck out a new version for every feature I add and every bug I fix.

> Or is it safe to assume that the rev # is within an hour of 09-14-2012 @ 14:16 GMT (mtime of the tarball)?

Nope. Or maybe. The git-svn conversion might do weird things to the timestamps. The change is listed as having happened 09:06:22 but on the same date in my svn-synced git repo, which means I did git-svn dcommit around then.

In the long run, we'd be better off putting up autobuild servers that build on each push and get their version from an scm with version numbers one can actually rely on (cough-git-cough).

-- 
Andreas Ericsson andreas.erics...@op5.se
Re: [Nagios-users] [Nagios-devel] Nagios 4: Call for testing
On 09/14/2012 10:23 PM, Daniel Wittenberg wrote:
> First build of the RPM's are below. So far we've been using them during testing but not in any kind of production environment so let me know if any issues/changes.
> https://github.com/dwittenberg2008/nagios/tree/master/rpms

Sweet! Thanks a lot :)

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Nagios 4: Call for testing
On 09/14/2012 02:09 PM, John Kennedy wrote:
> On Thu, Sep 13, 2012 at 6:00 PM, Andreas Ericsson a...@op5.se wrote:
>> Yes, you read it right. Nagios 4 is approaching, but it needs your help to reach the finish-line, so pretty please HELP!
>
> Is there a target date for release of Nagios 4?

Yes. It's very, very close, but also quite secret. If you've kept an eye on http://go.nagios.org/conference I'm fairly sure you can figure it out though ;)

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
[Nagios-users] Service parents...
... have arrived. The intention (and the way they currently work) is that they remove the need for servicedependencies like this:

    define servicedependency {
        host_name                       A
        service_description             Foo
        dependent_host_name             B
        dependent_service_description   Bar
        notification_failure_options    w,u,c
    }

Currently they totally ignore state hardness, and only care about the actual state.

You can have parents residing on a different host (in which case you have to specify "host_name,service_description" in the parents field, naturally), or you can have same-host parents, in which case you only need to specify the service description. The short form only works as long as there's a single parent, but since that's how 99.9% of the servicedependencies seem to work in the configurations people sent me after my "Call for configs", I guess that's reasonable for a first implementation anyway.

Speaking of which: thanks a bunch for the configs. You know who you are, although some of you didn't want me to name you in any thank-yous, so I'll refrain from posting the full, and surprisingly long, list of people. Your help was invaluable and helped me uncover many a bug that would otherwise have earned me a lot of hatemail.

For the (extremely) common "depend on a client version check" thing, that would look something like this:

    define service {
        use                   some-suitable-template
        host_name             A
        service_description   NRPE Disk /
        parents               NRPE Version Check
    }

although I expect you crafty folks will put such things in templates and be done with it very quickly.

Enjoy.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
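As a rough illustration of the two forms the parents field accepts, here is a minimal Python sketch. The function name resolve_parent and the tuple it returns are hypothetical and not taken from the Nagios source. It only mirrors the rule described above, and it assumes a single parent whose service description contains no comma.

```python
def resolve_parent(child_host, parents_value):
    """Return (host_name, service_description) for a single service parent.

    Hypothetical helper for illustration only, not Nagios code.
    """
    value = parents_value.strip()
    if "," in value:
        # Cross-host form: "host_name,service_description"
        host, description = value.split(",", 1)
        return host.strip(), description.strip()
    # Same-host form: only the service description is given,
    # so the parent lives on the same host as the child service.
    return child_host, value
```

With the example above, resolve_parent("A", "NRPE Version Check") points at the "NRPE Version Check" service on host A itself.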
Re: [Nagios-users] check_http throwing 141 exit on ssl error
On 09/14/2012 08:09 PM, Mike Lindsey wrote:
> I'm typically used to seeing this kind of error code for a missing plugin, but I've got a device that is accepting tcp connections and then, due to a local misconfiguration, immediately closing them. But rather than a normal critical I'm getting:
>
> (Return code of 141 is out of bounds)

SIGPIPE is signal number 13. When a program is killed by a signal, there is no ordinary exit code to report, so the shell reports the exit status as 128 + the signal number instead. 128 + 13 = 141.

As I read it back, I realize that doesn't exactly make super sense to anyone not familiar with how exit statuses are encoded, but I can assure you that's the reason.

When you get exit code 127 or 126 (below 128, but high), it's because the exec of the plugin failed and the shell is trying to signal it the only way it can. 127 means the command was not found: the plugin doesn't exist in the path specified, or one of the elements in the path leading to the plugin is missing. 126 means the file was found but could not be executed, typically because the Nagios user lacks execute permissions for it. Check out "man 2 execve" for the errors that can make the exec fail.

> When run by hand I have:
>
> root@ops-mon-nagios3 /usr/local/nagios/libexec $ ./check_http -H device.domain.com -w 10 -c 20 -S -p 83 -f follow
> CRITICAL - Cannot make SSL connection
> root@ops-mon-nagios3 /usr/local/nagios/libexec $ echo $?
> 141
>
> write(1, CRITICAL - Cannot make SSL conne..., 39) = 39
> write(3, \200w\1\3\1\0N\0\0\0 \0\0009\0\0008\0\0005\0\0\26\0\0\23\0\0\n\7\0\300..., 121) = -1 EPIPE (Broken pipe)
> --- SIGPIPE (Broken pipe) @ 0 (0) ---
> +++ killed by SIGPIPE +++

And there's the SIGPIPE. Case closed.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
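The arithmetic behind 141 can be sketched in a few lines of Python. This is only an illustration of the usual POSIX shell convention (128 + signal number for a signal death, 127 for "not found", 126 for "not executable"). The helper names status_for_signal and describe_status are made up for this example and are not part of Nagios or the plugins.

```python
import signal


def status_for_signal(signum):
    """Exit status a POSIX shell reports for a child killed by signum."""
    return 128 + int(signum)


def describe_status(status):
    """Give a rough, human-readable reading of a plugin exit status."""
    if status > 128:
        try:
            # Anything above 128 is read as "killed by signal (status - 128)"
            return "killed by " + signal.Signals(status - 128).name
        except ValueError:
            return "unknown status %d" % status
    if status == 127:
        return "command not found"
    if status == 126:
        return "found but not executable"
    return "exited with %d" % status
```

On Linux, where SIGPIPE is signal 13, status_for_signal(signal.SIGPIPE) gives 141, matching the "out of bounds" return code in the report above.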
[Nagios-users] Nagios 4: Call for testing
Yes, you read it right. Nagios 4 is approaching, but it needs your help to reach the finish-line, so pretty please HELP!

Right now, config parsing is a lot quicker. Check execution is done through workers, except on-demand host check execution, which is still handled by the serial model.

Code can be fetched from either of:

    svn co https://nagios.svn.sourceforge.net/svnroot/nagios nagios
    git://github.com/ageric/nagios.git

The people who provide me with the most valuable bug reports will be rewarded with a special prize, apart from a special mention at the Nagios World Conference 2012, happening in Minneapolis on September 25, 26, 27 and 28 this year (as it's 2012...).

Points will be given on:

* Accuracy of bugreport. If the bug is triggered by one of 13 trillion serviceescalations, you'll have an excellent chance if you produce an example config which reproduces the problem without creating a huge overhead.
* Patchiness of bugreport. If the bugreport contains valgrind output showing illegal memory access, or a backtrace of the error, it will be given extra credit. If it contains a minimal setup displaying the error as well as a patch fixing the error, you're almost sure to be in the very small Circle Of Winners.
* Surgeonry of bugreport. If a patch fixes a billion unrelated items, points will be taken from it.
* Bugreports of a slow Nagios accompanied by gprof or valgrind output will be given priority over other bugreports of a slow Nagios.

Note that only bugreports provided *before* 2012-09-24 will be considered, and also note that the winners will be announced at the Nagios World Conference 2012 (unless they explicitly ask me not to mention their names, in which case they can't even compete, really, but will probably get several shots from me if I ever meet them and *may* get a very special prize anyway).

Patches can be provided for Core Nagios, Livestatus, NDOUtils, Merlin, mod_gearman, dnx_utils or npcdmod. Each project gives extra points, provided it's universal. Patches that promote a specific installation directory are worthless and won't be considered.

So spedang! and get your hack on, folks :)

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] False alerts on http service
On 09/12/2012 07:03 PM, francis picabia wrote:
> We have used nagios successfully for many years and never seen a case like this. I cannot get the nagios service to see that the remote http service is up, although the check command indicates it is up and the remote apache log shows nagios visited with no error.
>
> The site to monitor runs webwork, a math quiz system. I have it set to redirect / to /webwork and also redirect insecure to https. At first I did a plain check_http. I switched to the -S option and added -u with the full URL to avoid hitting the redirects, so I can get a clean code 200 returned, in case that was muddling things. No difference.
>
> When I look at the apache log, I can see the visits from nagios. For the early morning visits, there is no one using the system, so it can't be unresponsive.
>
> Here is my check command:
>
> # 'check_www_ssl' command definition
> define command{
>         command_name    check_www_ssl
>         command_line    $USER1$/check_http -S -I $HOSTADDRESS$ -f follow -w 5 -c 20 -t 60 -u $ARG1$
>         }
>
> Here is my service:
>
> define service{
>         use                     generic-service
>         host_name               webwork
>         is_volatile             0
>         service_description     Webwork Web Service
>         check_command           check_www_ssl!'https://webwork.example.com/webwork/'
>         check_period            24x7
>         contact_groups          unix-admins
>         max_check_attempts      3
>         normal_check_interval   3
>         retry_check_interval    1
>         notification_interval   120
>         notification_period     24x7
>         notification_options    w,u,c,r
>         }

This is the service definition (will be relevant later)...

> Of course I have changed the actual domain to example.com in the above.

But you forgot to change it in the apache log ;)

> The alert report:
>
> * Nagios 3.2 *
> Notification Type: PROBLEM
> Host: webwork
> State: DOWN
> Address: 131.162.201.91
> Info: Server answer:
> Date/Time: Wed Sept 12 06:59:04 ADT 2012
>
> Here is a sample visit from nagios in the webwork apache log file before this time.
>
> XXX.YYY.2.50 - - [12/Sep/2012:06:58:50 -0300] GET https://webwork.acadiau.ca/webwork/ HTTP/1.0 200 5015 - check_http/v1.4.14 (nagios-plugins 1.4.14)
>
> Our apache logs show nagios is visiting every 3 minutes, 24 hours a day. None of these visits results in an error.
>
> In a nagios log, this is all that appears for webwork for the day:
>
> # grep webwork nagios-09-11-2012-00.log
> [1347246000] CURRENT HOST STATE: webwork;DOWN;HARD;1;Server answer:
> [1347246000] CURRENT SERVICE STATE: webwork;Webwork Web Service;OK;HARD;1;HTTP OK: HTTP/1.1 200 OK - 4053 bytes in 0.274 second response time

So according to these two, the service (which you're saying never turns OK) is OK, but the host itself appears to be down. I think you need to rethink

> If I do the check_http manually, I seem to get through fine:
>
> # /usr/lib/nagios3.2/libexec/check_http 0-S -I webwork -f follow -w5 -c 20 -t 60 -u https://webwork.example.com/webwork
> HTTP OK: HTTP/1.1 200 OK - 5162 bytes in 0.025 second response time |time=0.024700s;5.00;20.00;0.00 size=5162B;;;0
>
> Can anyone spot a reason why this alert is not set up properly or there is a better way to do it?

Examine the *host* check, not the service check, if you want to figure out why the host appears to be down.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Are neb_register_callback()s threadsafe?
On 09/05/2012 01:20 PM, Razvan Cojocaru wrote:
> Hello,
>
> is it safe to assume that the functions registered with neb_register_callback() will be called in a threadsafe manner,

That depends on what you mean by threadsafe. If you intend to use multiple threads in your own module, you'll have to take care not to do anything that upsets Nagios. If you mean "will multiple Nagios threads try to rip my data out from under my feet?", then no, but other modules might.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Heads up for config changes in Nagios 4
On 08/31/2012 04:06 AM, Yu Watanabe wrote:
> First time hearing about Nagios 4. Is it coming out soon?

In a couple of weeks is the general idea. People are alpha testing it already, but some more longtime tests are required before it goes live, and for full-on production use (with no other systems doing automated checking), I'd wait until Nagios 4.0.1, when we've ironed out any remaining bugs that we missed in testing.

op5 will do a release based on Nagios 4 in December, so I'll have bugfixing the core as my top priority from the release until then. Because of that, it's likely 4.0.1 will be out before Christmas. Unless 4.0 happens to be bugfree, of course. Not impossible, but very unlikely with 20k lines added and 30k lines removed.

In the meantime, testing of config parsing and the new check engine is of course very helpful. I'd prefer to get anomaly reports by mail to nagios-devel, with a...@op5.com on the cc-list so I don't miss out on anything serious.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
[Nagios-users] Heads up for config changes in Nagios 4
Currently, with Nagios 3, it's possible to have cyclic group includes, like this:

    define hostgroup{
        hostgroup_name      HG1
        alias               Hostgroup 1
        hostgroup_members   HG3
    }

    define hostgroup{
        hostgroup_name      HG2
        alias               Hostgroup 2
        hostgroup_members   HG1
    }

    define hostgroup{
        hostgroup_name      HG3
        alias               Hostgroup 3
        hostgroup_members   HG2
    }

With Nagios 4, that will generate an error.

Currently the behaviour is that the first hostgroup, as determined by alphabetical sorting, will get the members of its included groups, but the later ones in the sorting will not get the members of the previous ones, which is surprising to say the least.

This will work the same for all grouptypes.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
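For anyone who wants to check their own config for such loops before upgrading, the following is a minimal Python sketch of cycle detection over group includes. It is not how Nagios 4 implements the check. The function name find_group_cycle and the dict-based input are assumptions for illustration, and you would have to extract the hostgroup_members relations from your config yourself.

```python
def find_group_cycle(members):
    """Return one cycle of group names as a list, or None if there is none.

    members maps each group name to the list of groups it includes
    (the hostgroup_members directive). Hypothetical helper, not Nagios code.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, finished
    colour = {name: WHITE for name in members}
    path = []

    def visit(name):
        colour[name] = GREY
        path.append(name)
        for child in members.get(name, []):
            state = colour.get(child, WHITE)
            if state == GREY:
                # child is already on the current path: we found a loop
                return path[path.index(child):] + [child]
            if state == WHITE:
                cycle = visit(child)
                if cycle:
                    return cycle
        path.pop()
        colour[name] = BLACK
        return None

    for name in sorted(members):
        if colour[name] == WHITE:
            cycle = visit(name)
            if cycle:
                return cycle
    return None
```

Fed the three hostgroups above as {"HG1": ["HG3"], "HG2": ["HG1"], "HG3": ["HG2"]}, it reports the loop HG1, HG3, HG2, HG1.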
[Nagios-users] Call for configs
Hi everyone. Sorry for crossposting.

As some of you may have noticed, the Nagios 4 development is well under way and introduces some changes in how we parse object config. Primarily in order to make it a whole lot faster, which it's in dire need of at the moment.

Anyways.. I'm looking for Nagios configurations to help me verify that what I'm doing isn't breaking anything with regards to object config. I could generate configs (and I have), but there are so many various options that I can't possibly create all possible permutations that work all possible features of the Nagios configuration, so I've decided to fall back to lazy mode and see if anyone wants to step up and send me their configurations directly.

I've created a small scriptlet that will let you anonymize most of your configuration, which can be downloaded from here:

http://dl.dropbox.com/u/83439435/scripts/nagios-anonymizer.sh

It won't replace your *_name variables, because doing that in a sensible manner requires individually hashing all of them and then replacing them with their hashed versions, and verifying a config like that is just a huge pain in the butt, but addresses, email addresses, pager numbers, aliases and all command information is wiped completely clean. The resource.cfg file isn't even included, so any and all passwords or secrets stored there will be left where they are.

Thanks for helping out :)

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] check_snmp
On 08/27/2012 03:20 PM, Brad Beckenhauer wrote:
> I have got an app that requires us to monitor the swap file space and trigger an alert when the available swap (free space) falls below 500MB. Initially it seems pretty simple, but the check_snmp plugin does not seem to be working with ranges.
>
> Here is the first command and it is correctly reporting the free space on the system. Great, so I added some warning and critical limits, but the limits are not working the way I expected.
>
> Test 1: No limits, looks good.
> # ./check_snmp -H removed -m UCD-SNMP-MIB -C removed -o memAvailSwap.0
> SNMP OK - 751600 | UCD-SNMP-MIB::memAvailSwap.0=751600
>
> Test 2: This should return OK because the warning is outside the returned value, but it is reporting a WARNING.

You misunderstand. When using ranges, the given range defines the limits of the OK range, and anything outside that triggers the alert.

As per your explanation above, I would assume

    -c 50: -w 55:

would be reasonable values. That will cause a critical alert when the value is below 50 and a warning when it's below 55.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
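To make the range rule concrete, here is a small Python sketch of the threshold semantics as described in the Nagios plugin development guidelines ("start:end" is the OK range, a missing end means infinity, a bare number N means 0 to N, "~" means negative infinity, and a leading "@" inverts the test). The function names are hypothetical, and check_snmp's own parsing may differ in details, so treat this as a model of the rule rather than of the plugin.

```python
import math


def parse_range(spec):
    """Parse a threshold range string into (start, end, inside)."""
    inside = spec.startswith("@")  # "@" means: alert when INSIDE the range
    if inside:
        spec = spec[1:]
    if ":" in spec:
        low, high = spec.split(":", 1)
    else:
        low, high = "0", spec      # bare "10" means the range 0..10
    start = -math.inf if low == "~" else float(low or "0")
    end = math.inf if high == "" else float(high)
    return start, end, inside


def alerts(value, spec):
    """True if value should raise an alert for the given range."""
    start, end, inside = parse_range(spec)
    in_range = start <= value <= end
    return in_range if inside else not in_range


def check(value, warning, critical):
    """Return the state for value; critical takes precedence over warning."""
    if alerts(value, critical):
        return "CRITICAL"
    if alerts(value, warning):
        return "WARNING"
    return "OK"
```

With -w 55: -c 50:, a value of 751600 is inside both OK ranges and yields OK, 52 yields WARNING, and 49 yields CRITICAL.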
Re: [Nagios-users] Error compiling Nagios 3 on CentOS 6.2 minimal
On 08/17/2012 05:16 PM, Meyer, Bruce wrote:
> Using CentOS 6.2 (minimal)
> Nagios-3.4.1.tar.gz
>
> Running:
> ./configure --sysconfdir=/etc/nagios --localstatedir=/var/nagios --with-command-group=nagcmd --enable-embedded-perl

You'll want to stop using embedded perl. It's been removed from Nagios 4 (which still masquerades as Nagios 3.4.1, although not in the tarball), and it shouldn't be used if you want your Nagios installation stable and without memory leaks.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] A couple of Nagios NEB plugin API questions
On 08/09/2012 02:39 PM, Razvan Cojocaru wrote:
>> Yes you can, if you know just a tiny bit about Nagios. It's a pointer to either a host or a service. Which it is depends on the type of callback we're issuing.
>
> That was entirely uncalled for. Some of us prefer to ask similarly minded peers where we feel it's not unreasonable to do so, rather than go with the "the code is the documentation" attitude (which I am not putting down, to each his (or her) own). I thought it obvious both that I've done my homework, and that I am quite able to stand on my own two feet.

Apparently I wasn't as clear as I should have been. I didn't mean to be rude. Since you do seem like a clever guy who has indeed done his homework, I figured that question was more of a complaint along the lines of "this should probably be documented too".

> Thank you for answering,

You're welcome.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] A couple of Nagios NEB plugin API questions
On 08/09/2012 08:39 AM, Razvan Cojocaru wrote:
> Hello,
>
> I've read the NEB Module API document [1], and the nebstructs.h header, but I'd like more details about the members of nebstruct_notification_struct. The document shows the struct definition with no comments, and no example, and there are no comments for the individual members in the nebstructs.h header either.

Well, the individual members are fairly self-explanatory, don't you think? Ethan's big on mnemonic names, and for a very good reason.

You might want to check out some module and use it as an example though. mod_gearman provides a good resource, as do NDOUtils and Merlin. Livestatus is pretty nifty, but unless you're mentally equipped to follow the tons and tons of C++ boilerplate I advise against using it as anything but an excellent backend to your data.

At http://git.op5.org/git/?p=nagios/merlin.git;a=summary you'll find the merlin source-code. If you're not git-literate you might want to download a snapshot. module.c and hooks.c have most of the code that handles nebstruct type objects.

> Also, this article [2] suggests that you can pass configuration data to your NEB module with this syntax:
>
> broker_module=/usr/local/nagios/bin/testmodule.o config_file=/usr/local/nagios/etc/testmodule.cfg
>
> How does one access that configuration data from within the plugin? Via a dedicated callback? By accessing some global data structure within the Nagios core process that loads my module?

It's the char * argument to your module's nebmodule_init() function.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] A couple of Nagios NEB plugin API questions
On 08/09/2012 12:16 PM, Razvan Cojocaru wrote:
>> Well, the individual members are fairly self-explanatory, don't you think? Ethan's big on mnemonic names, and for a very good reason.
>
> Yes, you can glean what they're for from their names, but that's not what I was asking. It's one thing to know that a field called "flags" should probably be set to some OR-ed or AND-ed bits, and quite another to know what those bits should be or what behaviour they will trigger, for example.

The flags are variations for the callback. All nebstructs have them.

> And you can't quite glean even that much about what a "void *object_ptr" does or means.

Yes you can, if you know just a tiny bit about Nagios. It's a pointer to either a host or a service. Which it is depends on the type of callback we're issuing.

> I'll figure out what the docs don't say by reading the code.
>
>> You might want to check out some module and use as example though. mod_gearman provides a good resource, as does NDOUtils and Merlin. Livestatus is pretty nifty, but unless you're mentally equipped to follow the tons and tons of C++ boilerplate I advise against using it as anything but an excellent backend to your data.
>>
>> At http://git.op5.org/git/?p=nagios/merlin.git;a=summary you'll find the merlin source-code. If you're not git-literate you might want to download a snapshot. module.c and hooks.c has most of the code that handles nebstruct type objects.
>
> I'm already going that route, and yes, I am git-literate. Thanks for the suggestions! I didn't get to Merlin yet.
>
>>> broker_module=/usr/local/nagios/bin/testmodule.o config_file=/usr/local/nagios/etc/testmodule.cfg
>>>
>>> How does one access that configuration data from within the plugin? Via
>>
>> It's the char * argument to your module's nebmodule_init() function.
>
> Thanks! But the char * argument of nebmodule_init() passes the command line arguments for the module, i.e.:
>
> broker_module=/usr/local/nagios/bin/testmodule.o arguments

Yes. I thought your line was just wrapped because it was too long.

> However, I was asking about accessing data written in a regular Nagios .cfg file. The article I've quoted clearly implies that this can be done; it's possible that they're wrong, but if they're not, I'd like to know what "extern whattype *what" gives me access to the parsed Nagios configuration data.

Good question. I just access the global variables after Nagios has parsed them. In Merlin, the module command argument points to a configuration file, which Merlin has its own parser for. You can't add random variables to the Nagios configuration file though, since that'll make Nagios barf rather than passing them on to modules.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] strange behavior disabling checks
On 08/08/2012 03:16 PM, Marco Borsani wrote: Hi all Sometimes it happens that when I set active_checks_enabled 0 in a service and reload the configuration, Nagios does not recognize the instruction. The server still executes the check. It happens if you have enabled state retention data and disable active checks from the web interface and then enable them again. In that case, altering the configuration doesn't help, since the saved retention state still claims you've manually overridden the configuration. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
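One way to see whether retained state is overriding the configuration is to look at the retention file itself. The following is only a sketch: it assumes the retention file consists of "service { ... }" blocks of key=value lines that include modified_attributes and active_checks_enabled, and the sample data and function names are made up for illustration. Check the layout of your own retention file before relying on it.

```python
def parse_retention_services(text):
    """Return one dict per 'service { ... }' block (assumed file layout)."""
    services = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line == "service {":
            current = {}
        elif line == "}":
            if current is not None:
                services.append(current)
            current = None
        elif current is not None and "=" in line:
            key, value = line.split("=", 1)
            current[key] = value
    return services


def overridden_services(text):
    """List services whose retained state records a manual change."""
    return [
        (s.get("host_name"), s.get("service_description"),
         s.get("active_checks_enabled"))
        for s in parse_retention_services(text)
        if s.get("modified_attributes", "0") != "0"
    ]


# Hypothetical sample data, not taken from a real installation.
SAMPLE = """\
service {
host_name=web1
service_description=HTTP
modified_attributes=2
active_checks_enabled=1
}
service {
host_name=web1
service_description=SSH
modified_attributes=0
active_checks_enabled=1
}
"""

if __name__ == "__main__":
    for host, svc, active in overridden_services(SAMPLE):
        print(f"{host}/{svc}: retained active_checks_enabled={active}")
```

A service that shows up in this list is one where the retained value is likely to win over whatever the object configuration says.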
[Nagios-users] Just a heads up
Sorry for cross-posting. I just committed and pushed out the worker process changes to the Nagios core. They've passed basic testing and are more than ready for a wider audience. If you're using active checks in your network you should be seeing a huge increase in number of checks executed per second as well as reduced I/O load, memory usage and cpu usage on your Nagios server. Anyone interested in beta-testing should grab the latest svn trunk and get busy. Also... I'll be going on vacation tomorrow, so I'll be slacking enormously on patch review and my other co-maintainer duties. Live with it. It's just for three weeks, so I'm sure you can manage. If it's raining I might go online a short while, but don't count on it. Cheers -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Allow access to custom variables outside Nagios
On 06/15/2012 03:13 PM, VALIYEV Ruslan wrote: Hi all, We have custom variables like this: define host { use linux-server host_name host.foo.com address 192.168.1.44 _serverfunction Oracle EBS R12 Application Server _serveradmin Admins_Linux _serviceadmin Admins_Oracle _userorg ACME } Now we have our own script to send out the notifications. And, apparently, custom variables are not exported outside Nagios for safety. That's $USER..$ macros, and they're not exported in the UI for security reasons (although some of them find their way there anyway through plugin output). To give an example of when we're able to get the var data: my $o_hostname = $ENV{NAGIOS_HOSTNAME}; And this one doesn't return anything: my $o_hostfunction = $ENV{NAGIOS__HOSTSERVERFUNCTION}; First of all, you'd be better off not using environment macros at all. When you do use them, Nagios has to calculate ALL macros instead of just the ones you use, and many of them are quite expensive. Secondly, try replacing your script with something that echoes the output of 'set' and 'env' to a tempfile and see what you get there. It's possible the documentation regarding their names is wrong. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
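The "dump the environment to a tempfile" suggestion could look roughly like this in Python. It is a sketch only: the function names and the temp-file prefix are invented here, and it assumes nothing beyond the NAGIOS_ prefix already visible in the Perl lines above.

```python
import os
import tempfile


def nagios_env(environ=None):
    """Return only the variables whose names start with NAGIOS_."""
    environ = os.environ if environ is None else environ
    return {k: v for k, v in environ.items() if k.startswith("NAGIOS_")}


def dump_env(environ=None, directory=None):
    """Write the NAGIOS_* variables to a new temp file and return its path."""
    found = nagios_env(environ)
    fd, path = tempfile.mkstemp(prefix="nagios_env_", suffix=".txt",
                                dir=directory, text=True)
    with os.fdopen(fd, "w") as handle:
        for name in sorted(found):
            handle.write(f"{name}={found[name]}\n")
    return path


if __name__ == "__main__":
    # Temporarily use this in place of the real notification script,
    # trigger a notification, then read the file it names.
    print(dump_env())
```

Whatever names appear in the resulting file are the ones the notification script can actually read, regardless of what the documentation says.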
Re: [Nagios-users] check_procs returning wrong data
On 06/17/2012 03:47 PM, Axel Amigo Arnold wrote: Hello, I am using nagios plugins 1.4.15 and I have a question about the check_procs plugin. If I execute it as *root*: root@localhost[/usr/local/nagios/libexec]# ./check_procs -w 50 -c 100 PROCS CRITICAL: 126 processes Now if I do it as *nagios user*: nagios@localhost[/usr/local/nagios/libexec]# ./check_procs -w 50 -c 100 PROCS OK: 2 processes Obviously it gives me the result of doing *ps* as the selected user, and I don't want this to happen. From the remote Nagios Server using NRPE, I also get the wrong result of 2 processes. Is there a fix for this? Do I have to write a script myself? The only fix is to run check_procs as root, or make the plugins suid root. Writing your own script won't work either, since it's the kernel blocking access to other users' processes. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
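To confirm that the restriction sits below the plugin, one can count the processes each user is able to see without going through check_procs or ps at all. The sketch below assumes a Linux-style /proc with one numeric directory per visible process; the function name is made up for illustration.

```python
import os


def visible_pids(proc_root="/proc"):
    """Return the sorted PIDs that the current user can list under proc_root."""
    pids = []
    for name in os.listdir(proc_root):
        if name.isdigit():  # process directories are named by PID
            pids.append(int(name))
    return sorted(pids)


if __name__ == "__main__":
    if os.path.isdir("/proc"):
        print(f"uid {os.getuid()} sees {len(visible_pids())} processes")
    else:
        print("no /proc here; this sketch assumes a Linux-style procfs")
```

Running it once as root and once as the nagios user should show the same gap as the two check_procs runs if the kernel is hiding other users' processes.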
Re: [Nagios-users] check_procs returning wrong data
On 06/18/2012 11:51 AM, Axel Amigo Arnold wrote: Hi Andreas, thank you for the response. I already have the suid bit activated in the check_procs binary as you can see here (I just copied the values of check_icmp) -r-sr-sr-x The user for this binary is root, and the group is nagios (just as check_icmp), but I still can't access the total process list. In the command definition, should I make something like this? command[check_total_procs]=/usr/local/nagios/libexec/sudo check_procs -w 150 -c 200 That might help. Try and find out. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Large scale installation
Re: [Nagios-users] Nagios 3.4.1 keeps sending notifications for hosts and service(s) in scheduled shutdown time
On 06/05/2012 08:24 AM, Mark Elsen wrote: On Mon, Jun 4, 2012 at 1:48 PM, Randal, Phil phil.ran...@hoopleltd.co.uk wrote: Does the patch in http://tracker.nagios.org/view.php?id=338 fix the issue (notifications during scheduled downtime) for you? Cheers, Phil Initial testing seems to indicate that it does. I'm looking at that patch now and intend to apply it (or some other version of it) to the core. One question though: The original "return early" patch was meant to fix a problem where contacts were being re-notified of downtime when Nagios restarted in the middle of a downtime period. Is that problem still fixed with the patch found in the issue tracker? -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Nagios 3.4.1 keeps sending notifications for hosts and service(s) in scheduled shutdown time
Thanks for the verifications, both of you. I'll push the changes upstream later today. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Manually run custom command via nagios dashboard possible?
top-post. don't Please On 05/25/2012 05:19 PM, Ajay Jethani wrote: Is there a recommended or a sample framework (outside of nagios) that would be useful in implementing that page or is the page that it would point to something that is mostly home-grown by developers? The latter. On Fri, May 25, 2012 at 10:21 AM, Andreas Ericsson a...@op5.se wrote: On 05/25/2012 03:29 PM, Ajay Jethani wrote: I'd like the nagios dashboard to allow me to run commands MANUALLY when I see there is an issue. For example, when a service goes into the 'CRITICAL state' I would check the dashboard and do a sanity check by looking at the performance output and perhaps even log onto the machine. I'd like to keep the dashboard open and click a hyperlink that executes a command that I choose on the remote machine. I know there is the concept of event handlers which can AUTOMATICALLY execute a command when a service check returns a failure, but I'd prefer to do this type of healing manually. Does nagios offer an ability to run a custom command/script I choose via the dashboard? No. That would be akin to building a secure shell client into Nagios, which is something we'll most likely never (ever) do. Most people who've wanted to do similar things make a separate page for it, point action_url or notes_url to that page, and then handle stuff in a script of their own. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Loosing NAGIOS_* env variables in 3.4.1
On 05/25/2012 02:52 PM, Mark D. Nagel wrote: On 5/24/2012 11:59 PM, Mark Elsen wrote: - I have some 'home-written' notification scripts launched from NAGIOS ; now when using nagios 3.4.1 , all NAGIOS_* variables are gone. (printenv listing). + Make sure you're not using large_installation_tweaks and that you have + enable_environment_macros=1 set in nagios.cfg. Tx! I explicitly needed : enable_environment_macros=1 In nagios.cfg when using 3.4.1 , to get my problem resolved. I wasn't using this setting before (nor was it set to 0). It sure would be neat if you could enable environment macros per command. Then you could have the default be off to get better performance for checks, but still get them for notification commands, which is often the main place they are useful. They're hardly useful there either, tbh, and if you really need them you can just put them up as command-line macros before the command you wish to run, like so: NAGIOS_SOMEVAR=$NAGIOS_SOMEVAR$ /path/to/something -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
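A notification script that relies on variables set this way is easier to debug if it fails loudly when one of them is absent. The sketch below is an assumption-laden illustration: the two variable names, the message format and the function names are chosen here, and the command definition is assumed to set them in the style shown above, for instance NAGIOS_HOSTNAME=$HOSTNAME$ NAGIOS_HOSTSTATE=$HOSTSTATE$ /path/to/notify.py.

```python
import os
import sys

# Variable names this hypothetical script expects the command line to set.
REQUIRED = ("NAGIOS_HOSTNAME", "NAGIOS_HOSTSTATE")


def build_message(environ, required=REQUIRED):
    """Build the notification text, or raise if a variable is missing or empty."""
    missing = [name for name in required if not environ.get(name)]
    if missing:
        raise ValueError("missing environment variables: " + ", ".join(missing))
    return f"{environ['NAGIOS_HOSTNAME']} is {environ['NAGIOS_HOSTSTATE']}"


def main(environ):
    """Return a process exit code: 0 on success, 1 if variables are missing."""
    try:
        print(build_message(environ))
    except ValueError as err:
        print(err, file=sys.stderr)
        return 1
    return 0
```

In a real script the last line would be sys.exit(main(os.environ)), so that a missing variable shows up as a failed command rather than as an empty notification.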
Re: [Nagios-users] Manually run custom command via nagios dashboard possible?
On 05/25/2012 03:29 PM, Ajay Jethani wrote: I'd like the nagios dashboard to allow me to run commands MANUALLY when I see there is an issue. For example, when a service goes into the 'CRITICAL state' I would check the dashboard and do a sanity check by looking at the performance output and perhaps even log onto the machine. I'd like to keep the dashboard open and click a hyperlink that executes a command that I choose on the remote machine. I know there is the concept of event handlers which can AUTOMATICALLY execute a command when a service check returns a failure, but I'd prefer to do this type of healing manually. Does nagios offer an ability to run a custom command/script I choose via the dashboard? No. That would be akin to building in a secure shell client into Nagios, which is something we'll most likely never (ever) do. Most people who've wanted to do similar things make a separate page for it, point action_url or notes_url to that page, and then handle stuff in a script of their own. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] is Multiple instances of broker_module possible?
On 05/18/2012 03:43 AM, Sugianto So wrote: Hi All, Since I'm having a constant database crash with my centreon+nagios installation, I'm thinking of running 2 brokers, each feeding data to a different server. Is running multiple instances of broker_module possible? It should be. Try and find out. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Loosing NAGIOS_* env variables in 3.4.1
On 05/24/2012 07:48 AM, Mark Elsen wrote: - I have some 'home-written' notification scripts launched from NAGIOS ; now when using nagios 3.4.1 , all NAGIOS_* variables are gone. (printenv listing). How come ? How can I get them back ? Make sure you're not using large_installation_tweaks and that you have enable_environment_macros=1 set in nagios.cfg. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Nagios 3.4.0 double quote in command bug (fixed)
On 05/15/2012 03:49 PM, Dennis Kuhlmeier wrote: Hi Phil, On 15.05.12 15:24, Randal, Phil wrote: Hi Dennis, Can you check out Nagios 3.4.1 and see if this fixes the problem? sorry for not checking prior to posting. Anyway, I built the new version, btw., why can't you just build the RPM from the tar? This is what I had to do on RHEL5: # tar xzf nagios-3.4.1.tar.gz # mv nagios nagios-3.4.1 # cd nagios-3.4.1 # vim nagios.spec change datarootdir to datadir in configure statement # cd .. # tar czf nagios-3.4.1.tar.gz nagios-3.4.1/ # rpmbuild -tb nagios-3.4.1.tar.gz Tada, build works. Back to the problem. Installed the update, doesn't help. On my sample host with 31 checks only 18 get submitted via Perfdata. Back to nagios-3.2.3 again and all 31 checks are processed. Please tell me if you need any more details, performance data template configuration or something like that. This is a slightly older bug about perfdata not being processed when there is no perfdata. I'll see if I get time to take a look at it. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Nagios 3.4.0 double quote in command bug (fixed)
On 05/13/2012 10:44 PM, Randal, Phil wrote: I've uploaded a patch to http://tracker.nagios.org/view.php?id=332 Which fixes the double quote issue and my original compile problem. Thanks. The double-quote issue has been resolved by reverting the faulty patch though. I'll see if I get time to take a look at the compile problem tomorrow. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] nagios backup
On 05/12/2012 01:18 PM, Luis H. Forchesatto wrote: Backup done successfully. All hosts are imported and being monitored, but a few things are not working, like SMS messages and mail messages. Does it need to be backed up from some directory? That's most likely due to system settings and probably has nothing to do with Nagios. I assumed you understood that the new system has to be capable of sending emails and text messages as well if you want that functionality to work. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] nagios backup
On 05/11/2012 03:12 PM, Luis H. Forchesatto wrote: Hi Does someone know the correct backup and restore procedure to export nagios 3.0.6 from one server to another nagios 3.2.1 server? rsync. Nothing important's changed in object config, and the rest will just be simple and quick manual tweaking of nagios.cfg, if any at all. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Performance data not being returned
On 05/10/2012 06:04 PM, Jonathan Gazeley wrote: On 10/05/12 16:39, C. Bensend wrote: I've narrowed it down to a stage where running the plugin directly returns the right results, but running the plugin through check_nrpe on localhost returns this: [jg4461@dhcp1 log]$ /usr/lib64/nagios/plugins/check_nrpe -H localhost -c check_dhcpd_pools OK - all pools less than 80% full | What could cause NRPE to truncate the results in such a way? Too much data? Are you using SSL? I don't know that I've seen this behavior before - it's always been *invalid* perfdata that have caused this issue for me. I am using SSL in production, although not for testing. Both cases return the same truncated results. The total status + performance data returned from this plugin is only 500-and-something bytes, less than the limits for NRPE that I'd heard about. I have no real clue about that, but perhaps it's a newline thing, since the output and pipe are shipped in output while the perfdata doesn't seem to be. I'm wondering if it might be something to do with the embedded perl interpreter. Does NRPE use this to execute perl-flavoured plugins? It does not. Only plugins executed directly by Nagios can be affected by embedded perl. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
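The newline guess above can be checked on the raw plugin output before NRPE is involved. This is a sketch under a stated assumption: the first line of output is treated as "status text | perfdata" and anything after the first newline is ignored, which is a simplification made here and not a description of what NRPE actually does. The function name is invented.

```python
def split_plugin_output(output):
    """Split the first line of plugin output into (status text, perfdata)."""
    first_line = output.splitlines()[0] if output else ""
    text, _sep, perfdata = first_line.partition("|")
    return text.strip(), perfdata.strip()


if __name__ == "__main__":
    # Perfdata on the same line as the pipe survives the split ...
    print(split_plugin_output("OK - all pools fine | pool1=10;80;90\n"))
    # ... but perfdata printed after a newline is lost, leaving a bare pipe.
    print(split_plugin_output("OK - all pools fine |\npool1=10;80;90\n"))
```

If the plugin run by hand shows a line break between the pipe and the first perfdata item, that would match the truncated result seen through check_nrpe.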
Re: [Nagios-users] Confusion on nrpe
On 05/07/2012 09:06 PM, Meyer, Bruce wrote: Using a mac That's beside the point. Don't hijack someone else's thread if you want your question answered, since threaded view of the mailing list makes it appear as if you're responding to someone else's question when you take their mail, hit reply and then replace all text with your own. Since you're the one who wants something, it's in your interest to make it easy for the people you want something from to give that to you. Hijacking a thread means you're off to a bad start, so don't. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
Re: [Nagios-users] Confusion on nrpe
Replying in-line, as is the common form on this list.

On 05/09/2012 03:33 PM, Meyer, Bruce wrote:

Hijacking a thread? What are you talking about? If I had maintained someone's subject but changed the content, that is hijacking a thread. Merely hitting 'reply', deleting the subject and replacing it, and deleting the body and writing my own does not seem to me to qualify as hijacking a thread. That is creating a new thread, by changing the subject and body. Or am I completely missing something?

I do sort by thread on my machine and my emails are perfectly sorted. Perhaps you should start sorting by subject instead of X-header or whatever you are threading on now. I would love to have learned something about what you are talking about, but so far, you haven't explained a thing, merely complained. If I want it further threaded, then it is grouped by the subject. What on earth are you threading by (and what amazing email client are you using) that causes my repurposing an email the way I do to cause a problem, X-Header attributes?

If you have a problem with a person's actions, you should have taken the reply immediately offline, and not sent it to the group. You should also explain why doing what I do causes a problem, and not assume that what I do is intentional to cause a problem. Maybe I am not born with all knowledge and experience in all things mailing list known to man, as you apparently are. Some people try to teach and share knowledge.

Since you insist, I will continue here so we can all learn from your complaint and eventual (hopefully) explanation as to how my process causes problems. BTW, you are the first person ever to have mentioned that doing it this way is causing a problem. I'll keep the thread here instead of taking it offline. This way if anyone else does what I do, we can all learn why this causes a problem.

Maybe we can get the owner of the mailing list to place a link or footer in each posting that says what address to send emails to. Then people like me who don't know could just copy and paste that email address into new emails instead of hitting reply and changing subject and body and letting it fly. That is a fairly common thing in newsgroup postings, probably for this exact reason. I eagerly await your instructional and helpful reply.

And here it is: When someone mentions, without malice, that you've done something wrong, examine your actions closely before claiming that those educating you are ignorant or wrong. If you can't find any fault in how you did things, ask nicely for an explanation. It's quite likely others know something you don't, and unless you actually *are* the reigning deity on the subject at hand, showing some humility is always a good idea.

Hijacking a thread is not a big deal. Trying to correct us because we say it's wrong is however a very bad idea when you want our help. Hope that helps in your future endeavours.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
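Neither message in this exchange spells out the mechanism, so as a hedged aside: threading mail clients and list archives commonly group messages by the In-Reply-To and References headers rather than by subject. Hitting 'reply' keeps those headers even when the subject and body are replaced. The sketch below uses Python's standard email package; the addresses, Message-IDs and subjects are made up for illustration, and the thread_roots helper is hypothetical, not any particular client's algorithm.

```python
from email.message import EmailMessage


def make_message(msg_id, subject, in_reply_to=None):
    """Build a minimal message; in_reply_to mimics what 'reply' adds."""
    msg = EmailMessage()
    msg["Message-ID"] = msg_id
    msg["Subject"] = subject
    if in_reply_to is not None:
        # A mail client sets these automatically when you hit 'reply',
        # and they stay even if you rewrite the subject and body.
        msg["In-Reply-To"] = in_reply_to
        msg["References"] = in_reply_to
    return msg


def header(msg, name):
    """Return a header as a plain string, or None if it is absent."""
    value = msg[name]
    return None if value is None else str(value).strip()


def thread_roots(messages):
    """Map each Message-ID to the root of its thread via In-Reply-To."""
    parent = {header(m, "Message-ID"): header(m, "In-Reply-To") for m in messages}
    roots = {}
    for msg_id in parent:
        root = msg_id
        while parent.get(root) is not None:
            root = parent[root]
        roots[msg_id] = root
    return roots


# Hypothetical messages: an existing thread, a 'reply' with a new subject,
# and a genuinely new message with the same new subject.
original = make_message("<1@example.org>", "Some existing thread")
hijack = make_message("<2@example.org>", "Confusion on nrpe",
                      in_reply_to="<1@example.org>")
fresh = make_message("<3@example.org>", "Confusion on nrpe")

roots = thread_roots([original, hijack, fresh])
for msg_id, root in roots.items():
    print(msg_id, "is filed under", root)
```

In this sketch the repurposed reply is still filed under the old thread, while the message composed from scratch starts its own, which is why sending a new mail to the list address avoids the problem.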
Re: [Nagios-users] Nagios 3.4.0 compilation error (and trivial fix)
On 05/09/2012 05:15 PM, Randal, Phil wrote:

I should have mentioned that that error only occurs if you try to compile Nagios 3.4.0 with embedded Perl support.

There's also no guarantee that it'll work properly even with that patch.

I had problems with check_mk and Nagios 3.4.0, but unfortunately, I haven't got the time or test environment to get to the bottom of it.

check_mk is written in Python though, so it won't be affected by embedded Perl. Not sure you were saying that. I just wanted to clarify.

Some users (on nagios-devel@) have reported a failure to run checks with double quotes in them. It appears the execve() patch caused it. If that's your problem too, you should report it here. I have a half completed patch for it that'll be ready this weekend or something, but it would be nice with some corroboration.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
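The failing patch itself is not shown in this thread, so the following is only an illustration of the general pitfall, not the Nagios code. When a command line is handed to a shell, the shell interprets double quotes. When a program instead builds an argument vector for an exec-style call, it has to do that quote handling on its own. The plugin path below is an assumption, and plain whitespace splitting stands in for whatever the real patch did.

```python
import shlex

# Hypothetical check command with a double-quoted argument.
command_line = '/usr/lib/nagios/plugins/check_dummy 0 "All is well"'

# Splitting on whitespace only: the quotes are kept as literal characters
# and the quoted text is broken into three separate arguments.
naive_argv = command_line.split()

# Shell-like splitting: the quoted text stays one argument, quotes removed.
shell_like_argv = shlex.split(command_line)

print(naive_argv)
print(shell_like_argv)
```

A plugin started with the first argument vector would see '"All', 'is' and 'well"' as three arguments, so any check whose command line relies on double quotes could misbehave in this way.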
Re: [Nagios-users] Nagios 3.4.0 compilation error (and trivial fix)
On 05/09/2012 06:23 PM, Mike Guthrie wrote:

Hey Phil, Thanks for the clarification on this. I was trying to figure out why we hadn't come across this issue on our local dev environments. Maybe some other users can chime in on this, but my understanding was that compiling with the embedded perl option tends to be problematic due to memory leaks and other unpredictable behaviors. (??)

It is, most of the time, but the code is still in there so it's sort of supposed to keep working. Personally, I'd rather remove the whole thing, since it's such a mess 99% of the time.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] Any upcoming release?
On 05/09/2012 07:45 PM, Frost, Mark {BIS} wrote:

-Original Message-
From: Andreas Ericsson [mailto:a...@op5.se]
Sent: Thursday, April 05, 2012 4:34 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Any upcoming release?

On 04/05/2012 07:31 AM, Yu Watanabe wrote:

Hi all! I would like to know if there are any plans for the nagios v 3.4.x. It has been a while since the last release so I was very curious about it.

There is. Nagios 3.4 will be a single-threaded and event-driven application that sports an I/O-broker and vastly improved check performance. In essence, we've removed 2 fork() calls, 4 disk searches, 2 filewrites and 2 filereads from each check being performed. There's also a fixed usage of the current scheduling queue implementation which turns scheduling new checks from its current O(n) behaviour to O(1). This will provide a huge benefit for large installations, and combined with the worker process code we're currently seeing a 12-fold increase in the amount of checks Nagios can execute, but it's still too early to tell what other things are affected. The external command pipe might be a bottleneck if one uses large amounts of passive checks, for example. It's currently in late alpha, so beta releases should be available in a month or so.

-- Andreas Ericsson andreas.erics...@op5.se

Andreas, I'm a little confused about this. I've been eagerly awaiting these gee-wiz-bang space-age changes, but when I looked over the change list for 3.4.0 that Ethan sent they seem like mostly minor changes. Or perhaps they just don't describe things in enough detail to match up the rather significant architectural changes listed above. Is this the super-summarized bullet item that refers to the change above?

ENHANCEMENTS
* Use execv() to execute active check commands (#86 - Ton Voon, dnsmichi)

Nopes, it's not, and that patch is actually broken. My code still leaks (about 1MB per 24 hours with 1000 checks / second), so I've held it back a bit.
I didn't know they were going to hit the release button so fast, and without a beta period. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] Any upcoming release?
Bleh. It's too late to sit up and rebase 160-odd patches on top of 20 to fix a single broken one.

For those using the 3.4.0 release, I've cherry-picked all patches that went in the 3.4.0-release apart from the broken execvp patch and added them on top of the worker-process code. If anyone's interested in trying them out, you can get them from git://github.com/ageric/nagios.git or possibly get a snapshot somewhere at https://github.com/ageric/nagios

Consider it 3.5.0-alpha or something. It still needs a lot of testing, but it's been ticking along quite nicely in our lab the past 4 months, running checks and generally just working with livestatus and Merlin enabled. Granted, that part of our lab doesn't use passive check results, but most of the rest is tested and known to work pretty well, apart from the small memory leak mentioned below (but who knows? Someone might even fix that now that it's in the open).

Happy testing. I'll get a proper fix in for the double-quote issue on Friday, as I've got presentations to run all day tomorrow.

On 05/09/2012 09:53 PM, Andreas Ericsson wrote:

On 05/09/2012 07:45 PM, Frost, Mark {BIS} wrote:

-Original Message-
From: Andreas Ericsson [mailto:a...@op5.se]
Sent: Thursday, April 05, 2012 4:34 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Any upcoming release?

On 04/05/2012 07:31 AM, Yu Watanabe wrote:

Hi all! I would like to know if there are any plans for the nagios v 3.4.x. It has been a while since the last release so I was very curious about it.

There is. Nagios 3.4 will be a single-threaded and event-driven application that sports an I/O-broker and vastly improved check performance. In essence, we've removed 2 fork() calls, 4 disk searches, 2 filewrites and 2 filereads from each check being performed. There's also a fixed usage of the current scheduling queue implementation which turns scheduling new checks from its current O(n) behaviour to O(1).
This will provide a huge benefit for large installations, and combined with the worker process code we're currently seeing a 12-fold increase in the amount of checks Nagios can execute, but it's still too early to tell what other things are affected. The external command pipe might be a bottleneck if one uses large amounts of passive checks, for example. It's currently in late alpha, so beta releases should be available in a month or so. -- Andreas Ericsson andreas.erics...@op5.se Andreas, I'm a little confused about this. I've been eagerly awaiting these gee-wiz-bang space-age changes, but when I looked over the change list for 3.4.0 that Ethan sent they seem like mostly minor changes. Or perhaps they just don't describe things in enough detail to match up the rather significant architectural changes listed above. Is this the super-summarized bullet item that refers to the change above? ENHANCEMENTS * Use execv() to execute active check commands (#86 - Ton Voon, dnsmichi) Nopes, it's not, and that patch is actually broken. My code still leaks (about 1MB per 24 hours with 1000 checks / second), so I've held it back a bit. I didn't know they were going to hit the release button so fast, and without a beta period. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.