[Nagios-users] Dependencies in redundant networks and services
I'm trying to sort out setting up dependencies for services and networks which contain redundancies, and it *looks* like I may be out of luck. I'm hoping someone can comment... the googling I've done has raised one question about this in the past which received no answer. The documentation for service dependencies is pretty clear that if service A is dependent on service B and C, then if either service B or C fails, service A will be treated as blocked (depending on the defined failure criteria). Does anyone have a working solution where service A is able to use either B *OR* C in order to perform its function? In this case, there'd need to be a way to tell Nagios that service A is blocked only if service B and C are both failed. The documentation is less clear about how multiple parents are handled, but given that host dependencies are documented to behave in the same way as service dependencies, it stands to reason that parents use the same logic... that if host A has two parents (routers B and C) then either router B or C being down would result in host A being considered unreachable. Again, I'm looking for a way to define a situation where host A can be reached via either router B or router C, and should only be considered unreachable if both routers are down. Does anyone have any suggestions for dealing with this? Thanks! -- What Every C/C++ and Fortran developer Should Know! Read this article and learn how Intel has extended the reach of its next-generation tools to help Windows* and Linux* C/C++ and Fortran developers boost performance applications - including clusters. http://p.sf.net/sfu/intel-dev2devmay ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dependencies suppressing execution but not notification
No responses. Was it because it looked overcomplicated, too long, just something? Spelling cortical provided by iPhone. On Jul 25, 2010, at 7:45 PM, Litwin, Matthew mlit...@stubhub.com wrote: Hi, Here is hopefully a more clear explanation of the problem I posted last night. I have configured dependancies to suppress notifications when the master dependency is not OK or WARNING and while I am finding they work in suppressing command execution, notification suppression doesn't seem to be happening as the alerts keep coming for them. This is an example of the problem: Setup: Apache_Webhits service depends on Apache_Status service being OK or WARNING for both execution and notification Apparent order of events: 1. Apache_Webhits returned UNKNOWN because apache was probably down 2. Apache_Status soon goes CRITICAL because it definitely is down alright 3. Nagios stops further execution of Apache_Webhits 4. In the GUI I turn off notifications for Apache_Status for that server until the problem is fixed. Problem: Apache_Webhits continues to send out notifications that it is CRITICAL though I would have expected the behavior to suppress further notification, but it clearly is just sending notifications as usual. Is that expected behavior? Based on my understanding I would thing further notifications would cease in this case. My configs: One this that I did that was unusual mas making a service dependency template to try to streamline my services.cfg so I didn't need to use a dependencies.cfg. It seemed to work just fine but I am mentioning it because it doesn't seem to be explicitly documented. Here is the template: define servicedependency { namegeneric-dependency execution_failure_criteria c,u,p notification_failure_criteria c,u,p inherits_parent 1 register0 } and here is the example described above: # Apache_Webhits define service{ use standard-app-service hostgroup_name Apache service_description Apache_Webhits servicegroups Apache_Webhits check_command check_webhits_nrpe } define servicedependency { use generic-dependency hostgroup_name Apache dependent_service_description Apache_Webhits service_description Apache_Status } -- The Palm PDK Hot Apps Program offers developers who use the Plug-In Development Kit to bring their C/C++ apps to Palm for a share of $1 Million in cash or HP Products. Visit us here for more details: http://ad.doubleclick.net/clk;226879339;13503038;l? http://clk.atdmt.com/CRS/go/247765532/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- The Palm PDK Hot Apps Program offers developers who use the Plug-In Development Kit to bring their C/C++ apps to Palm for a share of $1 Million in cash or HP Products. Visit us here for more details: http://p.sf.net/sfu/dev2dev-palm ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Dependencies and notifications
Hello everyone, could I please ask for some assisntance / am-I-off-my-rocker soft of advice I am working with using dependencies and I am having a problem that while dependencies are successfully suppressing the execution of monitors because their dependencies are not being satisfied which is what I expect, however notifications are continuing to go out, even though they have the same criteria, which is NOT what I expect because they are the same criteria: execution_failure_criteria c,u,p notification_failure_criteria c,u,p The monitors that were held in the PENDING state because their dependency was never satisfied since the monitor was created are not sending notifications, but the ones that managed to squeak in a CRITICAL state read before further checks were suppressed thereafter continue to send out alert notifications. This is not what my understand of how notification dependencies worked. Is there some other directive that I need to make it so monitors that are in a CRITICAL state do not send further notifications if their dependencies are not satisfied? One this that I did that was unusual mas making a service dependency template to try to streamline my services.cfg so I didn't need to use a dependencies.cfg. Here is the template: define servicedependency { namegeneric-dependency execution_failure_criteria c,u,p notification_failure_criteria c,u,p inherits_parent 1 register0 } and here is an example of a service that uses it: define service{ use standard-app-service hostgroup_name Apache service_description Apache_Status servicegroups Apache_Status check_command check_apachestatus_nrpe!localhost } define servicedependency { use generic-dependency hostgroup_name Apache dependent_service_description Apache_Status service_description NRPE_Available } Is this just a case of me not understanding how dependencies are supposed to work or is something else weird going on here? Thanks in advance for any help. -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Dependencies
Hi, I am trying to get some of the dependency stuff working in Nagios. I have a very simple system: Machine A Router B . Internet machine C I use the following to show the relationship: define host{ use linux-server host_name A parents B } 1) The 'status map' shows nagios as being somehow between A B, whereas it is on A (ie it runs on A). Is there a way that I can tell nagios that it is on A ? 2) I check that DNS is still working thus: define service{ use local-service host_name A service_description DNS Access check_command check_dns } How do I say that this depends on B ? ... I don't want warnings just because the router is dead. I also check (twice/day) that the clock on A is accurate, that will only work if NTP works, ie also depends on B, but is very much tied to A. TIA -- Alain Williams Linux Consultant - Mail systems, Web sites, Networking, Programmer, IT Lecturer. +44 (0) 787 668 0256 http://www.phcomp.co.uk/ Parliament Hill Computers Ltd. Registration Information: http://www.phcomp.co.uk/contact.php Chairman of UKUUG: http://www.ukuug.org/ #include std_disclaimer.h - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dependencies question
On Mon, 9 Jul 2007, Patrick M. Hausen wrote: Here's a group of somethings, i.e. hosts or services. If any number of them fails, just send a notification for the first one. I will look at the Nagios status page anyway and probably something else connecting them in some way failed. Have you seen check_cluster? http://nagios.manubulon.com/traduction/docs14en/clusters.html Does your network utilize load balancers? If so, consider monitoring the VIP instead of or in addition to the individual hosts behind the LB. When it comes to monitoring stuff across the WAN, I've found the best approach is to move the monitoring server (Nagios) as close to the monitored hosts as possible. This will help eliminate your multi-path to the data center problem and also eliminate the phenonmenon I refer to as monitoring the internet - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] dependencies and AND
Hello all, i have defined multiple host dependencis for one single host. Now notification is not sent when any of dependencies failed (any of hosts dependent upon is down). Is there any possibility to tweak this dependencies that notification is not sent only if ALL of hosts depended upon are down and sent it normally when only some of them are down? Many thanks for any ideas. Best regards, Jan Gregor - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] dependencies and AND
i have defined multiple host dependencis for one single host. Now notification is not sent when any of dependencies failed (any of hosts dependent upon is down). Is there any possibility to tweak this dependencies that notification is not sent only if ALL of hosts depended upon are down and sent it normally when only some of them are down? One possibility (and this may not be the best way -- it's on off-the-top-of-my-head thing) would be to have an addition check using check_cluster that failes if all of the depended-on services are down, and depend on that. - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] DEPENDENCIES
Hi, I have dependencies working mainly fine. However, i'm still not 100% satisfied that its behaving correctly. Using Nagios 2.0rc2 on suse 9.2 (i've used later releases for testing but behaviour is the same) I have one main service which is a database connection plugin to ensure the database is up and running. I have 3 additional dependency plugins which do various other checks such as ram and disk usage. When the main database plugin check goes to warning or critical I want to suppress email notifications on the 3 remaining checks as there is no point in running these (they won't return anything useful apart from timeouts). However, this rarely happens. Usually, I get the main check email notification plus 1 other email notification from one of the dependencies (though it may not always be the same dependency) Is there something i'm missing here ? Why can't it supress all three as i would expect ? Do i need to set a parameter setting somewhere ? My main plugin checks every 5 mins with the dependencies every 20 mins. Should i change these timings to something different and if so why ? This is driving me somewhat insane and any pointers / help would be very welcomed. regards, deborah This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. *** --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] DEPENDENCIES
On Wed, 8 Feb 2006, Deborah Martin wrote: Hi, I have dependencies working mainly fine. However, i'm still not 100% satisfied that its behaving correctly. Using Nagios 2.0rc2 on suse 9.2 (i've used later releases for testing but behaviour is the same) I have one main service which is a database connection plugin to ensure the database is up and running. I have 3 additional dependency plugins which do various other checks such as ram and disk usage. When the main database plugin check goes to warning or critical I want to suppress email notifications on the 3 remaining checks as there is no point in running these (they won't return anything useful apart from timeouts). However, this rarely happens. Usually, I get the main check email notification plus 1 other email notification from one of the dependencies (though it may not always be the same dependency) Is there something i'm missing here ? Why can't it supress all three as i would expect ? Do i need to set a parameter setting somewhere ? My main plugin checks every 5 mins with the dependencies every 20 mins. Should i change these timings to something different and if so why ? This is driving me somewhat insane and any pointers / help would be very welcomed. regards, deborah It is possible for the checks to be running concurrently - If the main check fails, then the service dependency checks kick in. but the cocurrent check is already in progress and notifications would be sent out. -sg -- --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] DEPENDENCIES
* Deborah Martin [EMAIL PROTECTED] [2006-02-08 16:40]: When the main database plugin check goes to warning or critical I want to suppress email notifications on the 3 remaining checks as there is no point in running these (they won't return anything useful apart from timeouts). However, this rarely happens. Usually, I get the main check email notification plus 1 other email notification from one of the dependencies (though it may not always be the same dependency) Whether or not service dependencies work as expected depends on the setting of soft_state_dependencies and your check intervals: | Try reading the nagios docs on soft_state_dependencies. Then setting | | soft_state_dependencies=1 | | in nagios.cfg. | | If A depends on B (aka no alert on A when B in trouble) make sure that: | | retry_check_interval(A) * max_check_attempts(A) | normal_check_interval(B) | | That should guarantee that at least one check of B occurs during the | trouble interval for A. Without the soft_state_dependencies you must use: | | retry_check_interval(A) * max_check_attempts(A) | normal_check_interval(B) + | (retry_check_interval(B) * max_check_attempts(B)) | | to make sure that B goes into a hard state during the trouble interval | of A. [ http://sourceforge.net/mailarchive/message.php?msg_id=14420901 ] This makes sure it should work in most cases (though in theory it can still fail if the critical check of service B was delayed for some reason). Holger PS: For host dependencies, a similar problem AFAICS cannot be solved without patching Nagios: http://sourceforge.net/mailarchive/message.php?msg_id=14641584 -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Dependencies
I am using 1.3, and reading about dependencies. http://nagios.sourceforge.net/docs/1_0/dependencies.html I am interested in host dependencies. When I used nagios perhaps over a year ago, I thought it was host dependencies which allowed me to get the status map drawn in such a way where you could see hosts connected through other hosts. This made it clear the organization and location in the network. I can't seem to make this happen, though nagios -v doesn't complain about anything at all. Is there any better documentation on this? Being dyslexic doesn't help (I can't keep straight the relation between host_name and dependent_host_name)... --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637alloc_id=16865op=click ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dependencies
fire-eyes wrote: I am using 1.3, and reading about dependencies. http://nagios.sourceforge.net/docs/1_0/dependencies.html I am interested in host dependencies. When I used nagios perhaps over a year ago, I thought it was host dependencies which allowed me to get the status map drawn in such a way where you could see hosts connected through other hosts. This made it clear the organization and location in the network. I can't seem to make this happen, though nagios -v doesn't complain about anything at all. Is there any better documentation on this? Being dyslexic doesn't help (I can't keep straight the relation between host_name and dependent_host_name)... Whoops, I was thinking parents in the host definitions. --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637alloc_id=16865op=click ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null