Re: [Nagios-users] Check ActiveSync
On Aug 3, 2010, at 6:14 AM, Mike A. Leonetti wrote:

> (2010-07-29 12:28), Israel Brewster wrote:
>
>> Is there a plugin that allows me to check whether I can make an ActiveSync connection to an Exchange server? I've found plugins that use, for example, check_nt to check various parameters of the Exchange server, and of course I can make sure that IMAP and SMTP connections are working, but I haven't found one that checks whether I can actually make an ActiveSync/Exchange connection, i.e. whether my users can actually connect. Thanks.
>
> You can always attempt some sort of check on http://yourwebaddress/Microsoft-Server-ActiveSync. ActiveSync is wrapped in HTTP, and the first thing you should get is a web login when you go to that site. You can start there.

Ah, thank you. I thought there was something like that, but I couldn't find it. I think simply using an HTTP check against that address, and making sure it lets me log in, should do the trick. Of course, I get a 501 error after logging in, but that's fine: at least I know it's responding and accepting credentials. Thanks.

---
Israel Brewster
Computer Support Technician II
Era Alaska
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x7293
---

Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
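The HTTP check discussed above could be sketched roughly as follows. This is only an illustration, not a tested plugin: the function names classify_status and activesync_status are made up here, and treating 501 as "alive" simply mirrors what was observed in this thread rather than any documented ActiveSync behavior.

```shell
#!/bin/sh
# Sketch of a Nagios-style check for the ActiveSync URL.
# classify_status and activesync_status are hypothetical helper names.

# Map an HTTP status code to a Nagios plugin exit code
# (0 = OK, 2 = CRITICAL, 3 = UNKNOWN).
classify_status() {
    case "$1" in
        200|501) return 0 ;;  # 501 after login was accepted as "alive" in the thread
        '')      return 3 ;;  # no status at all, e.g. curl could not be run
        *)       return 2 ;;  # 401, 403, 5xx, or 000 (no HTTP response)
    esac
}

# Print only the HTTP status code.
# $1 = base URL (e.g. https://mail.example.com), $2 = user:password
activesync_status() {
    curl -s -o /dev/null -w '%{http_code}' -u "$2" \
        "$1/Microsoft-Server-ActiveSync"
}

# Run as a plugin only when both arguments are supplied.
if [ "$#" -eq 2 ]; then
    code=$(activesync_status "$1" "$2")
    rc=0
    classify_status "$code" || rc=$?
    echo "ActiveSync HTTP status: ${code:-none}"
    exit "$rc"
fi
```

The stock check_http plugin should be able to do much the same with its -u (URL path), -a (user:password) and -e (expected status string) options, which avoids maintaining a custom script.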
[Nagios-users] Check ActiveSync
Is there a plugin that allows me to check whether I can make an ActiveSync connection to an Exchange server? I've found plugins that use, for example, check_nt to check various parameters of the Exchange server, and of course I can make sure that IMAP and SMTP connections are working, but I haven't found one that checks whether I can actually make an ActiveSync/Exchange connection, i.e. whether my users can actually connect. Thanks.

---
Israel Brewster
Computer Support Technician II
Era Alaska
Re: [Nagios-users] [OT]Network speed test
On Feb 12, 2010, at 10:03 AM, Matt Simmons wrote:

> I think you're looking for this: http://www.speedtest.net/mini.php

Exactly what I was looking for. Thanks.

> Feel free to direct general non-Nagios sysadmin-related questions to http://www.serverfault.com as well. Lots of good people on that site.

Good tip - I'll definitely look into it. Thanks again!

> --Matt
>
> On Fri, Feb 12, 2010 at 1:20 PM, Israel Brewster wrote:
>
>> This isn't a Nagios question so much as a general network diagnostic question, but since Nagios is all about network monitoring and diagnostics, I figured it was likely someone here would know the answer. I am looking for a tool for testing network speeds on our "local" network, similar to internet speed-test sites. The difference is that I need something I can install somewhere on our local network so we can get site-to-site results between our main hub and our outstations. The idea is to test the actual speed we are getting over our private links. This wouldn't necessarily need to integrate with Nagios, however, as it would just be for periodic spot checks - thus the OT label. Thanks.
>
> --
> LITTLE GIRL: But which cookie will you eat FIRST?
> COOKIE MONSTER: Me think you have misconception of cookie-eating process.

-------
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
[Nagios-users] [OT]Network speed test
This isn't a Nagios question so much as a general network diagnostic question, but since Nagios is all about network monitoring and diagnostics, I figured it was likely someone here would know the answer. I am looking for a tool for testing network speeds on our "local" network, similar to internet speed-test sites. The difference is that I need something I can install somewhere on our local network so we can get site-to-site results between our main hub and our outstations. The idea is to test the actual speed we are getting over our private links. This wouldn't necessarily need to integrate with Nagios, however, as it would just be for periodic spot checks - thus the OT label. Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
[Nagios-users] Mutually exclusive contacts?
I suspect the answer is no, but in Nagios 3, is there any way to set up mutually exclusive contacts? As in, I have a host template set up with a number of contacts. On specific hosts that use this template, I want to replace one contact (normal me) with a different contact (urgent me). The other contacts for the host remain the same, so I don't want to just override the contacts directive for the host.

I think I can do something like contacts +UrgentMe, !NormalMe, but I'm not even sure about that - I want to say there was an issue with doing negative contacts (although that might have been something else; I'll try it and see). Be that as it may, it would be really convenient if I could set something in the contacts to say "Urgent me overrides Normal me". Is this possible? Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
Re: [Nagios-users] Dependancy clarification
On Jan 8, 2010, at 2:55 PM, gmartin wrote:

> Israel, I believe you are correct. I'll be interested to hear what others have to say on the inner workings. In the meantime, can the problem be solved if the event handler for Service B is written to restart svc A if it is down? (perhaps it calls the same Nagios check from the command line and acts on the results)

Yeah, that should work, at least for my specific situation. Of course, doing so greatly reduces the utility of having the dependency in the first place: the situations under which it would be triggered (given Nagios restarting service A as soon as it detects it as down) would be somewhat rare, and even when triggered it would no longer be needed, since the service B event handler does its own dependency checking.

The only time the dependency would apply (assuming our understanding is right) is when Nagios detects A as down and then tries to run a check on B before verifying that A is back up. Even then it wouldn't matter, since a) Nagios should have restarted service A immediately (so a straight restart of B would be fine), and b) even if Nagios didn't, the new event handler for service B would. At that point there is no need for the dependency at all, since the event handler takes care of the dependencies.

Basically, if the dependency only applies when Nagios ALREADY knows service A is down, then the dependency is useless, at least in this situation. Of course, if this is just the way dependencies work, then there may be no other option. Thanks for the feedback.

> \\Greg
>
> On Fri, Jan 8, 2010 at 6:07 PM, Israel Brewster <isr...@frontierflying.com> wrote:
>
>> Here's the situation: running Nagios 3.2.0, I have two services; we'll call them A and B. Both have event handlers such that if they register a hard critical state, Nagios attempts to restart them. Service B depends on service A, such that when service A goes down, service B does as well, so both need to be restarted, with A restarted first. I have a servicedependency set up in Nagios specifying service B's dependency on service A.
>>
>> My understanding is that when Nagios goes to check service B, it first looks at the "current" state (as defined by the last Nagios check) of service A, and, if the execution_failure_criteria matches (i.e. if service A is down), Nagios does not run the check on service B, and thus does not run the event handler to attempt to restart B until A is back up. This is good. But what happens in the following scenario?
>>
>> Service A is scheduled to be checked every 5 minutes.
>>
>> 1) Nagios does a normally scheduled check of service A, finding it to be OK.
>> 2) One minute later, service A crashes.
>> 3) One minute after that (three minutes prior to the next regular check of service A), thanks to Nagios staggering checks, Nagios goes to do a normal check of service B.
>>
>> Now, to my understanding of this scenario, the check on service B would run normally, since the last check on A was OK and Nagios uses cached results for dependency checks. Since service A is actually critical, service B will be critical as well. The problem is that Nagios will respond by attempting to restart service B, which will invariably fail since service A is still down. Once the next regular check time for service A is reached, Nagios will detect service A as down and restart it, but service B will never get restarted successfully, since Nagios already tried and failed. Is this correct? If so, what can be done about it? Or is Nagios smart enough to schedule its service checks to avoid this scenario?
>>
>> It seems that the most logical solution (if possible) would be to mirror the service/host check logic. That is, when a check of service B comes back as critical, immediately check service A. If service A is critical, then don't declare service B to be critical until service A is OK, at which point B would enter a hard down state and run the event handler. Alternatively, if I could say something like "always check service A immediately before checking service B" to make sure our data is current, that would work as well, although I could see it resulting in excessive checking of service A, which may be less desirable. What do you guys think?

-------
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
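The event-handler idea from this exchange (have B's handler verify A itself before restarting anything) might look roughly like the sketch below. It is an assumption-laden illustration, not the handler actually used: the three command paths are placeholders, handle_service_b is a made-up name, and the arguments are assumed to be the $SERVICESTATE$ and $SERVICESTATETYPE$ macros passed on the command line.

```shell
#!/bin/sh
# Sketch of an event handler for service B that verifies service A first.
# All three commands are placeholders; override them for a real setup.
: "${CHECK_A_CMD:=/path/to/check_service_a}"
: "${RESTART_A_CMD:=/path/to/restart_service_a}"
: "${RESTART_B_CMD:=/path/to/restart_service_b}"

# $1 = $SERVICESTATE$, $2 = $SERVICESTATETYPE$ (assumed to be passed by Nagios)
handle_service_b() {
    # Only act on a hard CRITICAL state, as described in the thread.
    if [ "$1" != "CRITICAL" ] || [ "$2" != "HARD" ]; then
        return 0
    fi

    # Re-check A right now instead of trusting the last cached result.
    if ! $CHECK_A_CMD >/dev/null 2>&1; then
        # B cannot come up without A, so give up if A will not restart.
        $RESTART_A_CMD || return 1
    fi

    $RESTART_B_CMD
}

# Run as a handler only when Nagios supplies both arguments.
if [ "$#" -ge 2 ]; then
    handle_service_b "$1" "$2"
    exit $?
fi
```

Because the handler re-checks A at the moment B fails, it does not matter how stale the last scheduled result for A is, which is the gap the scenario above worries about.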
[Nagios-users] Dependancy clarification
Here's the situation: running Nagios 3.2.0, I have two services; we'll call them A and B. Both have event handlers such that if they register a hard critical state, Nagios attempts to restart them. Service B depends on service A, such that when service A goes down, service B does as well, so both need to be restarted, with A restarted first. I have a servicedependency set up in Nagios specifying service B's dependency on service A.

My understanding is that when Nagios goes to check service B, it first looks at the "current" state (as defined by the last Nagios check) of service A, and, if the execution_failure_criteria matches (i.e. if service A is down), Nagios does not run the check on service B, and thus does not run the event handler to attempt to restart B until A is back up. This is good. But what happens in the following scenario?

Service A is scheduled to be checked every 5 minutes.

1) Nagios does a normally scheduled check of service A, finding it to be OK.
2) One minute later, service A crashes.
3) One minute after that (three minutes prior to the next regular check of service A), thanks to Nagios staggering checks, Nagios goes to do a normal check of service B.

Now, to my understanding of this scenario, the check on service B would run normally, since the last check on A was OK and Nagios uses cached results for dependency checks. Since service A is actually critical, service B will be critical as well. The problem is that Nagios will respond by attempting to restart service B, which will invariably fail since service A is still down. Once the next regular check time for service A is reached, Nagios will detect service A as down and restart it, but service B will never get restarted successfully, since Nagios already tried and failed. Is this correct? If so, what can be done about it? Or is Nagios smart enough to schedule its service checks to avoid this scenario?

It seems that the most logical solution (if possible) would be to mirror the service/host check logic. That is, when a check of service B comes back as critical, immediately check service A. If service A is critical, then don't declare service B to be critical until service A is OK, at which point B would enter a hard down state and run the event handler. Alternatively, if I could say something like "always check service A immediately before checking service B" to make sure our data is current, that would work as well, although I could see it resulting in excessive checking of service A, which may be less desirable. What do you guys think?

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
Re: [Nagios-users] Override interval_length for specific service only
On Jan 7, 2010, at 2:37 AM, matthias.fla...@gmx.de wrote:

>> I was wondering if there was any way with Nagios 3.2 to override the global interval_length for a specific service? I have one service that I would like to check every 30 seconds, but interval_length is 60. I could change the interval_length definition, of course, but then I would have to go back through all my definitions and change the interval values I used there. This may not be too bad, as I make heavy use of templating, but it would be preferable to simply override this for the one service.
>
> In Nagios 3 you can specify fractional numbers for check_interval. So with the standard interval_length of 60 seconds, a check_interval of 0.5 would configure a 30-second interval for your particular check.

Perfect, thanks :-)

> -Matthias

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
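The arithmetic behind that answer is simply check_interval multiplied by interval_length. A tiny sketch, with interval_seconds being a made-up helper name and the commented service fragment shown only for orientation:

```shell
#!/bin/sh
# Effective seconds between checks = check_interval * interval_length.
# interval_seconds is a hypothetical helper, not a Nagios tool.
#
# Corresponding (partial) service definition, other directives omitted:
#   define service {
#       check_interval  0.5    ; 0.5 * 60 s = 30 s
#   }

# $1 = check_interval, $2 = interval_length in seconds (defaults to 60)
interval_seconds() {
    awk -v ci="$1" -v il="${2:-60}" 'BEGIN { print ci * il }'
}

interval_seconds 0.5 60   # the case from this thread
interval_seconds 5 60     # a five-minute check, for comparison
```

Only the one service needs the fractional value; the global interval_length and every other definition stay as they are.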
[Nagios-users] Override interval_length for specific service only
I was wondering if there was any way with Nagios 3.2 to override the global interval_length for a specific service? I have one service that I would like to check every 30 seconds, but interval_length is 60. I could change the interval_length definition, of course, but then I would have to go back through all my definitions and change the interval values I used there. This may not be too bad, as I make heavy use of templating, but it would be preferable to simply override this for the one service.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
[Nagios-users] Nagios stops updating
I want to say this is a FAQ, as I seem to recall seeing something about this behavior before, but I didn't find anything with a quick search, and I don't think there was any real solution posted, so I'm going to ask now.

I am running Nagios 3.2.0 on OpenBSD 4.6. I have found that from time to time Nagios will simply stop updating. No errors are produced, and the Nagios log shows normal operation right up until it stops updating. After that, nothing. I have implemented a workaround, a script that checks for this and restarts Nagios if needed, but obviously this is a sub-optimal solution. Has anyone seen this behavior before and have some idea how I can fix it? Or does anyone have a suggestion as to how I can go about debugging it? Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
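The workaround script is not shown in the post. One way such a watchdog could work, purely as a sketch, is to treat a status file that has not been rewritten for several minutes as the sign of a stalled daemon. The status file path, the age threshold and the restart command below are all assumptions, and is_stale is a made-up helper name.

```shell
#!/bin/sh
# Sketch of a watchdog for a Nagios process that has stopped updating.
# The path, threshold and restart command are placeholders.
: "${STATUS_FILE:=/var/nagios/status.dat}"
: "${MAX_AGE_MIN:=5}"
: "${RESTART_CMD:=echo restart-nagios-here}"

# Succeeds (returns 0) if $1 is missing or was last modified
# more than $2 minutes ago.
is_stale() {
    [ -e "$1" ] || return 0
    [ -n "$(find "$1" -mmin +"$2" 2>/dev/null)" ]
}

# Invoke with the single argument "run", e.g. from cron.
if [ "${1:-}" = "run" ]; then
    if is_stale "$STATUS_FILE" "$MAX_AGE_MIN"; then
        $RESTART_CMD
    fi
fi
```

For debugging rather than restarting, the same test could instead trigger a capture of process state (for example a process listing or a trace of the stuck daemon) at the moment the stall is detected.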
Re: [Nagios-users] Nagios with MRTG
Probably the most straightforward way is to use the notes_url host directive to link to the MRTG-generated graphs from Nagios. It doesn't precisely "embed" the graphs in Nagios, but at least you can access them from the Nagios interface with just one click. For my installation, I've simply placed a link to the MRTG graphs in the Nagios sidebar, but while perhaps more work, I would think using the notes_url (and optional notes_icon) directives would be a bit nicer.

On Dec 2, 2009, at 11:38 PM, wrote:

> Hello,
>
> Thank you for your reply, but it seems the check_mrtg plugin can get the MRTG data and show the value in Nagios. However, there seems to be no way to use the MRTG data to create a graph like MRTG's and embed it into Nagios. That is, when I click the monitored interface, it would show the MRTG graph instead of the Nagios format.
>
> Thank you.
>
> Terry Chow
> System Engineer
> Fortis (Hong Kong) Limited
> 28/F Fortis Bank Tower
> 77-79 Gloucester Road
> Hong Kong
> Tel : +852-3653-0823
> Email : terry.c...@hk.fortisnl.com
>
> From: webknowle...@gmail.com [mailto:webknowle...@gmail.com] On Behalf Of Marcel
> Sent: Friday, November 27, 2009 4:51 AM
> To: Chow Terry
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Nagios with MRTG
>
>> There is a check_mrtg plugin; if I recall correctly, you can check against values on the RRD.
>>
>> On Thu, Nov 26, 2009 at 7:07 AM, wrote:
>>
>>> Dear all,
>>>
>>> I am currently using Nagios 3 on Solaris 10 x86, and it is working fine. I have also installed MRTG on the server and configured it to collect traffic for one switch interface for testing. Is it possible to integrate the MRTG graph of the switch interface traffic into Nagios, so that I can use the Nagios portal to view the switch uptime and other alerts, and also view the bandwidth usage of the switch in MRTG format?
>>>
>>> Thank you.
>>>
>>> Terry Chow

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
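To make the notes_url suggestion concrete, here is a small sketch that prints a host object fragment linking to an MRTG page. Everything beyond the notes_url directive itself is an assumption made for illustration: the helper name mrtg_notes_fragment, the generic-switch template, the example host and the idea that MRTG writes one HTML page per host under a common base URL.

```shell
#!/bin/sh
# Print a host object fragment whose notes_url points at an MRTG page.
# mrtg_notes_fragment, the template name and the URL layout are hypothetical.

# $1 = host name, $2 = base URL of the MRTG output directory
mrtg_notes_fragment() {
    printf 'define host {\n'
    printf '    use        generic-switch\n'
    printf '    host_name  %s\n' "$1"
    printf '    notes_url  %s/%s.html\n' "$2" "$1"
    printf '}\n'
}

mrtg_notes_fragment switch1 http://nagios.example.com/mrtg
```

With a definition like this loaded, the host's page in the web interface should show a notes link that opens the MRTG graph in one click, which is the behavior described in the reply above.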
Re: [Nagios-users] What would you want in a replacement web interface?
No, knowing how to do "regular expression" (or wildcard) searches is not an extreme requirement. In fact, it makes perfect sense, and I am intimately familiar with both. That said, last time I tried that it didn't work - I'm not so stupid as to not think to try a simple wildcard :-). On the other hand, I'm running 3.0.6, as that is what is distributed with the stable version of OpenBSD. Perhaps this is something that was fixed in 3.1 or 3.2? I seem to recall reading something about a new web interface. So my point stands that this is MY biggest gripe with the web interface. If it's already been fixed, awesome! :-)

On Nov 24, 2009, at 11:48 AM, shadih rahman wrote:

> This is not an issue. The Nagios search bar does regular expression search. Do ffs-* in your box and it will result in all the hosts. Nagios is a monitoring tool. I think Nagios users and administrators are somewhat involved with technology. Knowing how to do a "regular expression" search is not an extreme requirement.
>
> On Tue, Nov 24, 2009 at 1:47 PM, Israel Brewster <isr...@frontierflying.com> wrote:
>
>> 2009/11/16 Taylor Dondich <tdond...@lilacnetworks.com>:
>>
>>> #3) What is your biggest gripe regarding the existing Nagios web user interface?
>>
>> The lack of a real search. The select box is practically useless. For example, I have a bunch of hosts named ffs-, but if I type ffs- into the search box, I only get one result. I would really like a real (albeit simple) search so if I type ffs, I get ALL hosts containing ffs.
>
> --
> Cordially,
> Shadhin Rahman

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
Re: [Nagios-users] What would you want in a replacement web interface?
2009/11/16 Taylor Dondich:

> #3) What is your biggest gripe regarding the existing Nagios web user interface?

The lack of a real search. The select box is practically useless. For example, I have a bunch of hosts whose names start with ffs-, but if I type ffs- into the search box, I only get one result. I would really like a real (albeit simple) search, so that if I type ffs, I get ALL hosts containing ffs.

---
Israel Brewster
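The difference between the search asked for here (a plain substring match) and a wildcard match can be sketched in a few lines. This is only an illustration of the matching logic, not how the Nagios CGIs implement their search box; the host names and both function names are made up for the example.

```python
from fnmatch import fnmatch


def substring_search(hosts, term):
    """Return every host whose name contains term, ignoring case."""
    needle = term.lower()
    return [h for h in hosts if needle in h.lower()]


def wildcard_search(hosts, pattern):
    """Return hosts matching a shell-style wildcard such as 'ffs-*'."""
    return [h for h in hosts if fnmatch(h.lower(), pattern.lower())]


# Hypothetical host names, for illustration only.
hosts = ["ffs-db1", "ffs-mail", "core-ffs-sw1", "printer-3"]

print(substring_search(hosts, "ffs"))    # all three names containing "ffs"
print(wildcard_search(hosts, "ffs-*"))   # only names that start with "ffs-"
```

The substring version needs no special syntax from the user, which is the point of the request; the wildcard version only finds names that start with the prefix.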
Re: [Nagios-users] Environment variables
Never mind, I found the configuration directive in the main Nagios config file.

On Sep 15, 2009, at 12:24 PM, Israel Brewster wrote:

> Is there anything special (say, in the Nagios config) that I need to do in order to use environment variables in my notification scripts? From the documentation, it looks like the various macros should be made available as environment variables, such as NAGIOS_HOSTNAME. However, when I do the following in my shell script:
>
>     ADDRESS=$NAGIOS_HOSTADDRESS
>     NAME=$NAGIOS_HOSTNAME
>     STATE=$NAGIOS_SERVICESTATE
>     TYPE=$NAGIOS_NOTIFICATIONTYPE
>     TIME=$NAGIOS_LONGDATETIME
>     DESC=$NAGIOS_SERVICEDESC
>     INFO=$NAGIOS_SERVICEOUTPUT
>
> I get nothing in the notification email that uses the above set variables. Additionally, if I dump the output of the env command into the e-mail, it doesn't show any NAGIOS_ variables. What am I missing here? The goal is for the script to be able to query the appropriate environment variables directly, rather than having to have a huge long (hard to read) command string that passes in everything from the macros as arguments. Thanks.

---
Israel Brewster
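For anyone hitting the same symptom, here is a minimal sketch of a notification helper that reads the macros from the environment and reports which ones are missing. The thread does not name the directive that was found; in Nagios 3 the main config option that controls environment macros is, to the best of my knowledge, enable_environment_macros, but treat that as an assumption and check your own nagios.cfg. The function names and the sample values are hypothetical.

```python
import os

# Environment variable names as quoted in the message above.
FIELDS = {
    "address": "NAGIOS_HOSTADDRESS",
    "name": "NAGIOS_HOSTNAME",
    "state": "NAGIOS_SERVICESTATE",
    "type": "NAGIOS_NOTIFICATIONTYPE",
    "time": "NAGIOS_LONGDATETIME",
    "desc": "NAGIOS_SERVICEDESC",
    "info": "NAGIOS_SERVICEOUTPUT",
}


def collect_macros(environ=None):
    """Map short field names to macro values; missing macros become ''."""
    environ = os.environ if environ is None else environ
    return {key: environ.get(var, "") for key, var in FIELDS.items()}


def missing_macros(environ=None):
    """List the expected NAGIOS_ variables that are absent."""
    environ = os.environ if environ is None else environ
    return sorted(var for var in FIELDS.values() if var not in environ)


# Demonstration with a fake environment (sample values are made up).
fake_env = {"NAGIOS_HOSTNAME": "ups1", "NAGIOS_SERVICESTATE": "CRITICAL"}
print(collect_macros(fake_env)["name"])
print(missing_macros(fake_env))
```

If missing_macros() returns every name when run from a real notification command, the macros are not being exported at all, which matches the empty env output described above.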
[Nagios-users] Environment variables
Is there anything special (say, in the Nagios config) that I need to do in order to use environment variables in my notification scripts? From the documentation, it looks like the various macros should be made available as environment variables, such as NAGIOS_HOSTNAME. However, when I do the following in my shell script:

    ADDRESS=$NAGIOS_HOSTADDRESS
    NAME=$NAGIOS_HOSTNAME
    STATE=$NAGIOS_SERVICESTATE
    TYPE=$NAGIOS_NOTIFICATIONTYPE
    TIME=$NAGIOS_LONGDATETIME
    DESC=$NAGIOS_SERVICEDESC
    INFO=$NAGIOS_SERVICEOUTPUT

I get nothing in the notification email that uses the above set variables. Additionally, if I dump the output of the env command into the e-mail, it doesn't show any NAGIOS_ variables. What am I missing here? The goal is for the script to be able to query the appropriate environment variables directly, rather than having to have a huge long (hard to read) command string that passes in everything from the macros as arguments. Thanks.

---
Israel Brewster
Re: [Nagios-users] Contact options on a hostgroup?
On Sep 15, 2009, at 8:28 AM, Marc Powell wrote:

> On Sep 15, 2009, at 11:01 AM, Israel Brewster wrote:
>
>> If I could apply contact groups/notification options to a hostgroup, however, then all I would need to do is add one hostgroup, and one config line to each host that is high-priority.
>
> You cannot apply contactgroups to a hostgroup. You used to be able to do so long ago but that was abandoned in favor of the greater flexibility of applying contacts per host{} and templates.
>
> -- Marc

That's pretty much what I thought (at least about not being able to do it), but it's always nice to have confirmation. I guess if I work it right though, it should still only be a single line added, that being the contact group. Thanks.

---
Israel Brewster
Re: [Nagios-users] Contact options on a hostgroup?
On Sep 14, 2009, at 12:01 PM, Terry wrote:

> On Mon, Sep 14, 2009 at 1:04 PM, Israel Brewster wrote:
>
>> Is there any way to apply contact options to a hostgroup? What I'm looking for is a way to make different hostgroups with different contacts and contact options, such as "Emergency", "high-priority", "low-priority" etc. At the moment I am putting these things into the host template, and then for any hosts that need to vary (such as a lower-priority file server as opposed to our high-priority database server), I override the contact_groups and/or notification_options directive. This works, but makes maintenance a little difficult as there is no easy way to look at just my high-priority servers or the like and see what I have where. Is there a better way of handling this? Thanks.
>
> I am not sure I understand 100%. If you just want a way to logically see all of your high priority servers, just create another hostgroup called high-priority and join the respective hosts to that hostgroup. You can then see only hosts in that hostgroup in the CGIs and apply commands to them.

The point isn't so much to see all the high-priority servers grouped, but rather to have an easy way of applying "high-priority" contacts to the hosts that need it. The grouping that would occur with a hostgroup would just be an added bonus. I could, of course, use host templating for this, but the problem with that is that I already heavily use templating to simplify my configs. Trying to add high-priority/low-priority to the host templates would double the number of host templates I would need, creating an annoyingly large number of host templates. If I could apply contact groups/notification options to a hostgroup, however, then all I would need to do is add one hostgroup, and one config line to each host that is high-priority.

---
Israel Brewster
[Nagios-users] Contact options on a hostgroup?
Is there any way to apply contact options to a hostgroup? What I'm looking for is a way to make different hostgroups with different contacts and contact options, such as "Emergency", "high-priority", "low-priority" etc. At the moment I am putting these things into the host template, and then for any hosts that need to vary (such as a lower-priority file server as opposed to our high-priority database server), I override the contact_groups and/or notification_options directive. This works, but makes maintenance a little difficult as there is no easy way to look at just my high-priority servers or the like and see what I have where. Is there a better way of handling this? Thanks.

---
Israel Brewster
Re: [Nagios-users] Version 2.12 vs. 3.1.2
For the most part, yes, v2 config files will carry over transparently to v3. There may be some of the more obscure parameters in v2 that don't work, but you'd have to consult the documentation to find out which, specifically. In general though, as long as you don't do anything TOO fancy, there shouldn't be any problems. There weren't for me. The general procedure is to upgrade to 3, run a check on your config files, and if any errors are reported, fix them. As I said, for me there weren't any problems reported, so you should be fine.

---
Israel Brewster

On Aug 5, 2009, at 12:04 PM, Neil Dombrowski wrote:

> I am fairly new to Nagios, and have to set up a new server with a new config. Currently when we do a yum install nagios we get 2.12, but I'm hearing that we'll have v3 in our repository soon. My question is: if I use v2.12 to set up my config files, will they just plug into v3? If the format/location of the files are the same, then I can just install 2.12 and configure it now. If they are different, I'll probably get v3 elsewhere and start with that.
>
> Thanks,
> Neil
Re: [Nagios-users] SNMP Monitoring conundrum
On Jul 16, 2009, at 2:08 PM, Max wrote:

> On Thu, Jul 16, 2009 at 5:50 PM, Israel Brewster wrote:
>
>> Anyone else run into a similar situation? What does everyone else think the best approach here is? Thanks.
>
> We do a lot of SNMP monitoring for a number of various agents - Net-SNMP, Cisco, Sysedge. I have found it to be much easier to maintain and release the scripts by doing one agent type per script. This lets you create a 'bundle' of scripts for an agent type. E.g.
>
>     net-snmp/check_net_snmp_cpu.pl
>     net-snmp/check_net_snmp_disk.pl
>     net-snmp/check_net_snmp_la.pl
>     net-snmp/check_net_snmp_mem.pl
>     net-snmp/check_net_snmp_procs.pl
>     net-snmp/check_net_snmp_swap.pl
>     sysedge/check_sysedge_cpu.pl
>     sysedge/check_sysedge_la.pl
>     sysedge/check_sysedge_mem.pl
>     sysedge/check_sysedge_swap.pl
>     sysedge/check_sysedge_unix_storage.pl
>
> This then helps us also keep our configurations consistent, as we do the service definitions and command definitions in files by type:
>
>     commands/net-snmp.cfg
>     commands/sysedge.cfg
>     services/net-snmp.cfg
>     services/sysedge.cfg
>
> As you know, code bases get pretty big pretty quickly with SNMP, so having one agent type per script also decreases the chances of bugs and makes it easier to fix bugs when they arise. So my personal recommendation is to keep the scripts to 1 agent type per script.
>
> - max

So if I understand you correctly, after you make 1 agent type per script, you would then write a wrapper script calling multiple individual scripts for the cases where you want more than one piece of data? The approach certainly bears consideration, especially since, as you say, the simpler scripts make debugging easier. I hadn't originally thought of that approach because I wanted two pieces of data displayed in Nagios: power state and estimated run time, so I just made one script that gave that data. It might be worth breaking it down more though. Thanks for the suggestion.

---
Israel Brewster
Re: [Nagios-users] SNMP Monitoring conundrum
On Jul 16, 2009, at 2:06 PM, Mark Gius wrote:

> I would suggest you use custom object variables (http://nagios.sourceforge.net/docs/3_0/customobjectvars.html). This is a Nagios 3 feature, so if you're on 2.0 you'll need to upgrade. Basically, you can define custom variables in hosts/services/whatever. So you'd have a custom object var in the host
>
>     define host {
>         name    UPS-thing
>         _OIDS   <-- This being the custom variable that you define.
>     }
>
> and in your service you'll have
>
>     define service {
>         name            snmp-check
>         check_command   check_snmp!$_HOSTOIDS$   <-- Use it like so
>     }
>
> -Gius

Ah, I KNEW I had seen something about that sort of thing somewhere! I think I'll probably combine this with Jon Angliss's suggestion (adding a version argument). Although if I start having more than a couple of sets of OIDs to worry about, I may rethink that approach to avoid my scripts becoming unmaintainably large. Thanks.

---
Israel Brewster

> Israel Brewster wrote:
>
>> I have a number of scripts I have developed to monitor our UPS units via SNMP - I found the included plugins didn't give me enough power/flexibility with the monitoring, so I made my own. Unfortunately, we have just added a new UPS to the system which, although made by the same company, uses different OIDs for the data I am pulling. Thus the issue: I want the same data, so other than the OIDs the existing scripts would work. But how do I deal with different OIDs on different devices? So far I've thought of the following:
>>
>> 1) Make a second copy of my scripts with the new OIDs. This also implies creating new contacts to use the new script (the notification script pulls data such as estimated run time via SNMP) and maintaining two copies of each check/notification script.
>>
>> 2) Somehow pass the OIDs to the scripts from Nagios. I'm not quite sure how I would manage this, as the OIDs would need to be associated with the hosts, and it is further complicated by the fact that some scripts use 3 or more OIDs - for example, the main script looks at UPS state (on battery, on line power, etc.), estimated run time (if on battery) and time on battery - so it would need to be passed three separate OIDs. Perhaps this approach could be simplified by passing a code identifying which set of OIDs the script should use, at the expense of a more complicated script. I'd still need to find some way of getting that into Nagios so it can pass the appropriate value to the script, though.
>>
>> Anyone else run into a similar situation? What does everyone else think the best approach here is? Thanks.
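The "pass a code identifying which set of OIDs to use" idea from option 2 can be sketched as a lookup table inside the check script. This is a minimal sketch, not the scripts discussed in the thread: the set names, the select_oids helper, and every OID below are placeholders invented for illustration. On the Nagios side, one way to supply the code would be a host custom variable (for example a hypothetical _OIDSET on the host, referenced as $_HOSTOIDSET$ in the check command), following the custom object variable mechanism described above.

```python
# Hypothetical OID sets keyed by a short code. The OIDs are placeholders,
# not real UPS OIDs; substitute values from the vendor MIB.
OID_SETS = {
    "old": {
        "state": ".1.3.6.1.4.1.0.1.1",
        "runtime": ".1.3.6.1.4.1.0.1.2",
        "time_on_battery": ".1.3.6.1.4.1.0.1.3",
    },
    "new": {
        "state": ".1.3.6.1.4.1.0.2.1",
        "runtime": ".1.3.6.1.4.1.0.2.2",
        "time_on_battery": ".1.3.6.1.4.1.0.2.3",
    },
}


def select_oids(set_code):
    """Return the OID mapping for set_code, or raise ValueError if unknown."""
    try:
        return OID_SETS[set_code]
    except KeyError:
        known = ", ".join(sorted(OID_SETS))
        raise ValueError(f"unknown OID set {set_code!r}; known sets: {known}")


# Example: the code would normally arrive as a command-line argument.
oids = select_oids("new")
print(oids["runtime"])
```

Keeping the table in one place means a third UPS model only adds one dictionary entry, although, as noted in the reply above, the table can grow unwieldy if many models accumulate.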
Re: [Nagios-users] SNMP Monitoring conundrum
On Jul 16, 2009, at 2:26 PM, Marc Powell wrote:

> On Jul 16, 2009, at 4:50 PM, Israel Brewster wrote:
>
>> I have a number of scripts I have developed to monitor our UPS units via SNMP - I found the included plugins didn't give me enough power/flexibility with the monitoring, so I made my own. Unfortunately, we have just added a new UPS to the system which, although made by the same company, uses different OIDs for the data I am pulling. Thus the issue: I want the same data, so other than the OIDs the existing scripts would work. But how do I deal with different OIDs on different devices? So far I've thought of the following:
>>
>> 1) Make a second copy of my scripts with the new OIDs. This also implies creating new contacts to use the new script (the notification script pulls data such as estimated run time via SNMP) and maintaining two copies of each check/notification script.
>>
>> 2) Somehow pass the OIDs to the scripts from Nagios. I'm not quite sure how I would manage this, as the OIDs would need to be associated with the hosts, and it is further complicated by the fact that some scripts use 3 or more OIDs - for example, the main script looks at UPS state (on battery, on line power, etc.), estimated run time (if on battery) and time on battery - so it would need to be passed three separate OIDs. Perhaps this approach could be simplified by passing a code identifying which set of OIDs the script should use, at the expense of a more complicated script. I'd still need to find some way of getting that into Nagios so it can pass the appropriate value to the script, though.
>
> Beyond what you've received so far, you can pass arguments to the command from the service definitions too, so there are a number of ways that you can pass information for your script to act on. The only part you really need to figure out is what 'standard' information you want to pass to the script and what tools the language it's written in provides to parse those arguments.

Good suggestion, and it would certainly work. However, that would muddy up (to some extent) my Nagios configs. The way I have things set up at the moment, there is one (ok, two actually, but same idea) service for UPS checks, which is applied to the UPS host group. So to add a new UPS all I need to do is add a new host with a "use generic-UPS" directive, the name, IP, and parent. That keeps everything nice and simple. Overriding that for a specific host (or hosts) would add to the complexity, something I would like to avoid as much as possible. Thanks for the suggestion though.

> http://nagios.sourceforge.net/docs/3_0/macros.html (Command Argument Macros)
>
>     define service {
>         host_name             fancyups1
>         service_description   SNMPCHECK
>         check_command         my_check_command!APC!.1.3.1.6.1.3!.1.3.1.7.5.4
>     }
>
>     define service {
>         host_name             fancyups2
>         service_description   SNMPCHECK
>         check_command         my_check_command!MGE!.1.3.1.6.1.3.10.17!.1.3.1.6.1.4.3.2.4
>     }
>
>     define command {
>         command_name   my_check_command
>         command_line   $USER1$/my_custom_script --type $ARG1$ --oid $ARG2$ --oid $ARG3$ ; assumes your script can parse opts
>         # or
>         # command_line   $USER1$/my_custom_script $ARG1$ $ARG2$ $ARG3$
>     }
>
> The two above then become
>
>     /path/to/my_custom_script --type APC --oid .1.3.1.6.1.3 --oid .1.3.1.7.5.4
>     /path/to/my_custom_script --type MGE --oid .1.3.1.6.1.3.10.17 --oid .1.3.1.6.1.4.3.2.4
>
> or
>
>     /path/to/my_custom_script APC .1.3.1.6.1.3 .1.3.1.7.5.4
>     /path/to/my_custom_script MGE .1.3.1.6.1.3.10.17 .1.3.1.6.1.4.3.2.4
>
> -- Marc

---
Israel Brewster
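On the script side, parsing the `--type ... --oid ... --oid ...` form shown above might look like the following. This is a sketch only: my_custom_script in the thread is a placeholder whose language is not stated, so Python's standard argparse module is used here as one possible tool, and parse_args is a name made up for the example.

```python
import argparse


def parse_args(argv):
    """Parse --type and one or more --oid options, in the order given."""
    parser = argparse.ArgumentParser(
        description="Sketch of a check script taking a UPS type and OIDs."
    )
    parser.add_argument("--type", dest="ups_type", required=True,
                        help="vendor or model code, e.g. APC or MGE")
    # action="append" collects every --oid occurrence into a list.
    parser.add_argument("--oid", dest="oids", action="append", required=True,
                        help="OID to query; may be given more than once")
    return parser.parse_args(argv)


# Arguments as Nagios would expand them from the first service above.
args = parse_args(["--type", "APC",
                   "--oid", ".1.3.1.6.1.3",
                   "--oid", ".1.3.1.7.5.4"])
print(args.ups_type, args.oids)
```

Because the OIDs are collected in order, the script has to agree with the service definition on which position means what (state first, run time second, and so on); that convention is an assumption the config and the script must share.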
Re: [Nagios-users] nagios log file date format
On Jul 15, 2009, at 8:23 AM, Chris Waters wrote:

> You could do something like this (replace the paths to reflect your environment):
>
>     tail -f /usr/local/nagios/var/nagios.log | perl -pe 's/(\d+)/localtime($1)/e'
>
> or
>
>     grep 'something important to look for' somelogfilewithepochtimestamps.log | perl -pe 's/(\d+)/localtime($1)/e'
>
> or
>
>     perl -pe 's/(\d+)/localtime($1)/e' somelogfilewithepochtimestamps.log
>
> Chris Waters

Thank you! This is (for me at least) a much better approach than the one specified in the FAQ, in that I can (as in your first example) use it with tail -f for real-time monitoring when testing stuff. Any chance of getting this approach incorporated into the FAQs?

---
Israel Brewster

> From: shadih rahman [mailto:shadhi...@gmail.com]
> Sent: Wednesday, July 15, 2009 9:10 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] nagios log file date format
>
> Is there any way to change the Nagios log file date format to a human-readable format? I did not see any option in the nagios.cfg file. Please advise on this. Thanks
>
> -- Cordially, Shadhin Rahman
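For those who would rather not use Perl, the same substitution can be sketched in Python. It assumes each log line starts with the epoch time in square brackets, as in `[1247673600] SERVICE ALERT: ...`; unlike the Perl one-liners it only touches that leading timestamp rather than the first run of digits anywhere in the line. The function names and the output date format are choices made for this example.

```python
import io
import re
import time

# Leading "[<epoch seconds>]" at the start of a log line.
EPOCH_PREFIX = re.compile(r"^\[(\d+)\]")


def humanize(line):
    """Replace a leading [epoch] stamp with a local date and time."""
    def repl(match):
        stamp = time.localtime(int(match.group(1)))
        return "[" + time.strftime("%Y-%m-%d %H:%M:%S", stamp) + "]"
    return EPOCH_PREFIX.sub(repl, line, count=1)


def filter_stream(src, dst):
    """Copy lines from src to dst, converting timestamps on the way."""
    for line in src:
        dst.write(humanize(line))


# Demonstration on an in-memory sample; the alert text is made up.
sample = io.StringIO("[1247673600] SERVICE ALERT: ups1;Battery;CRITICAL\n")
out = io.StringIO()
filter_stream(sample, out)
print(out.getvalue(), end="")
```

To use it as a pipe filter in the same way as the `tail -f ... | perl` example, call `filter_stream(sys.stdin, sys.stdout)` after importing sys; flushing after each write may be needed for real-time output.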
[Nagios-users] SNMP Monitoring conundrum
I have developed a number of scripts to monitor our UPS units via SNMP. I found the included plugins didn't give me enough power/flexibility with the monitoring, so I made my own. Unfortunately, we have just added a new UPS to the system which, although made by the same company, uses different OIDs for the data I am pulling.

Thus the issue: I want the same data, so other than the OIDs the existing scripts would work. But how do I deal with different OIDs on different devices? So far I've thought of the following:

1) Make a second copy of my scripts with the new OIDs. This also implies creating new contacts to use the new script (the notification script pulls data such as estimated run time via SNMP) and maintaining two copies of each check/notification script.

2) Somehow pass the OIDs to the scripts from nagios. I'm not quite sure how I would manage this, as the OIDs would need to be associated with the hosts. It is further complicated by the fact that some scripts use 3 or more OIDs. For example, the main script looks at UPS state (on battery, on line power, etc.), estimated run time (if on battery) and time on battery, so it would need to be passed three separate OIDs. Perhaps this approach could be simplified by passing a code identifying which set of OIDs the script should use, at the expense of a more complicated script. I would still need to find some way of getting that code into Nagios so it can pass the appropriate value to the script, though.

Anyone else run into a similar situation? What does everyone else think the best approach here is? Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
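The second option in the question (passing a short code that selects a set of OIDs) could be sketched roughly as follows. This is only an illustration under stated assumptions: the profile names, the `-p/--profile` flag and every OID number below are hypothetical placeholders, not values from the poster's scripts or from any vendor MIB. On the Nagios side, one way the code might be attached to each host in Nagios 3.x is a custom host variable such as `_UPSPROFILE`, referenced in the command definition as `$_HOSTUPSPROFILE$`.

```python
import argparse

# Placeholder OID sets keyed by a short profile code. The OID numbers are
# made up for illustration; substitute the values from each UPS model's MIB.
OID_PROFILES = {
    "ups-old": {
        "state": "1.3.6.1.4.1.99999.1.1.0",
        "runtime": "1.3.6.1.4.1.99999.1.2.0",
        "time_on_battery": "1.3.6.1.4.1.99999.1.3.0",
    },
    "ups-new": {
        "state": "1.3.6.1.4.1.99999.2.1.0",
        "runtime": "1.3.6.1.4.1.99999.2.2.0",
        "time_on_battery": "1.3.6.1.4.1.99999.2.3.0",
    },
}


def parse_args(argv):
    """Parse the host and the profile code handed over by the check command."""
    parser = argparse.ArgumentParser(description="UPS check with per-model OID sets")
    parser.add_argument("-H", "--host", required=True)
    parser.add_argument("-p", "--profile", required=True,
                        choices=sorted(OID_PROFILES))
    return parser.parse_args(argv)


def oids_for(profile):
    """Return the dict of OIDs the rest of the script should query."""
    return OID_PROFILES[profile]
```

With this layout the existing check and notification logic stays in one script; supporting another UPS model means adding one more entry to the table rather than copying the script.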
Re: [Nagios-users] Nagios plugin for monitoring files in a directory?
On Jul 10, 2009, at 8:16 AM, David Rosenstrauch wrote:

> Just wondering if anyone might have some pointers on a monitoring puzzle I've run into.
>
> We have a couple of automated processes - one of them an automated FTP upload, the other an automated log file gatherer - which wind up causing essentially the same result: new files regularly getting added to a disk directory. I need to set up monitoring on this, since if we're not getting new files showing up then something has crashed and our system won't be getting updated properly.
>
> Not quite sure how to do this, though - i.e., which nagios plugin would help me get the job done. I've used the check_file_age plugin successfully in a couple of spots. E.g., to verify that a backup process has been run successfully I perform a unix "touch" on a "last-backup-date" file at the end of the backup, and then have the check_file_age plugin check the age of that file. I don't think I can use that plugin & technique here, however, since:
>
> a) Each uploaded file has a different (timestamped) name, which I don't know ahead of time. So I have no way to tell the check_file_age plugin which file to monitor.
>
> b) Unlike my backup process, which I wrote and was able to alter to touch the last-backup-date file, I don't have control over the external processes creating/uploading these files.
>
> I saw there were some rudimentary directory-monitoring plugins over at the Monitoring Exchange site, but they seem like they can only do things like monitor the number of files in a directory vs. a threshold minimum, which is not quite what I want. (I don't really care about how many files there are in the directory.)
>
> Anyone have any suggestions on an existing plugin (either part of the standard Nagios release, or available from Monitoring Exchange) that can help here? I'm not afraid to roll my own plugin if needed, but I'd obviously prefer to save myself some work and use an existing one if possible.

How about using check_file_age on the directory itself? I think the timestamp of the directory is updated whenever a new file is added. Or are there other processes going on in the directory, such as deletions or modifications, that might make this value unreliable?

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> Thanks,
> DR
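If deletions in the directory turn out to make the directory's own timestamp unreliable, a small home-grown plugin could look at the newest file instead. The following is a minimal sketch, not an existing plugin: the function names and thresholds are invented for illustration, and it assumes that the modification time of the newest regular file is a good enough signal that uploads are still arriving (modifications to old files would still count as "fresh").

```python
import os
import time


def newest_file_mtime(directory):
    """Return the latest mtime among regular files, or None if there are none."""
    newest = None
    with os.scandir(directory) as entries:
        for entry in entries:
            if entry.is_file():
                mtime = entry.stat().st_mtime
                if newest is None or mtime > newest:
                    newest = mtime
    return newest


def check(directory, warn_seconds, crit_seconds, now=None):
    """Return (exit_code, message) using the usual plugin codes 0/1/2/3."""
    now = time.time() if now is None else now
    newest = newest_file_mtime(directory)
    if newest is None:
        return 3, "UNKNOWN - no files found in %s" % directory
    age = int(now - newest)
    if age >= crit_seconds:
        return 2, "CRITICAL - newest file is %d seconds old" % age
    if age >= warn_seconds:
        return 1, "WARNING - newest file is %d seconds old" % age
    return 0, "OK - newest file is %d seconds old" % age
```

A wrapper script would print the message and exit with the returned code, for example `code, msg = check("/path/to/uploads", 3600, 7200)` followed by `print(msg)` and `sys.exit(code)`; the path and thresholds here are placeholders.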
Re: [Nagios-users] Can nagios replace whatsup
On Jun 17, 2009, at 2:33 AM, Alex Dehaini wrote:

> Hi Guys,
>
> My company uses nagios and whatsup together. Nagios helps us to monitor services and send out notifications, and whatsup does live network monitoring. Recently, my supervisor wants me to scrap whatsup for nagios. I am not very comfortable with his decision. Whatsup is a spot-check tool. The two products' views are different. Our nagios server checks over 800 hosts and services while whatsup checks less than 80 devices.
>
> Is there a way to configure nagios to be a spot-check tool? What do you guys suggest? I would not mind getting an open-source graphical spot-check tool like whatsup.

I'm not familiar with whatsup myself, but from what you are saying, it sounds to me like the main difference is that nagios is checking many more hosts than whatsup, although they are both doing similar monitoring. Just looking at the less than 80 hosts monitored by whatsup in nagios is problematic because they get lost among all the other hosts that nagios is checking. Is that correct?

If so, perhaps you could simply configure a separate contact in nagios, probably with notification options set to none, and only assign this contact to the 80 or so hosts you want to monitor closely. Then when you log in as that user, only those 80 or so hosts are visible. At least, that is my understanding of how contacts work: only the hosts for which you are a contact are displayed. Perhaps someone else could expound on this as a solution?

I also seem to recall reading about/briefly looking at some sort of front-end for nagios that provides a much more customizable view, where you could select just the hosts you want. I think it was Nagios Looking Glass: <http://exchange.nagios.org/directory/Addons/Frontends-(GUIs-and-CLIs)/Web-Interfaces/Nagios-Looking-Glass/details> Perhaps this will suit your purposes?

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> Alex Dehaini
> Developer
> Site - www.alexdehaini.com
> Email - alexdeha...@gmail.com
Re: [Nagios-users] Hostgroup subtraction?
On Jun 3, 2009, at 1:25 PM, Marc Powell wrote:

> On Jun 3, 2009, at 2:43 PM, Chris Pepper wrote:
>
>> We use several hostgroups. I'd like to ensure each host is in a group for its building. My concern is that we need to avoid adding a host without adding it to a building hostgroup. I tried putting the generic parent object in a hostgroup for our main location, and using 'hostgroups -mainlocation,+actuallocation', but apparently hostgroups only allows '+', not '-' or '!'. Is there a way to remove hosts from inherited hostgroups?
>
> In this specific example, using the following in the host{} definition will override the template definition and the host would only be in actuallocation --
>
>     hostgroups actuallocation

For this example, that works, and perhaps for the original poster it's good enough. I'm running into a very similar situation, however, where I have a list of hostgroups in a base definition, and I just want to remove one hostgroup from that list. I don't want to override the entire list, because it might change in the future, and then I'd have to change it in two spots. Plus, I want to keep the other hostgroups; I just don't want the one specific one. So is there a way to remove one (or more) item(s) from an inherited hostgroup list, without overriding the entire thing? Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Importing 2.6 config files into a 3.1.0 server
Step 1: Copy the config files from the 2.6 server to the 3.1.0 server.
Step 2: Make sure it works, and fix any errors that nagios -v reports.

It's that easy :)

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

On May 27, 2009, at 12:53 PM, Kaplan, Andrew H. wrote:

> Hi there --
>
> Is it possible to import into a 3.1.0 server the configuration files that are currently on a 2.6 server? If so, what steps and/or tools are necessary in order to accomplish this task?
>
> Thanks.
Re: [Nagios-users] differing notifications
On May 20, 2009, at 10:39 AM, Neil Dombrowski wrote:

> RijilV wrote:
>
>> 2009/5/20 Neil Dombrowski :
>>
>>> I haven't been able to find an answer to this conundrum looking at the docs, faq, or googling. I'd love to hear any suggestions.
>>>
>>> We have two contact groups; group "adminmail" sends an email to our mailboxes and "adminphone" sends email to our phones. The idea is when a "warning" notification happens, adminmail gets notified, and when a "critical" notification happens adminphone gets notified.
>>>
>>> As a test I set up two services that check disk. For the first one I set:
>>>
>>>     notification_options w,r
>>>     contact_groups adminmail
>>>
>>> For the second service I set:
>>>
>>>     notification_options c
>>>     contact_groups adminphone
>>>
>>> This works, but it's kludgy. Is there a way to define one contact group for a warning, and a different contact group for critical? We are currently running v1.2 (yes, old), but will be upgrading eventually. I'd like to know if there's a solution I can implement now; if we need to upgrade to get this functionality I'd like to know as well.
>>>
>>> Thanks,
>>> Neil
>>
>> http://nagios.sourceforge.net/docs/1_0/xodtemplate.html#contact
>>
>> Just set up two contacts, one for adminmail that gets warnings, the other for adminphone that gets crits. Add them to the same contact group, then add that contact group to your service check. This style/syntax hasn't changed really, save for in nagios 3.x you don't need the contact group if you don't want it.
>>
>> .r'
>
> So, basically you're saying to put both contact groups (adminmail and adminphone) into a third contact group, then use that contact group in the service? How would I then make warning notifications go only to adminmail, and critical notifications go only to adminphone?

No, in the individual contacts you can define what sort of notifications that individual contact receives, using the host_notification_options [d,u,r,f,s,n] and service_notification_options [w,u,c,r,f,s,n] directives of the contact. So what you do is make two contacts (not groups, but individual contacts), specifying that adminmail only receives warning (not critical) and adminphone only receives critical (not warning). Add both contacts to a single contact group, add that contact group to the service, and specify that the service sends both critical and warning notifications. Due to the filters in the contacts, each contact will only receive the appropriate types. If you have more people you need to contact, just make more contacts for each person, keeping the phone contacts and the mail contacts separate.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> Thanks,
> Neil
>
> --
> Neil Dombrowski
> Linux Administration
> Pricegrabber.com
> AIM: neilatpg
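The two-stage filtering described in the reply above can be pictured with a toy model. This is not Nagios code and not its real algorithm in full (recoveries, time periods and escalations are ignored); it is just a sketch, with invented names, of the idea that a notification must pass the service's own notification_options and then each contact's service_notification_options.

```python
# Map plugin states to the option letters used in notification filters.
STATE_CODES = {"WARNING": "w", "UNKNOWN": "u", "CRITICAL": "c"}

# Two individual contacts in one contact group, each with its own filter.
CONTACTS = {
    "adminmail": {"service_notification_options": {"w"}},
    "adminphone": {"service_notification_options": {"c"}},
}


def recipients(state, service_options, contacts):
    """Return the contacts that would be notified for this service state."""
    code = STATE_CODES[state]
    # Stage 1: the service itself must allow this notification type.
    if code not in service_options:
        return []
    # Stage 2: each contact filters on its own allowed types.
    return sorted(
        name for name, contact in contacts.items()
        if code in contact["service_notification_options"]
    )
```

In this model, a service with `notification_options` of `w,c` and both contacts in its group sends warnings only to adminmail and criticals only to adminphone, which is the behaviour the reply describes.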
Re: [Nagios-users] Notify via Growl?
On May 20, 2009, at 9:52 AM, Mathieu Gagné wrote:

> Hi,
>
> On 5/20/09 1:00 PM, Israel Brewster wrote:
>> I have nagios running on an openBSD box, and I would like to use Growl's network notification options to pop-up alert messages on my Mac for certain situations. Does anyone have a script or program that can do this already? I've looked around a bit, but all I've found seem to assume that nagios is running on the Mac, which is not the case here. I've got the Growl dev tools, which come with a python script it looks like I could hack to work, but before I spend too much time on that I wanted to see if it has already been done. Thanks.
>
> 1) Install Net::Growl from CPAN:
>    http://search.cpan.org/~nmcfarl/Net-Growl-0.99/
> 2) Allow network notification in Growl on your Mac and set a password.
> 3) Create a notify script using it based on the provided example.
> 4) Enjoy! :)
>

Which is basically the same thing as hacking the Python script except for a) having to start from scratch, and b) having to work in a language I have little experience with :-) I was sort of hoping someone had already done the above steps and would be willing to share :-D Oh well, it won't take more than a couple of hours to whip the Python into shape, and it has the advantage of being self-contained, aside from some standard Python modules (so no installing from CPAN or the like). Thanks for the suggestion though :-)

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Mathieu
[Nagios-users] Notify via Growl?
I have nagios running on an openBSD box, and I would like to use Growl's network notification options to pop-up alert messages on my Mac for certain situations. Does anyone have a script or program that can do this already? I've looked around a bit, but all I've found seem to assume that nagios is running on the Mac, which is not the case here. I've got the Growl dev tools, which come with a python script it looks like I could hack to work, but before I spend too much time on that I wanted to see if it has already been done. Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Generating availability report in Nagios 3.0.6
On May 15, 2009, at 4:46 AM, Paul Corcoran wrote:

> Hi,
>
> I've recently completed a distributed installation of Nagios consisting of a master server acting as the web front end for Nagios and 2 slave servers that do all the service checking.
>
> Currently we are monitoring approx 500 hosts and perform approx 3300 service checks.
>
> Nagios performs daily rotation of the logs and they are approx 250MB in size for each 24 hour period.
>
> I am having some difficulty running availability reports at the moment. I am able to generate reports for the current day but if I try and run a report against data that has been archived I get an internal server error from Apache.
>
> I don't think it is a permissions issue nor do I think it's a security issue. Memory usage on the web box is quite high and avail.cgi seems to use a lot of it so it could be something as simple as not enough memory in the machine (The server currently has 3GB)

Well, I don't know that this will be of much help, but I would strongly suspect that memory is not the issue. I have a server with about 300 hosts, although with significantly fewer services than you have, and it is running quite happily and generating reports for whatever period I want, with only 512MB RAM. Actually, about 300MB of that is free at the moment, and when I run an availability report, the cgi only uses around 3MB. So whatever the issue is, I doubt the problem is a lack of memory.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

>
> The server is running Fedora Core 10 as its OS.
>
> If anyone can shed some light on this issue it would be most appreciated.
>
> Kind Regards,
>
> Paul Corcoran
Re: [Nagios-users] [OT] Network help?
On May 8, 2009, at 1:40 PM, Jim Avery wrote:

> 2009/5/8 Israel Brewster :
>> I apologize for the off-topic post, but I figure the people here tend to be involved with networking, so perhaps someone can direct me to the right place to post a question about some networking problems I am having. Any suggestions?
>
> If it's social-networking problems, I recommend maybe facebook.com!
>
> Seriously though, it depends. If it's to do with Linux you could seek out your local LUG, but why not just say what the problem is here, you never know ... someone might find a solution to your problem AND explain how Nagios can help you to check how effective their answer to it was!

Nope, not social, although if I don't get it fixed soon (or find another solution) it might become a social problem :-) I didn't post it here because I thought some might get annoyed with me cluttering the list with non-nagios problems, but here goes.

Thanks to a recent company acquisition made by my company (Frontier Flying), we are now dealing with two separate networks (they will eventually be combined, but that's a ways off still). The company we bought (Era Aviation) uses a piece of software called Sabre for their flight reservations, which contacts a central server to which their network has a direct connection. The Sabre server itself is on a third network, but there are a couple of routers that link the two (Sabre and Era) across a private network, so aside from a couple of routing and NAT statements they are effectively on the same network.

In order to be able to run Sabre on our network, we established a VPN tunnel between our network and Era's. As far as I can tell, the VPN is functioning. I can ping computers on the Era network, including the Sabre server, and can access and control various computers on the Era network. For initial testing purposes we installed the Sabre software (Windows only) inside a Parallels installation on one of our Mac laptops. This worked perfectly. So far so good.

Until we tried installing the software on one of our Windows desktops. Then the problems started. The Sabre software would make the initial connection and log in the user, but when it tried to download the initial info it needs, it just timed out. Every time. Meanwhile, the laptop, which was connected through the same dumb switch, continued to work perfectly. Every time. We took the Windows machine off our network and put it on Era's (different location, obviously) and it started working. Put it back on ours, timeout.

Since then we have tried installing Sabre on a number of different computers on our network. A second Mac laptop running Parallels works fine, while a Mac desktop with an identical install of Parallels doesn't. We have managed to get one Windows machine running the software on our network, while a stack of four Windows machines that Era sent us, which had been working on their network just fine, don't work.

On one hand it seems impossible that it could be an issue with the network, because you would expect that to affect all computers equally, not the hit-and-miss (but mostly miss) scenario that we are facing. On the other hand, it can't be a problem with the individual computer, because the same computer with the exact same configuration (including network settings - DHCP) works fine on the Era network, but stops working when moved to ours. We thought maybe it was a DNS problem (since obviously DHCP on our network would give different DNS servers than DHCP on Era's network), so we tried putting the Era DNS servers in statically, but while it worked as far as DNS went (we still got name resolution) that didn't help with Sabre.

We are going bald here tearing our hair out trying to figure out what could be causing this issue. Sabre technical support is no help - they just say it's our problem. Management is coming down on us pretty hard to get something working here, so any help anyone can provide would be GREATLY appreciated :-). Let me know if I left out any relevant details or testing we performed!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
[Nagios-users] [OT] Network help?
I apologize for the off-topic post, but I figure the people here tend to be involved with networking, so perhaps someone can direct me to the right place to post a question about some networking problems I am having. Any suggestions?

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Children "unreachable" on soft down?
On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:

>
> On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
>
>> So is this just something I'll have to live with? I don't seem to be getting much feedback on the subject. :(
>
> Well, my response would be to fix the problem that's causing the outages in the first place or adjust the way you're monitoring the parents so that the plugin used recognizes when this temporary event is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I explained in one of my e-mails, they are an artifact of the connection type), so that leaves us with adjusting the monitoring. Now I thought that the recheck options were there exactly for this reason: to catch brief outages and not alert. And for the parent host that seems to be the case, but apparently that logic doesn't carry on to the child hosts. As such, somehow things would need to be adjusted so it never even sees the outages, even enough to go into a soft down state.

Anyone have any suggestions for how I can accomplish this? Adjusting the timeout or using, say, an ssh check rather than icmp won't do it - the packets are still lost, and the ssh check would still time out. Perhaps I could send more pings at longer intervals (so that if it doesn't get a response the single check retries at 15 second intervals or so before returning a result), but then the check would start taking several seconds or more to complete, and that wouldn't be a good thing. Assuming nagios even allowed a check to run that long - doesn't it have a mechanism to kill a check that doesn't return in a given time frame? I'm a little stumped here as to how I can adjust things.

> What you're asking for is that nagios track that the child went from down->unreachable->down without an intermediate OK state and suppress notifications in that case. That would appear to be a code change and would be better discussed on nagios-devel but I would encourage the check plugin approach first.

Ok. I know there is code in there that knows who it sent down messages to and doesn't send up messages to people that didn't get a down (primarily dealing with escalations), so I was hoping that maybe there would be something similar for this, i.e. seeing that the last notification sent was a down notification, and as such there is no need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

>
> --
> Marc
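The "more pings at longer intervals" idea from the message above could be sketched as a thin wrapper that retries inside a single check, so a short glitch never surfaces as a soft down. This is only an illustration under stated assumptions: the function names and default values are invented, and `ping_once` assumes a Unix-like `ping` that accepts `-c 1` (flags differ between platforms).

```python
import subprocess
import time


def retry_probe(probe, attempts=4, delay=15.0, sleep=time.sleep):
    """Call probe() up to `attempts` times; return True on the first success.

    Between failed attempts the function waits `delay` seconds, so a brief
    outage has time to clear before the check reports a failure.
    """
    for attempt in range(attempts):
        if probe():
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False


def ping_once(host):
    """Send one ICMP echo request; True if the ping command exits with 0."""
    result = subprocess.run(
        ["ping", "-c", "1", host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0
```

A plugin built on this would exit 0 when `retry_probe(lambda: ping_once(host))` returns True and 2 otherwise. The trade-off raised in the message is real: the worst case runs for roughly `attempts` ping timeouts plus `(attempts - 1) * delay` seconds, and that total has to stay below the check timeouts configured in nagios.cfg (`host_check_timeout` and `service_check_timeout`), or Nagios will kill the check.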
Re: [Nagios-users] Children "unreachable" on soft down?
So is this just something I'll have to live with? I don't seem to be getting much feedback on the subject. :(

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Children "unreachable" on soft down?
On Apr 6, 2009, at 9:03 AM, Giorgio Zarrelli wrote:

> Hi,
>
> I've not quite clear what happens to you,

Thanks for the response. For clarification, the exact sequence of events is as follows:

1) The link between the nagios box and one of our routers, which we will refer to as the parent host, glitches for 30 seconds or so. Due to the nature of the link (satellite connection) this is semi-expected, and happens a couple of times a day.

2) Nagios catches this glitch in one of its regularly scheduled host checks, and puts the parent host into a soft down state. Again, normal and expected - even good.

3) At the same time, Nagios puts the children of the parent host into an "unreachable" state. Makes sense, at least, but leads to the issue.

4) The parent host is now in recheck mode (as it is only in a soft down state and has three rechecks set), so it checks again a minute later. This check succeeds, as the outage was transitory. The parent host is put back into an "UP" state. As it never was in a hard "down" state, no notification is sent. This is good.

5) Since the parent is now up, the child host is now changed to a (soft, I think) "down" state.

6) Checks continue on a normal schedule. As the link does not glitch again for several hours, the parent remains up and the child remains (correctly) down. Three checks later, the child enters a hard "down" state (since it was unreachable and only just switched back to down). A down notification is sent for the child.

7) Everything remains good for the next several hours until the link glitches again. Repeat from step one.

The notification in step 6 is the problem here - the child host was down before the glitch, and the child host is still down after. But because the child host was temporarily put in an unreachable state, we get notified again that it is down, resulting in a string of "DOWN" messages with no up or real change in status.

> but one thing I have in mind is try
>
> soft_state_dependencies=0
>
> Besides that, the problem seems to be in the roots of the check.
> It's not healthy to have a ping check failing every 2 strikes.
> Try to change the host alive check, using a ssh check instead.

The check is not failing every 2 strikes. It's failing once, briefly, every few hours - just barely long enough to make one check fail and throw the parent host into a soft down state. The first recheck (one minute later) works fine, bringing the parent back to an up state. The next several hundred or more checks also work fine (as the problem was transitory and brief). For this reason, changing the check wouldn't help - for the duration of that single check, the host really is down (or more precisely, unreachable, as it is a link issue), and any check I used would say so.

> Another approach, not so useful, would be to increase the timeout
> for the ping (-W) so it will have less chances to fail.

Except that it's not a timeout issue. It is a very real, albeit brief (around 30 seconds or so), outage. Not long enough or frequent enough to really impact productivity or anything, but long enough for nagios to catch it (for a single check).

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> Giorgio
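One way to keep a glitch of roughly 30 seconds from ever registering as a soft down is to let a single check execution outlast it by retrying inside the plugin, as discussed elsewhere in this thread. The sketch below is an illustration under assumptions, not a tested plugin: it shells out to the Linux `ping` command (where `-W` is a per-reply timeout in seconds), and the function names, attempt count, and pause length are made up for the example. Note that a check which can run this long will need `host_check_timeout` in nagios.cfg raised above the worst-case run time, and that long-running host checks have a cost of their own.

```python
import subprocess
import sys
import time


def ping_once(address):
    """Send one ICMP echo with the system ping; True if a reply came back."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", "2", address],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def check_with_retries(probe, attempts=3, pause=15.0, sleep=time.sleep):
    """Call probe() up to `attempts` times, pausing between failures.
    Returns the 1-based attempt that succeeded, or 0 if none did."""
    for attempt in range(1, attempts + 1):
        if probe():
            return attempt
        if attempt < attempts:
            sleep(pause)
    return 0


def main(argv):
    address = argv[0]
    attempt = check_with_retries(lambda: ping_once(address))
    if attempt:
        print(f"OK - {address} answered on attempt {attempt}")
        return 0
    print(f"CRITICAL - {address} did not answer")
    return 2


# Only runs when called with an argument: <address>
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main(sys.argv[1:]))
```

With the defaults shown, a failing check takes a little over 30 seconds before giving up (three 2-second pings plus two 15-second pauses), which is the trade-off to weigh against the repeat notifications.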
Re: [Nagios-users] Children "unreachable" on soft down?
So does anyone have any ideas as to how I can resolve this situation? It continues to be an annoyance. Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Children "unreachable" on soft down?
On Mar 31, 2009, at 1:09 AM, Andreas Ericsson wrote:

> Israel Brewster wrote:
>> Does nagios (3.0.3) mark a child host as unreachable when its
>> parent enters a soft down state? I am finding myself getting
>> repeated down messages for a host (which is, in fact, down), even
>> though I have notifications set to only send a single message.
>> Looking at the logs, it would appear that what is happening is
>> that the host is flipping between "down" (which notifies me) and
>> "unreachable" (which does not). The parent host, however, never
>> enters a hard down state. Looking at the logs, what I see is that
>> one ICMP check fails, throwing the host into a soft down state,
>> but the next one works just fine, bringing it back to an up state.
>> The logic works fine for the parent host- since it never hits a
>> hard down state, it doesn't alert, and everyone is happy. But
>> apparently with the child host every time this happens, it
>> switches from critical to unreachable and back again, triggering a
>> notification. Is there any way to keep this from happening? Thanks.
>
> Doesn't flapping detection do what you want? You'd get a few
> notifications, but they'd stop after the 3rd flip or something, I
> think.

Flapping detection helps, but doesn't solve. For one thing, as you mentioned, you still get at least a couple of notifications before it kicks in. For another thing, this happens with a frequency of something like once an hour or so (not consistently), so the host will flip from down to unreachable and back again, triggering an e-mail, perhaps do it a second time, and then it will sit in the correct "down" state for the next 50 checks or so (thus canceling any flapping detection) before repeating the process. It's not like I'm getting messages every five minutes or anything, it's just that I'm getting repeated down messages every hour or two for hosts that have been down and haven't actually changed state.

I could, of course, schedule down time, except that I want to be notified if/when the people in the remote station get their act together and get the machine(s) in question back online. Also that is only partially effective for machines that have been sent in for repair, because I don't really know when the scheduled down time will be over. They are down, I know they are down, I just don't want to be told about it every few hours :-)

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Andreas Ericsson andreas.erics...@op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.
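To see why flap detection never really engages for a pattern like this, it helps to run the numbers. Nagios's documented flap detection looks at the last 21 check results and compares the percentage of state changes among them against a low and a high threshold. The sketch below is a simplification: it ignores the extra weight Nagios gives to more recent changes, and the 20% high threshold is an assumption (believed to match the sample configuration, so check your own nagios.cfg).

```python
# Assumed high threshold in percent; verify against your own configuration.
ASSUMED_HIGH_THRESHOLD = 20.0


def percent_state_change(history):
    """Unweighted share of adjacent check results that differ, in percent."""
    if len(history) < 2:
        return 0.0
    changes = sum(1 for a, b in zip(history, history[1:]) if a != b)
    return 100.0 * changes / (len(history) - 1)


def would_start_flapping(history, high=ASSUMED_HIGH_THRESHOLD):
    """True if the state-change percentage reaches the high threshold."""
    return percent_state_change(history) >= high


# One down -> unreachable -> down blip inside a window of 21 results.
blip = ["DOWN"] * 10 + ["UNREACHABLE"] + ["DOWN"] * 10
print(percent_state_change(blip), would_start_flapping(blip))
```

A single blip produces two state changes out of twenty possible, so the host stays well under the assumed threshold, and by the time the next glitch arrives the earlier changes have dropped out of the window.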
[Nagios-users] Children "unreachable" on soft down?
Does nagios (3.0.3) mark a child host as unreachable when its parent enters a soft down state? I am finding myself getting repeated down messages for a host (which is, in fact, down), even though I have notifications set to only send a single message. Looking at the logs, it would appear that what is happening is that the host is flipping between "down" (which notifies me) and "unreachable" (which does not). The parent host, however, never enters a hard down state. Looking at the logs, what I see is that one ICMP check fails, throwing the host into a soft down state, but the next one works just fine, bringing it back to an up state. The logic works fine for the parent host - since it never hits a hard down state, it doesn't alert, and everyone is happy. But apparently with the child host every time this happens, it switches from critical to unreachable and back again, triggering a notification. Is there any way to keep this from happening? Thanks.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Host checks instead of service checks
Using check_dummy with a service check titled "No Services" is exactly how we are doing this here. We have a number of hosts where we are really only interested in whether the machine is running or not - so check_icmp is specified as the host check, and "No Services" as the only service check. You are right, however, in that all these "No Services" entries show up in the service status list. If you don't want that, about the only way I can think of is to not associate any services with the host, and simply ignore the warnings you get about that when running nagios -v. This does cause some display issues, however (hosts without services tend not to show up in various screens) - thus the reason I opted to add the "No Services" service.

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

On Mar 16, 2009, at 9:34 AM, Deborah Martin wrote:

> Folks,
>
> Whilst using check_dummy as a service check does resolve the pre-flight warnings, it's not the solution I think I'm looking for. The total number of service checks now includes the dummy checks which I've chosen to always exit as UP so that it doesn't show up as a service problem - which to me doesn't sound like the right way to do this. I want all service checks to be useful service checks rather than have checks which would need to be ignored as they don't return anything useful as with check_dummy - I wouldn't ever use that check in any Nagios SLA reporting.
>
> What I really want to do is the following:
>
> Hosts -->> ssh check -->> When "CRITICAL" alert under "Host problems" only.
>
> The docs show a distinct difference for behaviour between Nagios 2.0b4 and Nagios 3.0.6.
>
> From Nagios 2.0b4, under Host Definitions:
>
> check_command: This directive is used to specify the short name of the command that should be used to check if the host is up or down. Typically, this command would try and ping the host to see if it is "alive". The command must return a status of OK (0) or Nagios will assume the host is down. If you leave this argument blank, the host will not be checked - Nagios will always assume the host is up. This is useful if you are monitoring printers or other devices that are frequently turned off. The maximum amount of time that the notification command can run is controlled by the host_check_timeout option.
>
> From Nagios 3.0.6:
>
> check_command: This directive is used to specify the short name of the command that should be used to check if the host is up or down. Typically, this command would try and ping the host to see if it is "alive". The command must return a status of OK (0) or Nagios will assume the host is down. If you leave this argument blank, the host will not be actively checked. Thus, Nagios will likely always assume the host is up (it may show up as being in a "PENDING" state in the web interface). This is useful if you are monitoring printers or other devices that are frequently turned off. The maximum amount of time that the notification command can run is controlled by the host_check_timeout option.
>
> The difference is highlighted in RED.
>
> So on changing the "ssh check" from a service to a host definition (to prevent PENDING on the hosts), the pre-flight warnings now complain there are no services associated with the hosts. On big systems here (100 nodes+) this makes the pre-flight output really unreadable and not really accurate, as surely we should be able to choose whether something is a host check or service check but not have to define BOTH!
>
> Ultimately, I just want to filter SSH check problems to appear under "Host problems" rather than under "Service problems" so users can easily spot other service problem issues and not have to trawl through 100's of lines of output in the web interface. Believe me, Nagios web interface users can be a fickle bunch!
>
> A switch to tell Nagios to not complain about this would be really useful unless anyone thinks of a good reason why this would be a bad idea.
>
> Any help / pointers would be appreciated, even if it's to tell me politely how stupid I'm being - I can take it!
>
> regards,
> deborah
>
> From: Deborah Martin [mailto:deborah.mar...@kognitio.com]
> Sent: 16 March 2009 15:34
> To: 'Richard Quintin'
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Host checks instead of service checks
>
> Thanks Richard. I'll give that a try.
>
> regards,
> deborah
>
> -Original Message-
> From: Richard Quintin [mailto:rich+nag...@quintinz.com]
> Sent: 16 March 2009 15:26
> To: Deborah Martin
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re:
Re: [Nagios-users] snmp monitoring of RICOH / DEVELOP
Well, I threw together a (very) quick shell script for this purpose that I am using here. It basically just checks the status SNMP entry of our Ricohs and returns a status code depending on that, along with whatever status text SNMP gives. It is NOT polished, and will probably break if you look at it wrong, but it works for us. If anyone wants it, let me know.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

On Mar 5, 2009, at 6:28 AM, Kevin Keane wrote:

> I'm actually writing a plugin for monitoring printers. Currently, I have
> only tested it with HP printers, but at least in theory it should work
> with Ricoh and others as well, since it uses standard SNMP. If you want
> to try it out, download http://sourceforge.net/projects/tntnagiosplugin/
> I would actually appreciate feedback about what printers it does or does
> not work with.
>
> One catch: the plugin only runs on Windows, so it is not directly useful
> for the majority of nagios users (I wrote it primarily for my own
> needs). Also, the only thing it currently monitors is toner level.
>
> Doug Veldhuisen wrote:
>>
>> I tried once to get RICOH Savin Printers checked via Nagios. Searched
>> high and low for something and never did find anything that was
>> already developed.
>>
>> Figured I was going to have to write my own stuff.
>>
>> Now I’m trying the same thing with Toshiba printers, nothing there
>> either.
>>
>> If someone has a way to utilize “Check_hpjd” for non HP printers, let's
>> pool the knowledge on this subject.
>>
>> Doug
>>
>> From: Trnka Tomas [mailto:tomas.tr...@podravka.cz]
>> Sent: Thursday, March 05, 2009 7:38 AM
>> To: nagios-users@lists.sourceforge.net
>> Subject: [Nagios-users] snmp monitoring of RICOH / DEVELOP
>>
>> Anyone have experience with snmp monitoring of RICOH / DEVELOP
>> printers? I'd like to have results something like from Check_hpjd ….
>> Thanks in advance for your notes
>
> --
> Kevin Keane
> Owner
> The NetTech
> Find the Uncommon: Expert Solutions for a Network You Never Have to
> Think About
>
> Office: 866-642-7116
> http://www.4nettech.com
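For anyone who wants a starting point, here is a rough sketch of the same general approach (read one SNMP status value, map it to a Nagios exit code). It is not the shell script mentioned in this thread, and it has not been tested against Ricoh or Develop hardware. It assumes net-snmp's `snmpget` is installed, that the printer exposes `hrDeviceStatus` from the standard Host Resources MIB, and that the device index is 1. The mapping of "testing" to WARNING is a choice made for this example only.

```python
import subprocess
import sys

# hrDeviceStatus from the Host Resources MIB (RFC 2790). The trailing
# index ".1" is an assumption and may differ from device to device.
HR_DEVICE_STATUS_OID = "1.3.6.1.2.1.25.3.2.1.5.1"

# hrDeviceStatus value -> (Nagios exit code, label)
STATUS_MAP = {
    1: (3, "UNKNOWN"),   # unknown
    2: (0, "OK"),        # running
    3: (1, "WARNING"),   # warning
    4: (1, "WARNING"),   # testing (mapping chosen for this sketch)
    5: (2, "CRITICAL"),  # down
}


def to_nagios(raw_value):
    """Translate an hrDeviceStatus integer (as text) to (code, label)."""
    try:
        return STATUS_MAP.get(int(raw_value.strip()), (3, "UNKNOWN"))
    except ValueError:
        return (3, "UNKNOWN")


def snmp_get(host, community, oid):
    """Fetch one value with snmpget; -Oqve prints only the value,
    with enumerations shown as numbers."""
    result = subprocess.run(
        ["snmpget", "-v1", "-c", community, "-Oqve", host, oid],
        capture_output=True, text=True, timeout=10,
    )
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip())
    return result.stdout


def main(argv):
    host, community = argv[0], argv[1]
    try:
        code, label = to_nagios(snmp_get(host, community, HR_DEVICE_STATUS_OID))
    except (RuntimeError, OSError, subprocess.TimeoutExpired) as err:
        print(f"PRINTER UNKNOWN - {err}")
        return 3
    print(f"PRINTER {label} - hrDeviceStatus reported by {host}")
    return code


# Only runs when called with arguments: <host> <community>
if __name__ == "__main__" and len(sys.argv) > 2:
    sys.exit(main(sys.argv[1:]))
```

Keeping the value-to-exit-code mapping in one small function makes it easy to adjust once you know what a particular printer model actually reports.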
[Nagios-users] Monitor SOAP?
Is there a nagios plugin out there that can monitor a soap server, i.e. by sending a soap call to the server and looking for a response? Or do I need to write one? I did a search on nagios exchange for "soap" and "Web services", but didn't find anything that looked promising. Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
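If it does come down to writing one, a check of this kind is fairly small. The sketch below is only an illustration of the approach described in the question: POST a SOAP envelope, then look for an expected string in the reply. It uses Python's standard library only; the function names, the choice of WARNING for an unexpected reply, and the command-line layout are assumptions made for the example.

```python
import sys
import urllib.error
import urllib.request


def build_request(url, soap_action, envelope):
    """Build an HTTP POST carrying a SOAP 1.1 envelope."""
    return urllib.request.Request(
        url,
        data=envelope.encode("utf-8"),
        headers={
            "Content-Type": "text/xml; charset=utf-8",
            "SOAPAction": f'"{soap_action}"',
        },
        method="POST",
    )


def evaluate(status, body, expected):
    """Map an HTTP status and response body to a Nagios (code, message)."""
    if status != 200:
        return 2, f"SOAP CRITICAL - HTTP status {status}"
    if expected not in body:
        return 1, f"SOAP WARNING - reply lacks '{expected}'"
    return 0, "SOAP OK - expected text found in reply"


def check_soap(url, soap_action, envelope, expected, timeout=10):
    """Send the call and classify the outcome."""
    request = build_request(url, soap_action, envelope)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as reply:
            body = reply.read().decode("utf-8", errors="replace")
            return evaluate(reply.status, body, expected)
    except urllib.error.HTTPError as err:  # e.g. 500 carrying a SOAP fault
        return evaluate(err.code, "", expected)
    except OSError as err:  # URLError, timeouts, refused connections
        return 2, f"SOAP CRITICAL - {err}"


def main(argv):
    url, soap_action, envelope_path, expected = argv[:4]
    with open(envelope_path, encoding="utf-8") as handle:
        envelope = handle.read()
    code, message = check_soap(url, soap_action, envelope, expected)
    print(message)
    return code


# Only runs when called with: <url> <soap_action> <envelope_file> <expected>
if __name__ == "__main__" and len(sys.argv) > 4:
    sys.exit(main(sys.argv[1:]))
```

Matching on a substring is crude but keeps the check independent of any particular service's schema; parsing the reply as XML would be the obvious next step.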
Re: [Nagios-users] 1 Host 2 IP
You can check multiple IPs in one check (and thus, one host) using check_icmp. This is a basic ping-type check, but it has a flag to say you only need one of the IPs to respond in order for the host to be considered up. Or just symlink it to check_host, and it returns OK as soon as it receives one reply, regardless of which IP the reply comes from.

As far as monitoring different services on each IP, what you'll have to do is make custom check commands that accept the IP as one of the arguments ($ARG1$ for instance) rather than just assuming the host address, which will still have to be set to a single IP even if using check_icmp.

What I've done on a number of hosts of ours that have multiple IPs is to set the host check as check_icmp, looking at both IPs and only going critical if both are down. I then add services that also use check_icmp, each looking at one specific IP. That way I get a host critical alert if both are down, but otherwise the host stays OK, and just the service corresponding to the specific IP that isn't working goes critical. Works well for us.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

On Feb 18, 2009, at 7:09 AM, Romain VRIGNAUD wrote:

> I have to migrate a 1.x nagios to the last stable.
> Until now, each host is defined 2 times, once for each network interface.
> However I think it makes the overview more complicated to analyse.
> So I'm seeking a way to make Nagios understand that they are the
> same physical host. Is there something to do in configuring host
> groups?
>
> I don't understand your second question. I use different checks
> (ssh / nrpe / nsca / nsclient) but I don't see the point.
>
> Romain
>
> -Original Message-
> From: Assaf Flatto [mailto:assaf.fla...@ssp-intl.com]
> Sent: Wednesday, 18 February 2009 16:57
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] 1 Host 2 IP
>
> You can create a different host for each IP and create tests (services)
> for those hosts with no overlapping on the definitions.
>
> The issue is to know if you are using external checks only or also
> using NRPE or another client to execute checks on the machine?
>
> Assaf
>
> On Wednesday 18 February 2009 15:46:31 Romain VRIGNAUD wrote:
>> Hi
>>
>> I'd like to know how I should configure Nagios for my hosts which have
>> multiple IP addresses.
>>
>> I'd like Nagios to consider them as only one host, but I don't know how
>> to configure that.
>>
>> On top of that, I need to check different services on the different IPs.
>>
>> How should I do this?
>>
>> Regards
>
> --
> Assaf Flatto
> SSP Ops Team
> Linux System Administrator
> 169 Euston Road, London, NW1 2AE
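The host-versus-service split described in this thread (host is down only when every address fails, while each address gets its own service result) comes down to a small piece of decision logic. The sketch below shows just that logic with the probing left out, since in practice check_icmp does the actual pinging. The function name and message wording are made up for the example.

```python
def summarize(results):
    """results maps IP -> True/False (did it answer?).

    Returns (host_code, host_message, per_ip_codes), using Nagios exit
    codes: 0 means UP/OK and 2 means DOWN/CRITICAL.
    """
    down = sorted(ip for ip, ok in results.items() if not ok)
    per_ip = {ip: (0 if ok else 2) for ip, ok in results.items()}
    if len(down) == len(results):
        # Every address failed (or none were given): the host is down.
        return 2, "DOWN - no address answered", per_ip
    if down:
        # At least one address answered, so the host itself stays up.
        return 0, "UP - not answering: " + ", ".join(down), per_ip
    return 0, "UP - all addresses answered", per_ip


# Example with made-up addresses: one interface answers, one does not.
host_code, host_message, per_ip = summarize(
    {"10.0.0.1": True, "10.0.1.1": False}
)
print(host_code, host_message, per_ip)
```

The per-IP codes correspond to the individual services, so a single dead interface shows up as one critical service while the host itself stays up.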
Re: [Nagios-users] Different notifications depending on warning?
On Feb 16, 2009, at 9:07 AM, RijilV wrote:

> 2009/2/16 Israel Brewster
>
>> I don't know if this is possible, but is there any way to make Nagios
>> send different alerts (or, rather, not send alerts) depending on what
>> exactly a warning state is? My understanding is that Nagios only looks
>> at the return code, not the status text, so this shouldn't be possible,
>> but I figured it couldn't hurt to ask.
>
> In the past I have written my own notification program to do similar
> things, i.e. instead of having Nagios send email directly, pass the
> notification off to some other program that looks at whatever conditions
> you want and makes a decision based on that. In this case you could just
> write a "printer notification" program that doesn't send email when the
> service output says the printer is low on paper.
>
> .r'

Sounds promising. I'll give it a shot.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
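A minimal sketch of the "printer notification" idea, in Python. It is not RijilV's actual program. It assumes the notification command hands the script the service state, the plugin output and the number of seconds spent in the current state (for example via the $SERVICESTATE$, $SERVICEOUTPUT$ and $SERVICEDURATIONSEC$ macros). The matched phrases and the one-hour grace period are made-up values for illustration.

```python
# Hypothetical phrases; adjust to whatever your printer plugin actually prints.
DEFERRED_PHRASES = ("out of paper", "low on paper", "paper low")
GRACE_SECONDS = 60 * 60  # assumed grace period before paper alerts are mailed


def should_notify(state, output, duration_seconds):
    """Return True if an email should be sent for this notification."""
    if state != "WARNING":
        # CRITICAL, UNKNOWN and recovery (OK) notifications always go out.
        return True
    text = output.lower()
    if any(phrase in text for phrase in DEFERRED_PHRASES):
        # User-fixable problem: stay quiet until it has persisted a while.
        return duration_seconds >= GRACE_SECONDS
    # Any other warning (e.g. low toner) is mailed immediately.
    return True


def main(argv):
    """Exit status 0 means "send the mail", 1 means "suppress it"."""
    state, output, duration = argv[1], argv[2], int(argv[3])
    return 0 if should_notify(state, output, duration) else 1
```

The service still shows as WARNING in the web interface either way; only the email is held back. For the delayed paper alert to ever arrive, the service needs a non-zero notification_interval so that Nagios calls the script again later.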
[Nagios-users] Different notifications depending on warning?
I don't know if this is possible, but is there any way to make Nagios send different alerts (or, rather, not send alerts) depending on what exactly a warning state is? My understanding is that Nagios only looks at the return code, not the status text, so this shouldn't be possible, but I figured it couldn't hurt to ask.

The problem I am trying to solve is the following: I am monitoring a number of printers via SNMP, and there are a number of conditions that will trigger a "WARNING" status on the printer: out of paper, open front cover, low toner, etc. Some of these, such as out of paper, can be and typically are quickly and easily solved by the end user, so there is no point in e-mailing us about them, at least not until the printer has been empty for a while. However, the condition should still show up as a warning in Nagios, because there have been situations where someone was unable to print to a remote printer, and it turned out that no one at the remote location had noticed it was out of paper.

Other "WARNING" situations we need to be notified about immediately, primarily low toner, in which case we need to make sure we have a replacement available, and perhaps even sent out if the location is remote.

I suppose I could write a plugin that looks at the status text and "upgrades" certain warnings, such as low toner, to critical status, but this would be problematic as well, because we would then lose the distinction between "low toner", which just means we need to make sure we have more, and "toner empty", which means we need to go replace it.

So is there a good solution to this dilemma that I am missing? Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] Notification on OK-state
On Feb 12, 2009, at 7:05 AM, sri.lu...@free.fr wrote:

> There is still a small problem though... With that method, every
> service with obsess_over_service enabled will execute the command,
> and every command defined as ocsp_command in nagios.cfg will be
> executed by services enabling obsess_over_service :s.
>
> Well... don't know if there is a better way and it works so I keep it
> for now...

Well, since you are writing the OCSP command script, you should be able to have the script itself filter for the hosts you want, i.e. if $HOSTNAME$ is hostA or hostB, etc., then run the script; otherwise, do nothing. Of course, that isn't exactly how you would write it: I don't know off the top of my head how you pass the host name to the script (although I'm sure you can), and that is not valid code anyway, but hopefully you get the idea. On the other hand, that might be more trouble than it is worth :)

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

> - Original Mail -
> From: "Marc Powell"
> To: "Nagios Users"
> Sent: Thursday 12 February 2009 16:51:14 GMT +01:00 Amsterdam /
> Berlin / Bern / Rome / Stockholm / Vienna
> Subject: Re: [Nagios-users] Notification on OK-state
>
> On Feb 12, 2009, at 8:45 AM, sri.lu...@free.fr wrote:
>
>> Thank you very much for the lead. It helped me a lot.
>>
>> I followed your advice and I used the ocsp_command directive from
>> nagios.cfg with a script printing the external command
>> SEND_CUSTOM_SVC_NOTIFICATION into the file nagios.cmd, and it works
>> fine now.
>
> ... and that is an interesting trick as well ;) I wouldn't have
> thought to send the notification that way.
>
> --
> Marc
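The host filter described above could look roughly like this in Python. This is only a sketch under assumptions: it supposes the ocsp_command definition passes $HOSTNAME$ and $SERVICEDESC$ to the script as arguments, the host names are the placeholder ones from the message, and the author and comment fields ("ocsp", "Service state update") are invented for illustration.

```python
import time

# Hypothetical host names; only these hosts trigger the custom notification.
WATCHED_HOSTS = {"hostA", "hostB"}


def build_external_command(host, service, now=None):
    """Return a SEND_CUSTOM_SVC_NOTIFICATION line, or None to do nothing."""
    if host not in WATCHED_HOSTS:
        return None
    timestamp = int(time.time() if now is None else now)
    # Fields: host;service;options;author;comment (options 0 = no special flags).
    return "[%d] SEND_CUSTOM_SVC_NOTIFICATION;%s;%s;0;ocsp;Service state update\n" % (
        timestamp, host, service)


def submit(command_file, host, service):
    """Write the command to the Nagios external command file, if any."""
    line = build_external_command(host, service)
    if line is not None:
        with open(command_file, "w") as handle:
            handle.write(line)
```

Every service with obsess_over_service enabled still runs the script, as noted in the thread; the filter just makes the script a no-op for hosts that are not listed.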
Re: [Nagios-users] nagios and check_snmp
On Jan 8, 2009, at 4:40 AM, Marc Powell wrote:

> On Jan 7, 2009, at 9:28 PM, Chris St. Pierre wrote:
>
>> You'd have to track the rate of change in ifInOctets
>> if you wanted to get bits per second, and that'd be a much more
>> complicated plugin.
>
> For the OP, plugins that do this are available on nagiosexchange.org.
>
> --
> Marc

Or use MRTG. While this would be another piece of software to set up, this is exactly what it is designed for. You can either have MRTG do its own alerting, or (as I am doing) use one of the Nagios plugins that checks the MRTG log files. The benefit of doing things this way is that you get the nice pretty MRTG graphs (not that you couldn't jigger Nagios to produce them as well, but I don't know how hard that would be), while keeping a single point of monitoring/alerting.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
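For reference, the "rate of change in ifInOctets" calculation mentioned in the quoted text comes down to the arithmetic below. This is a sketch of the formula only, not of any plugin from nagiosexchange.org; the function name is made up, and a real plugin would also have to store the previous sample between runs.

```python
COUNTER32_MODULUS = 2 ** 32  # ifInOctets is a 32-bit SNMP counter


def bits_per_second(previous_octets, current_octets, interval_seconds):
    """Average inbound rate between two ifInOctets samples."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    # Modulo arithmetic handles a single counter wrap between samples.
    delta_octets = (current_octets - previous_octets) % COUNTER32_MODULUS
    return delta_octets * 8 / interval_seconds
```

On a fast link a 32-bit counter can wrap more than once between polls, in which case the result is too low; that bookkeeping is part of what MRTG already takes care of.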
Re: [Nagios-users] best way to analyse traffic on a network-device
Why not just use MRTG, if you need graphs like the ones it makes? Or am I misunderstanding the question? If you want to monitor the values in Nagios, there are plugins in the standard distribution that check MRTG log files for the current traffic values.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

On Dec 16, 2008, at 4:12 AM, Ralf Prengel wrote:

> Hello,
>
> What is the best way to analyse traffic on a network device?
> I need graphs with average values in Kb like the ones MRTG builds.
> Using SNMP and/or new plugins is possible.
>
> --
> Ralf Prengel
> Customer Care Manager
> Comline AG
Re: [Nagios-users] check for crash?
On Nov 21, 2008, at 10:07 AM, Fernando Rocha wrote:

> Hi Israel,
>
> I think it is hard to identify this issue with check_process because,
> as you said, the process keeps running and just stops working.
>
> How do you know when it crashes?
> Are there any log entries?
> Does it give any errors on the web page?

The applet in question is a status display screen. I know it has crashed because a) the data displayed is out of date, and b) it doesn't respond to user input. Since it is a passive monitoring system, there is no log written. No error is displayed; it just stops updating properly.

I guess what I was hoping for was some indication I could get from the process list or the like that I simply didn't know to look for, rather than an application-specific check. Something along the lines of how you can check for a "zombie" process by looking at the output of ps. From the responses I have gotten so far, however, this doesn't appear to be the case. Thanks anyway!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

> --
> Fernando Rocha
> Support Analyst - Operations
> OpServices - Porto Alegre - RS - Brasil
[Nagios-users] check for crash?
I'm wondering if anyone can give me suggestions as to how to monitor a specific process to make sure it keeps running properly. The specific issue is that I have a Java Web Start applet running on OpenBSD 4.4 that, from time to time, randomly crashes. I can't just use check_process (I don't think so, at least) because when it "crashes" the process never actually exits; it just stops working. I have to go in and manually kill it before I can restart it. Does anyone have any thoughts as to how I could check for this situation?

I realize this may not strictly be a Nagios question, and I apologize to anyone who is bothered by that, but since I would be doing the monitoring with Nagios this seemed the most logical place I could think of to ask. Thanks!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] Pingging with check_fping
On Nov 21, 2008, at 4:14 AM, Yu Watanabe wrote:

> Hello all
>
> I found a plugin, check_fping, on the Nagios Exchange that pings
> each host.
>
> http://www.nagiosexchange.org/cgi-bin/page.cgi?g=Detailed%2F1709.html;d=1
>
> However, the timeout value and the thresholds seem not to be
> working properly.
>
> Are there any ping plugins that can satisfy the following criteria?
>
> 1. Able to specify the actual timeout value of ping
> 2. Able to specify the count of ICMP packets to be sent
> 3. Able to specify the interval time between the ICMP packets
> 4. Able to specify the thresholds of rta and pl
> 5. If at least one ping packet is sent and received OK the plugin
>    returns 0
>
> It would be a great help if someone could give me some advice.
>
> Thank you
> Yu Watanabe

You might look at check_icmp. It has command line switches for requirements 1-4, and if you make a symlink to it named check_host, it behaves as specified in criterion #5.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] Add hostgroup to servicegroup?
On Nov 7, 2008, at 8:18 AM, Sean Carolan wrote:

> Why can't you add an entire hostgroup to a servicegroup? As it stands I
> would have to maintain an entire list of hosts for each service group,
> something I'm not keen on doing. It would be so much easier if we
> could just drop the hostgroup name into the servicegroup definition
> and be done with it. Is this planned for a future version of Nagios?

Well, you can at least add an entire hostgroup to a service using the hostgroup_name directive in the service definition. Perhaps this would work? Granted, you would still need to add the hostgroup to each individual service in the servicegroup, but I would think this would still be better than adding and maintaining an entire list of hosts for each servicegroup.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] checkhostalive vs ping service
Once we went to Nagios 3, I switched to doing all pings as host checks, with no ping service checks, because of the logic of it. When I am pinging a host, I am checking to see if the host is up, i.e. a host check. "ping" isn't some service running on the host that I am checking; it is a check of the host itself. So from a logic standpoint, to me at least, it made more sense to just do the host checks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

On Oct 31, 2008, at 2:10 AM, L B wrote:

> Hello all,
>
> What is the difference between checking that a host is alive with a
> check in the host template (check_command check-host-alive), and
> creating a ping service applied to the host?
> I did a test with both configurations, and didn't see any difference
> in behavior.
>
> I thought it was a question of dependencies: if a service fails, a
> host check is done to check if the host is up. But if one of the
> services (ping or another) fails, a host check is done anyway even if
> my host checks are disabled (check_interval 0), which is correct
> behavior for me.
>
> My configuration doesn't have host checks because it was imported
> from Nagios 2.x, and disabling host checks was an optimization setting
> I read about somewhere (I also read that it's not necessary anymore
> with Nagios 3). Now I'm running Nagios 3, so I'm wondering if it would
> make sense to disable the ping service checks and do the ping as a
> host check. Why should or shouldn't I do this?
>
> Thanks!
> --
> L.B.
Re: [Nagios-users] check_mysql
On Oct 15, 2008, at 11:00 AM, jeremiah wrote:

> I want Nagios to automatically check MySQL services on my servers.
> However, when I view the web interface I see that the status is UNKNOWN
> and it displays the usage for check_mysql. In what file am I supposed
> to put the arguments for this command?

The arguments go in your service definition on the check_command line, separated from each other and from the command name by "!". So in your case, you would have a check_command line something like:

    check_command    check_mysql!<port>!<user>!<password>

where <port>, <user> and <password> are, respectively, whatever you need for the -P, -u and -p switches to check_mysql.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

> Here is what I have so far in commands.cfg:
>
> define command{
>         command_name    check_mysql
>         command_line    $USER1$/check_mysql -H $HOSTADDRESS$ -P $ARG1$ -u $ARG2$ -p $ARG3$
>         }
>
> and my services.cfg file...
>
> define service{
>         use                     generic-service   ; Name of service template
>         hostgroup_name          mysql_servers
>         service_description     MYSQL
>         is_volatile             0
>         check_period            24x7
>         max_check_attempts      3
>         normal_check_interval   3
>         retry_check_interval    1
>         contact_groups          admins
>         notification_interval   30
>         notification_period     24x7
>         notification_options    w,u,c,r
>         check_command           check_mysql
>         }
>
> Thanks!
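To make the "!" convention concrete, here is a small Python illustration of how the pieces of a check_command value line up with the $ARGn$ macros in the command definition. It is a toy model, not Nagios's own code, and the port, user name and password are invented values.

```python
def expand_check_command(check_command, command_line):
    """Substitute $ARGn$ macros the way the '!' separator implies."""
    # The first field is the command name; the rest become $ARG1$, $ARG2$, ...
    name, *args = check_command.split("!")
    expanded = command_line
    for index, value in enumerate(args, start=1):
        expanded = expanded.replace("$ARG%d$" % index, value)
    return name, expanded


# Hypothetical values for illustration only.
name, line = expand_check_command(
    "check_mysql!3306!nagios!secret",
    "$USER1$/check_mysql -H $HOSTADDRESS$ -P $ARG1$ -u $ARG2$ -p $ARG3$",
)
print(name)
print(line)
```

With a bare "check_mysql" and nothing after it, there is no value to put in place of $ARG1$ through $ARG3$, so the plugin is called without usable values for -P, -u and -p, which would explain why it answers with its usage text.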
[Nagios-users] NAGVIS minimal broker options
In trying to implement NagVis on my system, I have been running into performance issues where the NDO daemon and MySQL consume vast amounts of processor power, essentially bringing Nagios and everything else to a halt until they complete. This is a problem (I believe) with MySQL, and as such can't really be helped directly (except by moving the SQL server to another machine). However, I was thinking it may be possible to minimize the impact by telling Nagios to broker less than everything, on the assumption that Nagios is sending some data to the database that NagVis doesn't actually need. Is this, in fact, the case? If so, what are the minimal broker options needed to make NagVis happy? Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] Nagvis alternatives
On Sep 5, 2008, at 2:23 AM, Alex Dehaini wrote:

> Thanks Matthew,
>
> Does nexsm display the change on the map when a device goes down? Are
> there any screen shots I can view?

Yes, nexsm shows a live view of the status of your devices, using different colored backgrounds to indicate device/service status. There are a couple of screen shots available at

http://www.nagiosexchange.org/cgi-bin/jump.cgi?ID=2137&view=File1;d=1

and

http://www.nagiosexchange.org/cgi-bin/jump.cgi?ID=2137&view=File2;d=1

In the first one, the ssh host in the lan group has a problem; note the red text and background. When I was using it, I had all the host icons be 1-pixel dots, with a significantly larger dot as a background image, so basically all you saw for each host was a red (critical), yellow (warning/unreachable/etc.) or green (OK) dot. For our purposes, I found this approach made more sense than trying to fit a bunch of large icons onto the screen.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

> Lex
>
> On Fri, Sep 5, 2008 at 10:22 AM, Matthew Macdonald-Wallace wrote:
>
>> On Fri, 5 Sep 2008 10:18:19, "Alex Dehaini" wrote:
>>
>>> Basically the mapping feature - Nagios does the rest. I am happy with
>>> Nagios but Nagios maps are crap. NagVis allows you to display hosts
>>> and services graphically.
>>
>> OK, cool. In that case, I believe that your best bet at the moment is
>> nexsm as previously discussed; however, if I find anything else, I'll
>> post it here!
>>
>> Cheers,
>>
>> M.
>> --
>> Matthew Macdonald-Wallace
>> http://www.truthisfreedom.org.uk
>
> --
> Alex Dehaini
> Developer Site - www.alexdehaini.com
Re: [Nagios-users] Nagvis alternatives
On Sep 3, 2008, at 1:25 AM, Alex Dehaini wrote:

> Have you used it in a production environment before? What are the
> disadvantages and advantages over NagVis?

Yeah, we were running nexsm on our network of about 300 devices - not huge, but not tiny.

The main advantage that I found was ease of configuration. nexsm starts with the map fully populated with all network devices and links, and you can simply drag the devices around to where you want them. NagVis starts off with an empty map, and you have to manually place every device AND every link. If you are just trying to populate a server room or something, this is fine, and probably even easier than the nexsm approach (where you would have to tell it to hide all the unwanted hosts), but when you are trying to map 300 or so devices, plus links, it quickly becomes prohibitively time consuming. And while you can reposition devices in NagVis by dragging (although I have had no end of trouble getting that to work), they aren't attached to their links, so you have to manually reposition BOTH the device and any links to or from it. This isn't an issue with nexsm.

I also found the display in nexsm easier to use. You can easily zoom in and out to see more or less of the map (if you can do this in NagVis, I haven't discovered how), and the map can be as large as you want; NagVis (from what I can tell) is limited to the size of the background image you choose. That said, I do prefer the icon sets in NagVis; nexsm uses the image you set in the Nagios config.

Finally, I prefer nexsm over NagVis for performance reasons, although admittedly I suspect this is due to MySQL, not NagVis itself. On the system I am using, I have found that when Nagios is dumping data to the database, MySQL uses pretty much all available processing power, and Nagios hangs waiting for it to complete. nexsm just reads the Nagios logs directly, and while it is Java, which uses a fair amount of processor, it's not as bad as MySQL, at least on my OS. Also, when running NagVis, Nagios has a tendency to stop updating completely every day or two, and can't be restarted until I kill the ndo2db daemon. I never had issues like that with nexsm, probably because it was just passively reading the log files rather than using an active event broker in Nagios.

nexsm also just "feels" simpler to me. NagVis consists of an event broker module, the ndo2db daemon, the MySQL server (although that might already be running somewhere for something else) and the web front end. nexsm is "just" a single Java applet.

Keep in mind, though, that my opinions may be at least partially based on incomplete or incorrect knowledge of NagVis. While I used nexsm extensively until we moved to Nagios 3, I have never actually managed to get NagVis running acceptably. The three separate occasions I tried it have all been non-starters. The last time, I got it running for a couple of days, but between the performance issues mentioned above and never really being able to get the maps configured the way I wanted, we quickly took it offline. It is also entirely possible that my desired usage (a full-network overview) is better suited to nexsm than NagVis, and a different usage might work better with NagVis. But so far, for us, NagVis has proven not to be suitable.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---

> Lex
Re: [Nagios-users] Nagvis alternatives
On Sep 2, 2008, at 8:34 AM, Alex Dehaini wrote:
> Hi Guys, Are there other projects like nagvis?

If you are still running nagios 2, nexsm ( http://nexsm.sf.net ) is an excellent option, one that I highly preferred over nagvis for a number of reasons. Doesn't work with nagios 3, however. I don't know how hard it would be to fix (I suspect it is just a log parsing issue, but I don't know), but it is written in java, which I don't know, and the developer hasn't had time to work on it lately. Oh well.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> -- Alex Dehaini Developer Site - www.alexdehaini.com Email - [EMAIL PROTECTED]
Re: [Nagios-users] nagvis ideas?
On Aug 19, 2008, at 8:29 AM, Paul Weaver wrote:
>> It seems to me that there are 2 basic ways to "view" our conglomeration of systems. One is geographic, and the 2nd is from a network topology point of view. Frankly I think a nagvis presentation of both would be useful. The network topology version is fairly straightforward (I think, but I would like to hear others' point of view on this). The geographic version however, frankly I am struggling with. If I show the systems on the large area map, they are fairly spread out. I am thinking of doing that, and making dynamic links for each cluster. These links would be color coded based upon the status of all of the computers in that cluster, and would provide a link to a more detailed view.
>>
>> I'd love to hear what other people's thoughts on this are.
>
> Our old monitoring solution was a product called solarwinds, the main view was a map which had little icons for various machines (it didn't monitor a lot more, great for monitoring all the interfaces on the network, but useless for system monitoring)
>
> I felt the map was pointless. If something did break, you would get a little red dot in the corner. When I implemented Nagios I decided to ignore nagvis for a bit.
>
> The only "map" I can see being of use would be something like http://www.nagvis.org/sites/default/files/screenshots/c_by_dave_rearden_2.png
>
> If you had a monitor hanging in the apps room. but even then, we rarely have hardware problems, and if we do, the machine will have a flashing red led. And even if we didn't, they're all labelled.
>
> If you are truly physically separated (not just different rooms in the same building, on the same power feed) http://www.nagvis.org/sites/default/files/screenshots/nagvis_map_2.png
>
> Could show you areas where there are invading tanks knocking out your infrastructure, but a network topology might be better.
>
> (Of course if you use nagios to monitor your country's defence systems, a physical map might be a good way of doing it)
>
> 1 of the 3 nagios installations I know about here have a default screen of "service problems". Our installation has a custom-written screen which highlights problems affecting users. The other isn't a real nagios installation, it was provided by an external company -- it barely monitors hosts, and only does about 20 anyway, the statusmap is fine for that.
>
> I'd be interested to know how many people find "maps" helpful for anything other than impressing management.

Well, in my company impressing management (or random guests who walk through) is an entirely legitimate reason for having some sort of network map ;) From a more practical standpoint, however, having a visual map can help tell at a glance where the problem lies - for example, do we just have one computer in Nome down, or is our entire Nome station having problems? The answer to this question lets us know where to start looking for a solution, and a visual map tells us at a glance how widespread an issue is by the number of red blotches in a given area. It also shows at a glance that it is Nome, and not Barrow, that is having problems (since they are geographically separated). If all we have is a text list of "Service Problems" we have to go over to it (or take the time to pull it up on our local computers) to figure out where the problem is, and how widespread. A quick glance in the general direction will only tell us "something" is down, not where or how widespread.

That said, I have yet to manage to get Nagvis to work well enough for us to be worth the time to set it up. In order to get a decent map out of it (in my experience, at least) you have to set it up manually, adding each host and link individually, which when you have hundreds of hosts would take quite a while.
[Rant] The automap feature of the newest version I tried worked decently, but did not appear to be customizable (which I definitely want), and also did not appear to show all the hosts I was monitoring. I need something that automatically populates the map with your hosts and links, but allows you to then customize the populated map to your heart's content- modifying an already populated map takes way less time than creating one from scratch. Something like nexsm (nexsm.sf.net) but that works with nagios 3. [/Rant] --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---
Re: [Nagios-users] Nagios 3 visual map
On May 20, 2008, at 12:12 PM, Hendrik Bäcker wrote:
> Hi,
>
> Israel Brewster wrote:
>> I did look at nagvis a while back, but at least at that time it would have required me to go through a very time-consuming process of placing each host and each link on the layout manually, and then adjusting the coordinates of each host and link in a configuration file to make it look good. This process would have been prohibitively time-consuming. I need something that can automatically place the hosts and links on the map, and simply allow me to drag the hosts around to position them. Thanks for any suggestions anyone might have to offer!
>
> well... give NagVis a second chance - it now has a "Automap" feature.

Ok, tried that, and it is certainly closer to what I am looking for than it was. Still not quite there though. Actually, the first thing that happened is that it completely hosed my OpenBSD install - I couldn't run nano or the Nagios CGIs. I think, however, that was actually the fault of the graphviz install rather than nagvis itself.

As far as Nagvis goes, the automap feature looks good, with a couple of exceptions. First (and most importantly) it doesn't show all my hosts. It behaves as though there is a maximum parent depth it will display or something. I'm not really sure what's going on, but significant portions of my network are not displayed. Secondly, it does not appear to be editable. As I mentioned, I am looking for something where I can simply drag and drop hosts to position them. As far as I can tell, this is still not an option in nagvis, even with the standard maps.

So, any way to solve these issues with nagvis? Or perhaps some other front-end that hasn't been mentioned yet? Thanks.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> http://www.nagvis.org/_media/nagvis-1.3-automap_undirected-1.png
>
> Regards,
> Hendrik
Re: [Nagios-users] Nagios 3 visual map
On May 20, 2008, at 12:12 PM, Hendrik Bäcker wrote:
> Hi,
>
> Israel Brewster wrote:
>> I did look at nagvis a while back, but at least at that time it would have required me to go through a very time-consuming process of placing each host and each link on the layout manually, and then adjusting the coordinates of each host and link in a configuration file to make it look good. This process would have been prohibitively time-consuming. I need something that can automatically place the hosts and links on the map, and simply allow me to drag the hosts around to position them. Thanks for any suggestions anyone might have to offer!
>
> well... give NagVis a second chance - it now has a "Automap" feature.
>
> http://www.nagvis.org/_media/nagvis-1.3-automap_undirected-1.png

That sounds promising. I'll give it a shot.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> Regards,
> Hendrik
[Nagios-users] Nagios 3 visual map
I posted this question about a month and a half ago, but didn't get any replies. I'm hoping that now that Nagios 3 has been out a bit longer, there might be more options.

My company has been using nagios 2.x for some time, with the add-on Nexsm for displaying a visual overview of the network "at-a-glance". It was also useful for showing visitors (and management) what our network was like, cleanly and easily. Since Nagios 3 was released, and we migrated over to it (for a number of reasons), Nexsm no longer works. The author says he'd like to add nagios 3 support, but he doesn't have the time at the moment and may not anytime soon. So I am looking for another option. Any suggestions?

I did look at nagvis a while back, but at least at that time it would have required me to go through a very time-consuming process of placing each host and each link on the layout manually, and then adjusting the coordinates of each host and link in a configuration file to make it look good. This process would have been prohibitively time-consuming. I need something that can automatically place the hosts and links on the map, and simply allow me to drag the hosts around to position them. Thanks for any suggestions anyone might have to offer!

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---
Re: [Nagios-users] nagios + trap snmptt
On May 13, 2008, at 12:06 AM, Ange AMBEMOU wrote:
> hello, i want to know if it is possible to have a web interface to log a trap of snmptt, and see this in the web interface of nagios. thanks for your help.

This might help: http://www.samag.com/documents/s=9559/sam0503g/

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---
Re: [Nagios-users] Hostgroup definition
On May 12, 2008, at 7:46 AM, Jay R. Ashworth wrote:
> On Mon, May 12, 2008 at 05:03:17PM +0200, Andreas Ericsson wrote:
>> Not really, no, but the hostgroup needs to be defined somewhere. When it is, you can do something like the following
>>
>> define host {
>>     use         template_with_all_required_variables
>>     hostgroups  hostgroup1,hostgroup2,hostgroup4,hostgroupn
>> }
>
> It doesn't cause any confusion, statistical or otherwise, to put a host in more than one group, does it?

Nope. I do this for pretty much all of my hosts, actually. For example, all printers are in a "Printer" host group, to associate services as well as group all printers together. Additionally, I have a host group for each location we have machines in, so a printer in Barrow would be in both the printer hostgroup (for the services) and the "Barrow" hostgroup (for the location). Makes it easy to find all the machines in Barrow, as well as all the printers.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> Cheers,
> -- jra
> --
> Jay R. Ashworth        Baylink                    [EMAIL PROTECTED]
> Designer               The Things I Think         RFC 2100
> Ashworth & Associates  http://baylink.pitas.com   '87 e24
> St Petersburg FL USA   http://photo.imageinc.us   +1 727 647 1274
>
> Those who cast the vote decide nothing.
> Those who count the vote decide everything.
> -- (Joseph Stalin)
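As a rough sketch of the layout described in the reply above: the group names, host name, address, and the generic-printer template below are illustrative assumptions, not taken from the thread. Each hostgroup is defined once, the host lists both of them, and a service attached to the printer group then applies to every printer regardless of location.

```
# Hypothetical names throughout; adjust to the local naming scheme.
define hostgroup {
    hostgroup_name  printers            ; groups hosts by function
    alias           All Printers
}

define hostgroup {
    hostgroup_name  barrow              ; groups hosts by location
    alias           Barrow Station
}

define host {
    use         generic-printer         ; assumed template supplying the required host variables
    host_name   barrow-printer-1        ; hypothetical host
    address     192.0.2.10              ; documentation-range address, not a real device
    hostgroups  printers,barrow         ; one host, two groups
}

# A service tied to the function group is inherited by every printer,
# whichever location group it also belongs to.
define service {
    use                  generic-service
    hostgroup_name       printers
    service_description  PING
    check_command        check_ping!100.0,20%!500.0,60%
}
```

With this arrangement the "barrow" group answers "what do we have in Barrow?" while the "printers" group carries the service definitions, which matches the split described above.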
Re: [Nagios-users] Fwd: MRTG Graf
On Apr 30, 2008, at 6:21 PM, Devinder Singh wrote:
> Hi
> How do i get the following log file 192.168.1.253_1.log
> in which directory is that log file found
> I only get a log file in var/log.mrtg.log
> Pls advise
> Devinder

The location that MRTG uses for your log files is defined in your mrtg.cfg file, specifically the "WorkDir:" directive. Keep in mind, these files are not created until MRTG has been run successfully. If they don't exist, and you have run MRTG, it would make sense to look at the /var/log/mrtg.log file to see if maybe MRTG isn't running properly, or is generating some error.

I would HIGHLY recommend that you read through http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html , and perhaps some of the other associated MRTG documentation, to get a better understanding of how MRTG works and how to set it up - this stuff is all explained in there. Also, the MRTG mailing lists would be a better place for questions regarding setting up and using MRTG. After that, if you still have issues specifically with integrating the MRTG data with nagios, this would be an appropriate place to ask those questions.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> define service{
>     use                  generic-service ; Inherit values from a template
>     host_name            linksys-srw224p
>     service_description  Port 1 Bandwidth Usage
>     check_command        check_local_mrtgtraf!/var/lib/mrtg/192.168.1.253_1.log!AVG!100,200!500,500!10
> }
>
>>> -- Forwarded message --
>>> From: Devinder Singh <[EMAIL PROTECTED]>
>>> Date: 2008/4/30
>>> Subject: MRTG Graf
>>> To: [EMAIL PROTECTED]
>>>
>>> Hi
>>> I have problems in getting bandwidth statistics using MRTG.
>>>
>>> define service{
>>>     use                  generic-service ; Inherit values from a template
>>>     host_name            D-Link
>>>     service_description  Port 1 Bandwidth Usage
>>>     check_command        check_local_mrtgtraf!/home/mrtg/cfg/mrtg.cfg!AVG!100,100!500,500!10
>>> }
>>>
>>> I tried to run the above command and see the file is outdated. i have ran the cfgmaker program and stored the mrtg.cfg in /home/mrtg/cfg/mrtg.cfg
>>
>> Oh, I should also point out that unless you are using a non-standard mrtgtraf plugin, the file specified in your configuration above should be the .log file generated by mrtg, not the config file.
>>
>> --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---
>>
>>> Please advise
>>> -- Devinder
>
> -- Devinder
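A minimal sketch of how the pieces in the exchange above fit together. It assumes MRTG's WorkDir is /var/lib/mrtg and that check_local_mrtgtraf is defined as in the sample commands.cfg shipped with Nagios (log file, AVG or MAX, warning pair, critical pair, expiry in minutes); a locally modified command may take its arguments differently.

```
# Assumed line in mrtg.cfg (MRTG's own config, shown here only as a comment):
#     WorkDir: /var/lib/mrtg
# MRTG writes one <target>.log file per monitored interface into that
# directory, but only after it has completed a successful run.

define service {
    use                  generic-service
    host_name            linksys-srw224p
    service_description  Port 1 Bandwidth Usage
    # $ARG1$ = MRTG .log file (not mrtg.cfg)
    # $ARG2$ = AVG or MAX
    # $ARG3$ = warning thresholds, incoming,outgoing
    # $ARG4$ = critical thresholds, incoming,outgoing
    # $ARG5$ = minutes after which the log is treated as stale
    check_command        check_local_mrtgtraf!/var/lib/mrtg/192.168.1.253_1.log!AVG!100,200!500,500!10
}
```

Pointing the first argument at mrtg.cfg instead of the .log file would explain a report that the file is outdated, since the config file is not rewritten on each MRTG run.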
Re: [Nagios-users] Fwd: MRTG Graf
On Apr 29, 2008, at 8:11 PM, Devinder Singh wrote:
> -- Forwarded message --
> From: Devinder Singh <[EMAIL PROTECTED]>
> Date: 2008/4/30
> Subject: MRTG Graf
> To: [EMAIL PROTECTED]
>
> Hi
> I have problems in getting bandwidth statistics using MRTG.
>
> define service{
>     use                  generic-service ; Inherit values from a template
>     host_name            D-Link
>     service_description  Port 1 Bandwidth Usage
>     check_command        check_local_mrtgtraf!/home/mrtg/cfg/mrtg.cfg!AVG!100,100!500,500!10
> }
>
> I tried to run the above command and see the file is outdated. i have ran the cfgmaker program and stored the mrtg.cfg in /home/mrtg/cfg/mrtg.cfg

Oh, I should also point out that unless you are using a non-standard mrtgtraf plugin, the file specified in your configuration above should be the .log file generated by mrtg, not the config file.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> Please advise
> -- Devinder
Re: [Nagios-users] Fwd: MRTG Graf
On Apr 29, 2008, at 8:11 PM, Devinder Singh wrote:
> -- Forwarded message --
> From: Devinder Singh <[EMAIL PROTECTED]>
> Date: 2008/4/30
> Subject: MRTG Graf
> To: [EMAIL PROTECTED]
>
> Hi
> I have problems in getting bandwidth statistics using MRTG.
>
> define service{
>     use                  generic-service ; Inherit values from a template
>     host_name            D-Link
>     service_description  Port 1 Bandwidth Usage
>     check_command        check_local_mrtgtraf!/home/mrtg/cfg/mrtg.cfg!AVG!100,100!500,500!10
> }
>
> I tried to run the above command and see the file is outdated. i have ran the cfgmaker program and stored the mrtg.cfg in /home/mrtg/cfg/mrtg.cfg

Is MRTG running? Just making the config isn't good enough - you do actually have to run MRTG. Typically, this is done with a crontab entry, so it runs every 5 minutes or so. See http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> Please advise
> -- Devinder
[Nagios-users] Down/unreachable confusion
I have a host being monitored in nagios 3.0 that fluctuates between being down and unreachable. The periods where it is listed as unreachable typically only last a few seconds to a minute (I didn't even think it was checking that often, but whatever), while the periods of "down" time (the host really is down) are several hours in duration. Naturally, every time it fluctuates back to down (only up and down alerts are enabled for this host, not unreachable), it sends out an alert, which gets a little annoying.

This would all make sense, except for one thing: this host's parent, and all other parents back up the tree to nagios, are listed as being solidly up as far back as the records go. So why is it briefly switching to an "unreachable" state, and how can I make it stop? Or at the very least, how can I make it stop sending alerts every time it switches between these two critical states? Thanks.

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 11:32 AM, David Wallis wrote:
> I have the same situation with some devices. I just configure a "ping" service for them, and then move on to more interesting problems. When there's such a simple solution, it sometimes doesn't pay to overthink the problem.

Yeah, I'm seeing that. Just didn't think it would be such a big issue :-D I was actually expecting either a simple solution, or a simple "it can't be done". Unfortunately, being the stubborn type, I tend to beat these things into the ground :-P :-D

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> --
> David Wallis
> Information Technology
> Advanced Photon Source
> Argonne National Laboratory
> 630.252.7375
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 11:32 AM, Marc Powell wrote:
>> -Original Message-
>> From: [EMAIL PROTECTED] [mailto:nagios-users-[EMAIL PROTECTED] On Behalf Of Israel Brewster
>> Sent: Wednesday, April 16, 2008 2:20 PM
>> To: Ford, Andy
>> Cc: Nagios Users Mailinglist
>> Subject: Re: [Nagios-users] Hosts w/o services
>>
>> On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote:
>
>> So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply.
>
> In such cases the typical action is to configure the ping as a service check and no, a single ping or check_dummy, as a host check. Whether you're using parenting or not determines which of those you choose.

Thanks :-) Yeah, that's how I had it when I was running 2.x. It just seemed more intuitively obvious to go with host checks once 3.0 made that a reasonable option, since it was the host state, not a service state, that we were monitoring. That's ok though. I was just wondering if there was a better way. Apparently not :-) Thanks again everyone!

--- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ---

> --
> Marc
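For reference, the "ping as a service check" arrangement discussed above might look roughly like the sketch below. The host name is hypothetical, and it assumes the check_ping command and generic-service template from the sample Nagios configuration are in place.

```
# Hypothetical client machine with nothing else worth monitoring.
define service {
    use                  generic-service
    host_name            frontdesk-pc        ; illustrative name, not from the thread
    service_description  PING
    # warning at 100 ms round trip or 20% loss, critical at 500 ms or 60% loss
    check_command        check_ping!100.0,20%!500.0,60%
}
```

This gives each otherwise service-less host one entry on the service pages, which is the trade-off the thread settles on.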
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 11:25 AM, Jay R. Ashworth wrote:

> On Wed, Apr 16, 2008 at 11:20:29AM -0800, Israel Brewster wrote:
>> To some extent, yes, but not necessarily a service we can monitor.
>
> There are cases that fit this description, but I'm not sure if the
> examples you provide are.
>
>> For example, printers.
>
> Telnet to 9100 and look for a banner. Or the equivalent for non
> JetDirect printers.

OK, that one might be of some use, in the case where the printer is responding to pings (host up) but not accepting connections (service down). Thanks for the suggestion, I'll look into it.

>> Even some more essential devices such as switches...
>
> And moving traffic; you might find it useful to ping-check other things
> on that switch. Or at least ssh-check the switch controller itself.

Yep, we are: all the hosts connected to the switch are checked. That's not a service check on the switch, though; that's host checks on devices connected to it. Trying to monitor something like this directly on the switch as a service check would just be redundant. It might save a few seconds, but if you see all the hosts on the far side of a switch go down while the switch is still up, it should be fairly obvious that the switch is having problems :-D

>> Not to mention the rather large category of client machines
>
> Sure. But for workstations, you're not monitoring diskspace? Open TCP
> listens (to watch for trojans)

Well, we could, but for some places this sort of thorough monitoring is overkill.

>> So yeah, while this may not be the way nagios is designed to work, and
>> may never be (which I can live with if so), I really don't see this as
>> being all that unusual a situation, as some responses seem to imply.
>
> Well, it's not that it's unusual, I think; I believe the assertion
> being made is that only doing a ping is not the Best Practice.

I guess I see your point, in that there is typically something on any device which could be monitored as a service. One could also argue that if it could be an issue (such as disk space on a client machine, or open TCP listens), it should be monitored. These are valid arguments. However, I would argue that cluttering up your Nagios display with services that either a) are fringe cases which would be quickly noticed anyway, or b) you really don't care about, not to mention the additional network, monitoring server, and client load this imposes, is not best practice. Best Practice, in my mind at least, is Keep It Simple. Just monitor what you need to in order to provide the best service you can to your clients.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> Cheers,
> -- jra
> --
> Jay R. Ashworth
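A port check along the lines Jay suggests could be sketched roughly as follows. This is only an illustration under stated assumptions: the host name `office-printer` and the command name `check_tcp_port` are made up, `generic-service` is assumed to be a template that already exists in the local configuration, and `$USER1$` is assumed to point at the plugin directory. The sketch only verifies that port 9100 accepts a TCP connection; it does not look for a banner.

```cfg
# Hypothetical command wrapping the standard check_tcp plugin.
define command{
    command_name    check_tcp_port
    command_line    $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$
    }

# Hypothetical service: is the printer's raw-print port (9100) accepting connections?
define service{
    use                     generic-service     ; assumed local template
    host_name               office-printer      ; placeholder host name
    service_description     Raw print port 9100
    check_command           check_tcp_port!9100
    }
```

With something like this in place, a printer that still answers pings but refuses print connections would show up as host UP with a CRITICAL service, which is the case discussed above.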
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote:

> There must be some reason for each server or device to be on the
> network, it must provide some service, even if it's just a lowly(?) sshd.
> Do yourself and your organization a favor and monitor that reason!

To some extent, yes, but not necessarily a service we can monitor. There are quite a few devices on the network whose failure we need to know about, but which aren't offering any separately monitorable "service", per se.

For example, printers. We need to know if one dies, so we can fix it before it becomes a problem (not to mention that fixing the boss's printer before he even knows it has a problem makes you look good ;-) ), but other than the simple ping host check, they don't offer any "services" we can monitor.

Even some more essential devices such as switches can fall into this category, as they are just routing traffic, not offering any "services". I could, of course, go crazy and use check_snmp or something to monitor each port on said switch as a service, but that is way overkill for our needs. We just need to know that the switch is there and functioning, i.e. a host check.

Not to mention the rather large category of client machines, which we need to know are running so they can be backed up and, of course, used for whatever the user needs. It's kind of difficult to monitor the ability to run Office or an e-mail client :-). We still want to monitor the host, though, so that if/when a client machine dies we can hopefully fix it before the user (who may well come in before us in the morning) is impacted. We could, of course, monitor something like ssh on those machines, but why? All we (and the user) care about is that the machine is functioning.

So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply.

---
Israel Brewster

>> -----Original Message-----
>> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Israel Brewster
>> Sent: Wednesday, April 16, 2008 1:01 PM
>> To: Nagios Users Mailinglist
>> Subject: [Nagios-users] Hosts w/o services
>>
>> Running nagios 3.0, I have set up a number (most) of my hosts without
>> services, since all I am interested in monitoring is the hosts
>> themselves. However, this is causing a number of little annoyances,
>> such as the hosts not showing up when I type their name into the
>> nagios sidebar search, throwing numerous warnings if I do a verify
>> from the command line, giving me a blank page when I click the
>> hostname link from the hostgroup overview page, etc. Is there any way
>> around these issues other than adding a check_dummy service check to
>> each host? Thanks.
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 10:21 AM, Paul Weaver wrote:

> You can use check_ping -- I assume the boxes are pingable, and you want
> to check they are pingable, even if they do nothing else that can be
> monitored.
>
> If you don't want to check they work via ping, why are you even
> modelling them?

I AM checking them via ping, in the host check. It just doesn't make sense to ping both as a service check and as a host check. Since the ping is actually determining the host state, not the state of a service, it makes more sense to use the ping as the host check rather than as a separate service.

---
Israel Brewster

>> -----Original Message-----
>> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Israel Brewster
>> Sent: 16 April 2008 19:01
>> To: Nagios Users Mailinglist
>> Subject: [Nagios-users] Hosts w/o services
>>
>> Running nagios 3.0, I have set up a number (most) of my hosts without
>> services, since all I am interested in monitoring is the hosts
>> themselves. However, this is causing a number of little annoyances,
>> such as the hosts not showing up when I type their name into the
>> nagios sidebar search, throwing numerous warnings if I do a verify
>> from the command line, giving me a blank page when I click the
>> hostname link from the hostgroup overview page, etc. Is there any way
>> around these issues other than adding a check_dummy service check to
>> each host? Thanks.
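The arrangement described in this reply, with the ping living in the host check and no separate ping service, might look something like the sketch below. The host name, alias and address are placeholders, and `generic-host` is assumed to be a local template that supplies the remaining required directives (check period, contacts, notification settings). The `check-host-alive` command line follows the form commonly found in the sample configuration, but treat the exact thresholds as an assumption.

```cfg
# Ping-based host check command (thresholds are illustrative).
define command{
    command_name    check-host-alive
    command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5
    }

# Host monitored by ping only; no service definitions reference it.
define host{
    use                 generic-host        ; assumed local template
    host_name           boss-printer        ; placeholder
    alias               Printer in the front office
    address             192.0.2.10          ; placeholder address
    check_command       check-host-alive
    max_check_attempts  3
    }
```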
[Nagios-users] Hosts w/o services
Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as:

- the hosts not showing up when I type their name into the nagios sidebar search,
- numerous warnings if I do a verify from the command line,
- a blank page when I click the hostname link from the hostgroup overview page, etc.

Is there any way around these issues other than adding a check_dummy service check to each host? Thanks.

---
Israel Brewster
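For reference, the check_dummy workaround mentioned in the question would look roughly like this. It is a sketch only: the command name `check_dummy_ok`, the host name `client-pc-01` and the `generic-service` template are assumptions, not taken from the poster's configuration. The plugin simply returns the state given as its first argument (0 = OK) along with the optional text.

```cfg
# Command that always reports OK via the check_dummy plugin.
define command{
    command_name    check_dummy_ok
    command_line    $USER1$/check_dummy 0 "Placeholder service"
    }

# Placeholder service so the host has at least one service attached.
define service{
    use                     generic-service     ; assumed local template
    host_name               client-pc-01        ; placeholder host name
    service_description     Placeholder
    check_command           check_dummy_ok
    notifications_enabled   0                   ; nothing useful to notify about
    }
```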
Re: [Nagios-users] Nagios 3 external commands problem
On Apr 10, 2008, at 10:28 AM, Frost, Mark {PBG} wrote:

> Can't say I can be a lot of help with the status map. I never found it
> to be workable in any form, so I just don't use it.

Same here, which is why I'm looking for a better option. But that's another thread :)

> When I look at my own id in cgi.cfg, I'm listed in
>
> authorized_for_system_information=
> authorized_for_configuration_information=
> authorized_for_system_commands=
> authorized_for_all_hosts=
> authorized_for_all_services=
> authorized_for_all_service_commands=
> authorized_for_all_host_commands=
>
> and I have no problems doing anything (but I should have all these
> rights anyway). I've never heard that you could use a wildcard for
> these fields.

OK, found the problem. Totally my fault: I had missed a couple of directives in the cgi.cfg file. Your response clued me in to that fact :P Sorry to waste you guys' time.

---
Israel Brewster

> Mark
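Filled in, the set of directives Mark lists might look like the excerpt below. The user name `nagiosadmin` is a placeholder for whatever name the web server authenticates, and `use_authentication=1` is assumed to be set already. The thread does not say which directives were actually missing; the two `*_commands` lines for hosts and services are a plausible guess, since they govern commands issued against hosts and services from the web interface.

```cfg
# cgi.cfg (excerpt); "nagiosadmin" is a placeholder user name
use_authentication=1

authorized_for_system_information=nagiosadmin
authorized_for_configuration_information=nagiosadmin
authorized_for_system_commands=nagiosadmin
authorized_for_all_hosts=nagiosadmin
authorized_for_all_services=nagiosadmin
authorized_for_all_service_commands=nagiosadmin
authorized_for_all_host_commands=nagiosadmin
```

Several users can be given as a comma-separated list on each line.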
[Nagios-users] Nagios 3 external commands problem
I apologize for asking what I had thought was a FAQ, but I can't seem to find the needed info now. At any rate, I just finished updating my nagios install to version 3 (OpenBSD 4.2 OS), and aside from the lack of a decent status map (no one has any suggestions there? Really?) it's working, with the exception of external commands from the web interface. For some reason, whenever I attempt an external command from the web, I get the following error:

    Sorry, but you are not authorized to commit the specified command.
    Read the section of the documentation that deals with authentication
    and authorization in the CGIs for more information.
    Return from whence you came

I have the following in my cgi.cfg file:

    authorized_for_configuration_information=admin
    authorized_for_system_commands=*

I actually don't want the "*" there, but I stuck it in just to see if it fixed the issue. It didn't. I am authenticated to the web server, as I got the login/password prompt, and nagios recognizes the fact in the little header on every page that says "logged in as". The /var/nagios/rw/nagios.cmd file exists with permissions prw-rw---- 1 _nagios www, and my apache error logs show no errors when attempting to run the command. What am I missing? Thanks!

---
Israel Brewster
Re: [Nagios-users] Nagios 3 status map options?
No replies? Am I the only one who wants to use a graphical network overview with Nagios 3? Or is it just that everyone else uses Nagvis?

---
Israel Brewster

On Apr 2, 2008, at 9:57 AM, Israel Brewster wrote:

> I have been using Nexsm with nagios 2 for some time now, and have been
> reasonably happy with it. Unfortunately, I now want to update to
> nagios 3 for a number of reasons, but Nexsm doesn't support Nagios 3
> and the developer says he probably won't have time to update it for a
> while. Therefore, I am wondering what alternatives might be out there
> that do work with Nagios 3?
>
> I tried Nagvis a while ago (while still on 2, haven't tried the latest
> version or on 3), but it proved to be too much of a pain to get
> configured the way I needed, as I would have had to add each host and
> link between hosts manually, as well as the fact that at least with
> the version I was using, the only way to move a host was to edit the
> coordinates manually. So what other good and relatively easy to
> configure options are there? Thanks for any suggestions!
Re: [Nagios-users] Too stupid? Services are available, but nagios reports host to be down!
On Apr 8, 2008, at 2:50 AM, Heiko Schlittermann wrote:

> Hello,
>
> (using 3.0.1)
>
> I've a list of hosts, these hosts are not available for ping, but normal
> service checks (SSH, SMTP, ...) work. Nagios reports these hosts as being
> down! Ugly!
>
> If I remember well, older nagios versions "knew" that's enough to see
> one service on a host to know this host has to be up.

To a degree, yes. If you aren't actively checking the host (as would appear to be the case from your next paragraph), then as long as all services on the host are listed as OK, Nagios assumes the host is still OK (at least once running; I don't know how it behaves on the initial check). However, should any of the services go into a non-OK state, Nagios will immediately check the host (using the host check_command), whereupon, in your case, it would determine the host to be down since it can't ping it. The state of the other services does not affect this process, so the other services do not change state.

> The host check_command is the normal 'check-host-alive' (which is
> pinging), the check_interval is 0 -- why does nagios want to check that
> host?

Because at some point one or more of the services went into a non-OK state.

> The check_command is inherited from some template, if I try to override
> it with no value, nagios complains:
>
> Error: Host check command '(null)' specified for host 'diwi/diw' is
> not defined anywhere

Yep, you can't have an empty value in the check_command directive. If you just want to assume the host is up all the time, you can use the check_dummy plugin (after defining a check_dummy command in your checkcommands.cfg, naturally). Otherwise you'll need to figure out some check Nagios can perform to determine if the host is running, even if that check is just checking one of the services again or something.

---
Israel Brewster

> So - please, could anybody point to my stupidity?
>
> Thanks.
>
> Best regards from Dresden
> Heiko Schlittermann
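The two options in the reply above could be sketched as follows. Everything named here is hypothetical: the command names `check_host_assume_up` and `check_host_by_ssh`, the host name `no-ping-host`, the address, and the `generic-host` template are assumptions for illustration. The first command always reports UP via check_dummy; the second reuses an SSH probe as the host check, for hosts that block ICMP but accept SSH.

```cfg
# Option 1: always consider the host UP (check_dummy returns state 0).
define command{
    command_name    check_host_assume_up
    command_line    $USER1$/check_dummy 0 "Host assumed up"
    }

# Option 2: treat a successful SSH connection as proof the host is up.
define command{
    command_name    check_host_by_ssh
    command_line    $USER1$/check_ssh $HOSTADDRESS$
    }

define host{
    use                 generic-host        ; assumed local template
    host_name           no-ping-host        ; placeholder
    alias               Host behind an ICMP-blocking firewall
    address             192.0.2.20          ; placeholder address
    check_command       check_host_by_ssh   ; or check_host_assume_up
    max_check_attempts  3
    }
```

Option 2 keeps the host state meaningful: if the SSH probe fails too, Nagios can still report the host as DOWN rather than blindly UP.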
[Nagios-users] Nagios 3 status map options?
I have been using Nexsm with nagios 2 for some time now, and have been reasonably happy with it. Unfortunately, I now want to update to nagios 3 for a number of reasons, but Nexsm doesn't support Nagios 3 and the developer says he probably won't have time to update it for a while. Therefore, I am wondering what alternatives might be out there that do work with Nagios 3?

I tried Nagvis a while ago (while still on 2, haven't tried the latest version or on 3), but it proved to be too much of a pain to get configured the way I needed, as I would have had to add each host and link between hosts manually, as well as the fact that at least with the version I was using, the only way to move a host was to edit the coordinates manually. So what other good and relatively easy to configure options are there? Thanks for any suggestions!

---
Israel Brewster
Re: [Nagios-users] Unwanted re-notifications
On Mar 19, 2008, at 8:58 AM, Jim Avery wrote:

> On 17/03/2008, Israel Brewster <[EMAIL PROTECTED]> wrote:
>> I have a service that keeps sending me notifications of the fact that
>> it is critical every half hour or so, even though the notification
>> interval for the service is set to zero.
>
> That's strange. It could be that the nagios daemon didn't restart
> properly when you changed the config? It might be worth stopping
> Nagios, making sure it's really stopped then starting it again.
> Failing that I'm not sure. You could I guess get that behaviour if
> you had an escalation defined on that service but I think that's
> unlikely.

Thanks for the response. After some further digging, I managed to track down the problem, although the why and the how-to-fix are still a mystery. Apparently, every so often the service switches from the expected "CRITICAL" state to a "WARNING" state, with a status message similar to "PING WARNING - DUPLICATES FOUND! Packet loss = 6%, RTA = 574.20 ms". Since I had warning notifications disabled for this service, I never got that e-mail, but when it switched back to a "CRITICAL" state on the next check, I did get the critical message. So I guess Nagios is working as it should, and the remaining mystery is why I am getting that response from the ping when the router port is disabled. Thanks again!

---
Israel Brewster

> Cheers,
>
> Jim
> --
> Web Admin for Corsham Diary http://corshamdiary.org.uk
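The notification settings discussed in this thread correspond to a service definition along these lines. This is a sketch with placeholder names (`wan-router`, `Ping WAN port`, `generic-service`) and illustrative thresholds; only `notification_interval 0` and the disabled warning notifications come from the thread. With `w` left out of `notification_options`, a CRITICAL to WARNING to CRITICAL sequence produces a fresh CRITICAL notification each time, while the WARNING in between stays silent.

```cfg
# Ping command in the form used by the sample configuration.
define command{
    command_name    check_ping
    command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
    }

define service{
    use                     generic-service     ; assumed local template
    host_name               wan-router          ; placeholder
    service_description     Ping WAN port
    check_command           check_ping!100.0,20%!500.0,60%
    notification_interval   0                   ; no re-notification while the state is unchanged
    notification_options    c,r                 ; critical and recovery only; add "w" to see the WARNING flips
    }
```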
Re: [Nagios-users] Dual-port monitoring
On Mar 17, 2008, at 1:30 PM, Cook, Garry wrote:

> Well, maybe you can use check_cluster as the host check? It sounds
> logical to me, but I’ve never used that check so I’m not really sure.
> Also, check_icmp has the ability to PING multiple addresses. Perhaps
> using this as your host check against both IPs would give you the
> results you’re looking for…

The good: check_icmp appears to give the desired ok/critical behavior when used with multiple hosts. check_cluster also seems to work, and perhaps somewhat more efficiently, since it just uses the results from the service checks rather than re-checking. Of course, this may not be desired.

The bad: check_icmp (with default settings, at least) shows 60% packet loss to the device, even though a concurrently running ping shows no packet loss. Setting the max packet interval to something high (at least 3 seconds, although even that isn't high enough sometimes) seems to help, but slows down the check. Setting a lower number of pings (1 or 2) also works, which may be acceptable for a host check, as the default is just a single ping anyway.

The ugly: creating a second service for the second WAN port seems to ignore the "no re-notifications" setting, thereby flooding my inbox with critical messages every so often (variable timing, usually about every half hour). I started a separate thread on that issue though.

Thanks for the suggestions. It looks like check_icmp as the host check is going to be the best one of the lot. Once I get it working properly :)

---
Israel Brewster

> Thanks,
> Garry
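A host check that pings both addresses with check_icmp, as settled on above, might be sketched like this. The command name `check_icmp_any`, the host name, the `generic-host` template and the two addresses are placeholders. The sketch assumes a plugin release whose check_icmp supports `-m` (number of targets that must be alive for an OK result); `check_icmp --help` will confirm. `-n 2` sends two packets per target, in line with the lower ping count mentioned in the message.

```cfg
# Ping the host's primary address plus one extra address; report OK
# if at least one of them responds (-m 1).
define command{
    command_name    check_icmp_any
    command_line    $USER1$/check_icmp -m 1 -n 2 -H $HOSTADDRESS$ -H $ARG1$
    }

define host{
    use                 generic-host            ; assumed local template
    host_name           dual-wan-router         ; placeholder
    alias               Router with two WAN ports
    address             10.1.1.100              ; first port (placeholder)
    check_command       check_icmp_any!10.1.1.101
    max_check_attempts  3
    }
```

Per-port ping services can still be defined alongside this, so that a single failed port raises a service alert while the host itself stays UP.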
Re: [Nagios-users] Dual-port monitoring
On Mar 17, 2008, at 12:06 PM, Cook, Garry wrote:

> Yes, check_ifstatus is a service associated with the hosts. It checks
> interfaces only. The host check is a check_icmp against the host
> address, which is the loopback. If any of the interfaces are down,
> check_ifstatus will report this, but the host does not show as down as
> long as the loopback responds to a PING, which it will if any of the
> interfaces are up.

OK, I see what you are saying, and it does sound as though it would work as I desire. Unfortunately, for the devices in question, it doesn't look as though this will be an option, as I don't think I can use loopback addresses (they are primarily Linksys RV082 devices). Thanks anyway.

---
Israel Brewster

> Thanks,
> Garry
>
> From: Israel Brewster [mailto:[EMAIL PROTECTED]]
> Sent: Monday, March 17, 2008 1:46 PM
> To: Cook, Garry
> Cc: Nagios Users Mailinglist
> Subject: Re: [Nagios-users] Dual-port monitoring
>
> On Mar 17, 2008, at 11:03 AM, Cook, Garry wrote:
>
>> Do you have the ability to use loopback addresses on these devices?
>> If so, that would be used as the host address, and then you could use
>> other checks for the interfaces. For instance, all of my Cisco routers
>> have a loopback setup. I then use ‘check_ifstatus’ to check the
>> individual interfaces.
>>
>> Thanks,
>> Garry
>
> I assume the check_ifstatus checks for the interfaces are services
> associated with the host? If so, then what do you use for your host
> check? How do you get the host to show as down if all the interfaces
> are down, but not if one or both (or more, if you have more than two,
> although that is not the case with any of our devices) are up?
>
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Israel Brewster
> Sent: Monday, March 17, 2008 12:41 PM
> To: Nagios Users Mailinglist
> Subject: Re: [Nagios-users] Dual-port monitoring
>
> On Mar 17, 2008, at 9:03 AM, Gary Every wrote:
>
>> In your services file:
>>
>> define service {
>>     use                  generic-service
>>     name                 Ping
>>     host_name            multi_homed_server_name
>>     service_description  Ping Second IFace
>>     check_command        check_ping2!10.1.1.100!3000,10%!1,20%
>>     contact_groups       Unix
>> }
>>
>> for your checkcommands:
>>
>> define command{
>>     command_name  check_ping2
>>     command_line  /usr/lib/nagios/plugins/check_ping -H $ARG1$ -w $ARG2$ -c $ARG3$ -p 5
>> }
>>
>> check_ping2 adds the first ARG as the ip address that you want
>> monitored - This means that the HOSTNAME will remain the same, but a
>> diff interface will be checked.
>
> Thanks for the response. If I understand your reply correctly, though,
> this just adds a second service for the host, with a separate IP from
> that specified in the host directive. It does not, however, allow me to
> monitor the HOST on two separate IPs, taking into account that if
> either interface is up, the host is up. Perhaps my question wasn't
> quite clear enough. Take, for example, the following situation:
>
> - Host a has two network interfaces: a primary with an IP of
>   10.1.1.100 and a secondary with an IP of 10.1.1.101.
> - Host a can be reached through either interface.
> - Host a has its host address defined as the primary, 10.1.1.100,
>   with a check command of check_ping.
> - Host a also has a service defined as above, checking the secondary
>   interface (10.1.1.101).
>
> If the secondary interface goes down, then everything is fine: nagios
> notices that the service for the secondary interface is critical,
> checks the host on the primary, sees that as being fine, and sends an
> alert about the service (secondary port), while leaving the host in an
> OK state. This is exactly as it should be.
>
> However, now imagine that the primary interface goes down instead.
> Nagios attempts to check the host on the primary interface, which
> fails (since the interface is down), and therefore nagios alerts that
> the host is down, and any devices behind it are unreachable. This,
> however, is incorrect: since the secondary interface is still connected
> and fully functional, the host and everything behind it is still up
> and reachable.
>
> What I want is a situation where Nagios will send me an alert if either
> interface of a host is down, but only consider the host to be down if
> BOTH interfaces on the host are down. As I explained in my original
> posting, I have considered a couple of options that may accomplish
> this, but neither seems as elegant as I would like. Thanks for any
> thoughts!
Re: [Nagios-users] Dual-port monitoring
On Mar 17, 2008, at 12:00 PM, Marcel wrote:

> If you had a round-robin DNS setup to reach that particular host, you would be fine tweaking a check_host_alive command to tolerate a high loss percentage while still treating the host as reachable. But you would rely on DNS name resolution to do that.
>
> HTH,
> Marcel

Right, unfortunately we aren't set up quite that fancy. We simply have two separate DNS entries for the two ports, and when we can't reach the device on one, we use the other. The dual connections are more for outgoing traffic than incoming, so that devices behind the device will still have a route to the outside world should one route die (load balancing and failover), but Nagios is on the outside looking in.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Dual-port monitoring
On Mar 17, 2008, at 11:03 AM, Cook, Garry wrote:

> Do you have the ability to use loopback addresses on these devices? If so, that would be used as the host address, and then you could use other checks for the interfaces. For instance, all of my Cisco routers have a loopback set up. I then use 'check_ifstatus' to check the individual interfaces.
>
> Thanks,
> Garry

I assume the check_ifstatus checks for the interfaces are services associated with the host? If so, what do you use for your host check? How do you get the host to show as down if all the interfaces are down, but not if one or both (or more, if you have more than two, although that is not the case with any of our devices) are up?

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
[Nagios-users] Unwanted re-notifications
I have a service that keeps sending me notifications that it is critical every half hour or so, even though the notification interval for the service is set to zero. In this particular case, the port being monitored is temporarily disabled, so there is no chance that the service is coming up briefly and then going back down, thus triggering the notification. Looking at the Nagios page for the service, however, I notice that it says "Current Notification Number: 7", which would imply (to me at least) that re-notification is enabled.

What am I missing here? How can I get it to stop sending me messages? I can, of course, acknowledge the down state or put it in scheduled downtime, but I don't want it to send more than the initial notification regardless. Thanks!

The service definition is as follows:

define service{
    use                     generic service
    host_name               ffsbrw-rtr00
    service_description     WAN 2
    notification_interval   0
    check_command           check_custom_ping!xx.xxx.xxx.xxx!1200.0,20%!1600.0,50%
    normal_check_interval   1
    max_check_attempts      5
    servicegroups           Satelite Services
    notification_options    u,c,r
}

and the "generic service" template it inherits from is as follows (yes, much of it came from the sample files):

define service{
    name                            generic service ; The 'name' of this service template
    active_checks_enabled           1       ; Active service checks are enabled
    passive_checks_enabled          1       ; Passive service checks are enabled/accepted
    parallelize_check               1       ; Active service checks should be parallelized (disabling this can lead to major performance problems)
    obsess_over_service             1       ; We should obsess over this service (if necessary)
    check_freshness                 0       ; Default is to NOT check service 'freshness'
    notifications_enabled           1       ; Service notifications are enabled
    event_handler_enabled           1       ; Service event handler is enabled
    flap_detection_enabled          1       ; Flap detection is enabled
    failure_prediction_enabled      1       ; Failure prediction is enabled
    process_perf_data               1       ; Process performance data
    retain_status_information       1       ; Retain status information across program restarts
    retain_nonstatus_information    1       ; Retain non-status information across program restarts
    is_volatile                     0
    check_period                    24x7
    max_check_attempts              3
    normal_check_interval           5
    retry_check_interval            1
    contact_groups                  admins2
    notification_options            n
    notification_interval           0
    notification_period             24x7
    servicegroups                   z Ping Services
    register                        0       ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!
}

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Dual-port monitoring
On Mar 17, 2008, at 9:03 AM, Gary Every wrote:

> In your services file:
>
> define service {
>     use                   generic-service
>     name                  Ping
>     host_name             multi_homed_server_name
>     service_description   Ping Second IFace
>     check_command         check_ping2!10.1.1.100!3000,10%!1,20%
>     contact_groups        Unix
> }
>
> for your checkcommands:
>
> define command{
>     command_name   check_ping2
>     command_line   /usr/lib/nagios/plugins/check_ping -H $ARG1$ -w $ARG2$ -c $ARG3$ -p 5
> }
>
> check_ping2 takes the first ARG as the IP address that you want monitored. This means that the HOSTNAME will remain the same, but a different interface will be checked.

Thanks for the response. If I understand your reply correctly, though, this just adds a second service for the host, with a separate IP from the one specified in the host directive. It does not, however, allow me to monitor the HOST on two separate IPs, taking into account that if either interface is up, the host is up. Perhaps my question wasn't quite clear enough. Take, for example, the following situation:

- Host a has two network interfaces: a primary with an IP of 10.1.1.100 and a secondary with an IP of 10.1.1.101.
- Host a can be reached through either interface.
- Host a has its host address defined as the primary, 10.1.1.100, with a check command of check_ping.
- Host a also has a service defined as above, checking the secondary interface (10.1.1.101).

If the secondary interface goes down, then everything is fine: Nagios notices that the service for the secondary interface is critical, checks the host on the primary, sees that it is fine, and sends an alert about the service (secondary port), while leaving the host in an OK state. This is exactly as it should be.

However, now imagine that the primary interface goes down instead. Nagios attempts to check the host on the primary interface, which fails (since the interface is down), and therefore Nagios alerts that the host is down and that any devices behind it are unreachable. This, however, is incorrect: since the secondary interface is still connected and fully functional, the host and everything behind it are still up and reachable.

What I want is a situation where Nagios will send me an alert if either interface of a host is down, but only consider the host to be down if BOTH interfaces on the host are down. As I explained in my original posting, I have considered a couple of options that may accomplish this, but neither seems as elegant as I would like. Thanks for any thoughts!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
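The "host is down only if BOTH interfaces are down" rule described in this message could be sketched as a small wrapper script used as the host check. This is only an illustration under stated assumptions, not something proposed in the thread: the function names combine_status and check_dual_host, the PING_PLUGIN path, and the thresholds are all hypothetical choices.

```shell
#!/bin/sh
# Sketch only: treat a dual-homed host as UP when at least one address answers.
# PING_PLUGIN and the thresholds below are assumptions; adjust for your install.
PING_PLUGIN="${PING_PLUGIN:-/usr/lib/nagios/plugins/check_ping}"

# combine_status RC1 RC2
# Prints a status line and returns a Nagios-style exit code.
# A plugin exit code of 0 (OK) or 1 (WARNING) counts as "interface reachable".
combine_status() {
    if [ "$1" -le 1 ] || [ "$2" -le 1 ]; then
        echo "OK - at least one interface is reachable"
        return 0
    fi
    echo "CRITICAL - neither interface is reachable"
    return 2
}

# check_dual_host ADDR1 ADDR2
# Pings both addresses with check_ping and combines the two results.
check_dual_host() {
    rc1=0
    "$PING_PLUGIN" -H "$1" -w 3000.0,80% -c 5000.0,100% -p 5 >/dev/null 2>&1 || rc1=$?
    rc2=0
    "$PING_PLUGIN" -H "$2" -w 3000.0,80% -c 5000.0,100% -p 5 >/dev/null 2>&1 || rc2=$?
    combine_status "$rc1" "$rc2"
}
```

With something like this as the host's check command (both addresses passed in as arguments), the per-interface services would still alert individually, while the host itself would only go down when neither address responds.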
[Nagios-users] Dual-port monitoring
I have a number of devices on my network that have dual interfaces, such that either interface can go down while the device itself, and all the devices behind it (in the case of a router), are still up and reachable. What is the best way to set up monitoring of such devices, considering that, as far as I can tell, you can only assign one IP per host? I have looked at check_cluster, but if that is the best method I am unsure of the best way of implementing it. Things I have thought of:

1) Create two "dummy" hosts which are the two ports, and then a third host with a check command of check_cluster that looks at those two dummy hosts. This seems overly complicated, not to mention cluttering up my configs and the Nagios interface with three hosts where there is really only one.

2) Simply monitor the two ports as two separate hosts. This could work, but again it clutters things up with multiple hosts where only one exists, and adds the requirement of multi-parenting any child devices, which can get ugly, especially in a graphical representation of the network.

I have read http://nagios.sourceforge.net/docs/2_0/clusters.html, but that doesn't seem to apply, at least not directly, in that it refers either to situations where you have redundant services on one or more hosts (perhaps that is sort of what I want?) or to situations where you have multiple redundant hosts, but not to the situation where you have one host offering one service over two ports. The main problem with that doc, though, is that the cluster is always a service, which wouldn't work in my situation, as the host itself is a cluster.

Any thoughts? Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] check_rrdtraf
On Feb 17, 2008, at 8:57 AM, Marc Powell wrote:

> On Feb 16, 2008, at 4:32 AM, Pili Muñoz Gargallo wrote:
>
>> Hi Israel,
>> Well that is what i get
>>
>> ./check_rrdtraf -f /var/www/html/cfg/62.81.189.117_1.rrd -c 5000,5000 -w 3000,3000 -vv
>> Using RRD file: /var/www/html/cfg/62.81.189.117_1.rrd
>> Input warning level(kb/s): 3000
>> Output warning level(kb/s): 3000
>> Input critical level (kb/s): 5000
>> Output critical level (kb/s): 5000
>> Fetching data with command: rrdtool fetch /var/www/html/cfg/62.81.189.117_1.rrd AVERAGE -s-10minutes | grep -vi "nan"
>> RRD File Data:
>>             ds0 ds1
>>
>> 1203155100: 4.4908022742e+04 6.8003888963e+04
>> Raw Input Traffic Value (b/s): 32,00
>> Raw Output Traffic Value (b/s): 48,00
>> Decimal Input Traffic Value (b/s): 32,00
>> Decimal Output Traffic Value (b/s): 48,00
>> ./check_rrdtraf: line 248: [: 32,00: integer expression expected
>> ./check_rrdtraf: line 251: [: 32,00: integer expression expected
>> ./check_rrdtraf: line 254: [: 32,00: integer expression expected
>> ./check_rrdtraf: line 263: [: 48,00: integer expression expected
>> ./check_rrdtraf: line 266: [: 48,00: integer expression expected
>> ./check_rrdtraf: line 269: [: 48,00: integer expression expected
>
> Seems like a regional number-formatting issue. Your OS LC_NUMERIC settings specify a number format that uses ',' in place of '.' and the script (or perl?) isn't internationalized to understand that. Would that be the case? I'm not too familiar with i18n so can't be of much help.
>
> --
> Marc

OK, yeah, that would make sense. It is a shell script, and no, it isn't internationalized, partly because I didn't think of that when writing it and partly because I'm not entirely sure how. I suppose I could just do a simple string search and replace on the values to replace any commas with periods, but I don't really know if this would be the best or right way of doing it. Do any of the more experienced programmers out there have any suggestions?

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
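The comma-to-period replacement floated in this message might look roughly like the sketch below. It is not taken from check_rrdtraf itself: the helper name to_integer is hypothetical, and truncating to a whole number is an assumption made so that the value can be passed to the shell's integer `[ ... -gt ... ]` tests that produced the "integer expression expected" errors. Another commonly used approach, also an assumption here rather than something confirmed in the thread, is to export LC_NUMERIC=C near the top of the script so that numbers are formatted with a period in the first place.

```shell
#!/bin/sh
# Sketch only: normalize a locale-formatted number such as "32,00" so it can
# be used in an integer comparison. to_integer is a hypothetical helper name.

# to_integer VALUE
# Accepts "," or "." as the decimal separator and prints the integer part.
to_integer() {
    value=$(printf '%s' "$1" | tr ',' '.')   # "32,00" becomes "32.00"
    int_part=${value%%.*}                    # drop the fractional part
    printf '%s\n' "${int_part:-0}"           # ".5" would otherwise be empty
}
```

A comparison in the script could then be written along the lines of `[ "$(to_integer "$in_rate")" -gt "$warn" ]`, where in_rate and warn stand in for whatever variables the script actually uses.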
Re: [Nagios-users] check_rrdtraf
On Feb 15, 2008, at 2:11 AM, Pili Muñoz Gargallo wrote:

> Of course yes, PEBKAC... i have a new issue: if i execute the same command from command line i obtain:
>
> ./check_rrdtraf -f /var/www/html/cfg/62.81.189.118_1.rrd -c 500,500 -w 300,300
> ./check_rrdtraf: line 248: [: 8,00: integer expression expected
> ./check_rrdtraf: line 251: [: 8,00: integer expression expected
> ./check_rrdtraf: line 254: [: 8,00: integer expression expected
> ./check_rrdtraf: line 263: [: 32,00: integer expression expected
> ./check_rrdtraf: line 266: [: 32,00: integer expression expected
> ./check_rrdtraf: line 269: [: 32,00: integer expression expected
> (standard_in) 1: parse error
> (standard_in) 1: parse error
> (standard_in) 1: parse error
> (standard_in) 1: parse error
> OK - Current BW in: Gbps Out: Gbps(standard_in) 1: parse error
> (standard_in) 1: parse error
> |in=Gb/s;300;500 (standard_in) 1: parse error
> (standard_in) 1: parse error
> out=Gb/s;300;500
>
> so Is there another problem or it is just me?

OK, here's where that -vv switch can come in handy. Try again from the command line with -vv, and send the output you get. That should help me with the debugging.

As it is, the only thing I would note is to wonder if you really want the critical and warning values to be 3 Gbps and 5 Gbps. It's not completely unreasonable, of course, depending on what you are monitoring (I think there are devices that can get multi-gigabit speeds), but most devices never get anywhere near that kind of traffic. Regardless, that shouldn't cause the first errors you are seeing, as the lines referenced are just checking how big the input values are. Of course, if you ARE getting gigabit speeds, I suppose it's possible I'm getting an overflow somewhere. I've never tested with that level of traffic (as I don't have anything that generates it).

> Thanks again, and forgive me for these questions: Like i used to say, i am a student and i am newbie in linux, LAMP servers, Nagios, MRTG, RRDTool. and everything you imagine..
>
> Cheers
> Pili

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] check_rrdtraf
On Feb 14, 2008, at 1:48 PM, Pili Muñoz Gargallo wrote:

> hi all,
> thanks for the help. I have been trying several things that you told me. I'll give more information. I am monitoring bandwith with MRTG and storing datas in rrdtool format. If i run command from command line is
>
> ./check_rrdtraf -f ./var/www/html/cfg/62.81.189.118_1.rrd -l B -c 300,300 -w 500,500
> ./var/www/html/cfg/62.81.189.118_1.rrd does not exist.
>
> I am graphing with routers2.cgi and it receives values, and if i do :
>
> rrdtool fetch /var/www/html/cfg/62.81.189.118_1.rrd AVERAGE
> . . .
> 1203025200: 1.3963531752e+05 8.2446477796e+04
> 1203025500: 1.3719540269e+05 1.0068914358e+05
> 1203025800: 1.3673972419e+05 1.0818787703e+05
> 1203026100: 1.3729848910e+05 1.1104579469e+05
> 1203026400: 1.3708077389e+05 1.1129381874e+05
> 1203026700: 1.3590177194e+05 1.1081213980e+05
> 1203027000: 3.1652589163e+05 2.5986265329e+05
> 1203027300: 3.8436060720e+05 3.1651678017e+05
> 1203027600: 1.4309192557e+05 1.1708320591e+05
> 1203027900: nan nan
> 1203028200: nan nan
>
> i see it is storing values... what is wrong? I had read check_rrdtraf help and i have included -l B option to indicate that data are bits...but it is still not working

As Marc pointed out in his response, having ./var instead of /var in the path to the .rrd file is wrong. You want /var WITHOUT the period. Also, note that the -l switch only affects the display of the output. It is not needed in normal operation, unless you really want to see the output as bits rather than Kb, Mb, etc. The plugin will auto-choose the best option based on the data if you leave this switch out. But the main problem is the period before that slash. Remove it, and it should work.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> I obtain now:
>
> OK - Current BW in: bps Out: bps
>
> any more ideas? Thank you again for your patience!!!
> Pili
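The `rrdtool fetch` output quoted in this message ends in `nan` rows, which is why the plugin's debug output elsewhere in the thread pipes the fetch through `grep -vi "nan"`. A minimal sketch of picking the most recent usable sample is shown below; last_valid_row is a hypothetical helper name, and the awk filter is an assumption about the row layout (a timestamp followed by two data columns), not code taken from check_rrdtraf.

```shell
#!/bin/sh
# Sketch only: read `rrdtool fetch` output on stdin and print the last row
# that has a timestamp plus two numeric columns. Rows containing "nan"
# (no data recorded yet) and the "ds0 ds1" header line are skipped.
last_valid_row() {
    grep -vi 'nan' |
        awk 'NF == 3 && $1 ~ /^[0-9]+:$/ { row = $0 }
             END { if (row != "") print row }'
}
```

It could be fed with something like `rrdtool fetch /var/www/html/cfg/62.81.189.118_1.rrd AVERAGE | last_valid_row`, using the absolute path, since a leading `./` makes the path relative to whatever directory the check happens to run from.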
Re: [Nagios-users] check_rrdtraf
On Feb 13, 2008, at 6:34 AM, Marc Powell wrote:

>> -----Original Message-----
>> From: [EMAIL PROTECTED] [mailto:nagios-users-[EMAIL PROTECTED] On Behalf Of Pili Muñoz Gargallo
>> Sent: Wednesday, February 13, 2008 7:23 AM
>> To: nagios-users@lists.sourceforge.net
>> Subject: [Nagios-users] check_rrdtraf
>>
>> Hi
>> i am trying to measure bandwidth using check_rrd
>
> When asking about plugins that don't come with the distribution it's helpful to include where you obtained the plugin so everyone knows which check_rrd (or check_rrdtraf as the case may be) you're talking about.

That would be (partly) my plugin, obtained from Nagiosexchange.org, most likely :)

>> i use the command:
>> $USER1$/check_rrdtraf -f ./var/www/html/cfg/$HOSTADRESS$_1.rrd -l b -vv -c $ARG1$ -w $ARG2$
>
> You meant $HOSTADDRESS$ here, right?

And did you really want that period before the /var?

>> and in the service i give it the next arguments:
>> !300,300!500,500
>>
>> i have read about it several times, and it doesn't work how i expect: it returns me an OK value, but it doesn't tell me the present BW...
>> Am i missing something?
>
> When you run the above command as the nagios user from the command line, does the first line of output match what you're seeing in the GUI? I expect it starts with the word 'Using'. What is the -vv flag causing to happen when the program is run (hint, run check_rrdtraf --help to find out)?

The -vv is an issue, although there could be a couple of others (see previous comments about the formatting of the file string in the command). As that flag gives a lot of debugging output, and nagios only displays the first line of output, I wouldn't expect it to show the traffic values with this switch in there. What I would expect nagios to be showing is something along the lines of "Using RRD File: /var/www/html/cfg/$HOSTADRESS$_1.rrd" (I think that's the first line of debugging output I put in there), with $HOSTADDRESS$ replaced with the applicable value. Try without the -vv and see what happens.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Marc
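To illustrate the point about -vv: a minimal Python sketch, assuming (as described above) that nagios keeps only the first line of whatever the plugin prints. The sample strings are made-up placeholders, not real check_rrdtraf output, and the IP address is hypothetical.

```python
def status_line(plugin_output: str) -> str:
    """Return the first line of a plugin's output, which is the only
    part assumed to reach the status display."""
    lines = plugin_output.splitlines()
    return lines[0] if lines else ""


# Hypothetical output with -vv: the debugging line comes first.
verbose = (
    "Using RRD File: /var/www/html/cfg/192.0.2.10_1.rrd\n"
    "(more debugging lines)\n"
    "(traffic result line)"
)
# Hypothetical output without -vv: the result is the first line.
quiet = "(traffic result line)"

print(status_line(verbose))  # the debugging line hides the result
print(status_line(quiet))    # the result itself
```

With the verbose sample the display would carry the "Using RRD File" line and nothing else, which matches the symptom of an OK state with no bandwidth figure.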
Re: [Nagios-users] mrtgtraf help needed
On Feb 12, 2008, at 8:26 PM, John King wrote:

> Hi list
>
> ...[Hi All, I have the latest versions of Nagios, the plugins, net-snmp, net-snmp-utils all installed on a fedora 5 box. I've setup the monitoring of my windows and linux machines and all is well with those. I have or am trying to setup monitoring of my cisco switches and while I am getting green lights on the snmp and ping tests, none of the bandwidth (mrtg) checks are working. Error msg is "check_mrtgtraf: Unable to open MRTG log file "
>
> Any help would be greatly appreciated]...

Well, the obvious questions: is MRTG set up and functioning? You didn't mention installing it. If it is, then what is the command definition you have for the bandwidth checks? Do the log files they point to actually exist? From the error you are getting, if MRTG is installed and functioning, then the most likely problem is that your command definitions are looking in the wrong place for the MRTG log files.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> - John King (jkingsol1)
>
> ---
> The mailing list archive is found here:
> http://www.nagiosexchange.org/nagios-users.34.0.html
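The "do the log files actually exist?" question can be checked with a few lines of Python. This is only a sketch: the function name is made up for illustration, and the path you pass in should be whatever your own command definition hands to check_mrtgtraf. Run it as the nagios user, since a file that root can read may still be unreadable to nagios.

```python
import os


def diagnose_log(path: str) -> str:
    """Report why a log file might not be openable (hypothetical helper)."""
    if not os.path.exists(path):
        return "missing"
    if not os.path.isfile(path):
        return "not a regular file"
    if not os.access(path, os.R_OK):
        return "not readable by this user"
    if os.path.getsize(path) == 0:
        return "empty"
    return "ok"
```

For example, `print(diagnose_log("/path/from/your/command/definition.log"))` printing "missing" would point at the command definition looking in the wrong place, or at MRTG not having written any log yet.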
Re: [Nagios-users] Host check clarification
On Jan 14, 2008, at 1:27 PM, Sloane, Robert Raymond wrote:

> To start off with, there isn't any WARNING state for hosts, just UP, DOWN, PENDING or UNREACHABLE. Assume the host is OK to start off with. If the host check returns DOWN then the parent of the host will be checked to see if it is DOWN. If the parent is OK, then the host is marked as DOWN in a soft state, and if the parent is DOWN, then the host is UNREACHABLE. The host check will be repeated in rapid succession until the host is either UP or the retry count is reached. If any check returns the host as UP, then Nagios marks the host UP and goes to the next thing. If the retry count is reached, then the host is marked as DOWN in a hard state and the notification process is started.
>
> Does that help?

Yep, that answers it. Thanks!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Bob Sloane, University of Kansas Computer Center, Lawrence, KS, 66045
> Email:[EMAIL PROTECTED] http://www.ku.edu/home/sloane Phone:(785)864-0444
>
>> -----Original Message-----
>> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster
>> Sent: Monday, January 14, 2008 4:18 PM
>> To: Nagios Users mailinglist
>> Subject: [Nagios-users] Host check clarification
>>
>> I need some clarification as to how exactly the host check determines if a host is down. In the default configuration, the check-host-alive command sends a single ping, which can be considered (obviously) either ok, warning, or critical. The host then has a max_check_attempts setting of three, which if I understand things correctly means it will try this check twice more, in rapid succession, before putting the host into a hard state. My question is what is the logic if one of these three checks returns a different state than the other two? Say, for example, the first ping never makes it back, therefore returning a critical state, but the second and third come back, giving OK states? Is the host then considered up or down? Thanks!
>>
>> ---
>> Israel Brewster
>> Computer Support Technician
>> Frontier Flying Service Inc.
>> 5245 Airport Industrial Rd
>> Fairbanks, AK 99709
>> (907) 450-7250 x293
>> ---
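The retry logic Bob describes can be written out as a small Python sketch. This is a simplified model of his description only, not Nagios source code; the function name and the way UNREACHABLE is labelled soft or hard are assumptions made for illustration.

```python
from typing import Iterable


def evaluate_host(results: Iterable[bool],
                  max_check_attempts: int = 3,
                  parent_up: bool = True) -> str:
    """Model of the host check retries described above.

    results: successive check outcomes, True meaning the check came back UP.
    parent_up: whether the host's parent was found to be up.
    """
    # Per the description: parent OK -> DOWN, parent DOWN -> UNREACHABLE.
    problem = "DOWN" if parent_up else "UNREACHABLE"
    failures = 0
    for is_up in results:
        if is_up:
            # Any successful check marks the host UP and ends the retries.
            return "UP"
        failures += 1
        if failures >= max_check_attempts:
            # Retry count reached: hard state, notifications would start here.
            return problem + " (hard)"
    if failures == 0:
        return "UP"  # "Assume the host is OK to start off with."
    return problem + " (soft)"


# The case from the question: first ping lost, second and third come back.
print(evaluate_host([False, True, True]))    # UP
print(evaluate_host([False, False, False]))  # DOWN (hard)
```

In this model the order matters more than the count: one good reply anywhere before the retry limit is enough to leave the host UP.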
[Nagios-users] Host check clarification
I need some clarification as to how exactly the host check determines if a host is down. In the default configuration, the check-host-alive command sends a single ping, which can be considered (obviously) either ok, warning, or critical. The host then has a max_check_attempts setting of three, which if I understand things correctly means it will try this check twice more, in rapid succession, before putting the host into a hard state.

My question is what is the logic if one of these three checks returns a different state than the other two? Say, for example, the first ping never makes it back, therefore returning a critical state, but the second and third come back, giving OK states? Is the host then considered up or down? Thanks!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Delay notification/escalations
On Dec 15, 2007, at 12:24 AM, Andreas Ericsson wrote:

> Israel Brewster wrote:
>> I need some clarification of how nagios will respond to something I am trying to do. I have a number of hosts I am monitoring that I want nagios to note immediately (well, almost) if they go down, but I don't want to receive a notification unless they stay in a hard down state for more than, say, 5 minutes. Now if I understand things correctly, I can accomplish this using host escalations by setting the notification options on the host to n (none) and the notification interval to 5 minutes, and then setting up a host escalation with a first_notification directive of 2. Does this sound correct?
>>
>> I have a couple of concerns about this setup. From my reading of the documentation (correct me if I am wrong), it sounds as though escalations only change the contact_groups and notification_periods of the notifications. If I have the notification options of the host set to n, does the escalation override this? If not, and I set the host notification options to, say, d,r, then how do I prevent a notification being sent out immediately when the host goes down? Does the notification_interval directive even have any effect if the notification_options are set to none? The last thing I want to do is try making this change, and end up not being notified at all when a host goes down for an extended period of time. Thanks.
>
> You should be able to use "first_notification_delay" for this. I'm not sure which version it's included in, but 3.x will almost certainly have it.

Thanks for the info. That sounds like exactly what I am looking for, but from what I can tell it is not available in version 2.7 (which is what I am running). At any rate, I couldn't find it in the documentation. I plan on moving to 3 as soon after it is released as I can manage (and my boss allows), but for now I need a solution that works with 2.7.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> --
> Andreas Ericsson [EMAIL PROTECTED]
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
[Nagios-users] Delay notification/escalations
I need some clarification of how nagios will respond to something I am trying to do. I have a number of hosts I am monitoring that I want nagios to note immediately (well, almost) if they go down, but I don't want to receive a notification unless they stay in a hard down state for more than, say, 5 minutes. Now if I understand things correctly, I can accomplish this using host escalations by setting the notification options on the host to n (none) and the notification interval to 5 minutes, and then setting up a host escalation with a first_notification directive of 2. Does this sound correct?

I have a couple of concerns about this setup. From my reading of the documentation (correct me if I am wrong), it sounds as though escalations only change the contact_groups and notification_periods of the notifications. If I have the notification options of the host set to n, does the escalation override this? If not, and I set the host notification options to, say, d,r, then how do I prevent a notification being sent out immediately when the host goes down? Does the notification_interval directive even have any effect if the notification_options are set to none? The last thing I want to do is try making this change, and end up not being notified at all when a host goes down for an extended period of time. Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Monitoring Nagios
On Nov 28, 2007, at 4:52 AM, Mohr James wrote:

> Hi All!
>
> We had a problem the other day where the nagios process died and no one noticed it for over 30 minutes. Therefore, I was thinking about ways to monitor Nagios itself. One simple way would be a cronjob that checks if the nagios process is running and then uses yaps or something to send an SMS. But what about cases where the nagios process is running, but for whatever reason the messages are not being processed.
>
> I was thinking about setting up a service that is triggered by a cronjob and then sends a notification. This notification writes a flag file and every X minutes a cron job checks the age of the flag file. If the flag file is too old, it sends an SMS via yaps.
>
> Rather than re-invent the wheel, I was wondering if other people had already implemented something similar.
>
> Regards,
>
> Jim Mohr

For a simple solution that requires no additional software, I just use the check_nagios plugin that comes with nagios, and run it from a script/cron job. This not only checks to see if nagios has crashed, but also checks the age of the status file. If a problem is detected with either (and I have had both situations) the script then first attempts to restart nagios via a full stop/start, then checks again and e-mails me with the result. So far, this has always just been a status notification saying that it had restarted nagios, as the stop/start has worked as expected, but if for some reason that should fail it would notify me of this fact as well.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
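A rough Python sketch of the check / restart / re-check / notify flow described above, to be run from cron. It is not the actual script: every path, the 5 minute expiry, the init script location, and the use of print as a stand-in for e-mail are assumptions you would need to adjust. The check_nagios options used (-e, -F, -C) are the plugin's expiry, status file and process command options.

```python
import subprocess
from typing import Callable


def watchdog(check: Callable[[], bool],
             restart: Callable[[], None],
             notify: Callable[[str], None]) -> str:
    """Check nagios; on failure restart it, check again, and report."""
    if check():
        return "ok"
    restart()
    if check():
        notify("Nagios was unhealthy and has been restarted.")
        return "restarted"
    notify("Nagios is unhealthy and the restart FAILED.")
    return "failed"


def check_nagios() -> bool:
    # Assumed install paths; exit code 0 from the plugin means OK.
    cmd = [
        "/usr/local/nagios/libexec/check_nagios",
        "-e", "5",                                 # status file max age, minutes
        "-F", "/usr/local/nagios/var/status.dat",  # assumed status file path
        "-C", "/usr/local/nagios/bin/nagios",      # process to look for
    ]
    return subprocess.run(cmd, stdout=subprocess.DEVNULL).returncode == 0


def restart_nagios() -> None:
    # Full stop/start, assuming a SysV-style init script at this path.
    subprocess.run(["/etc/init.d/nagios", "stop"])
    subprocess.run(["/etc/init.d/nagios", "start"])


def main() -> None:
    # print is a placeholder; swap in whatever sends you mail or an SMS.
    watchdog(check_nagios, restart_nagios, print)
```

Calling `main()` from a script that cron runs every few minutes gives the behaviour described: silence while nagios is healthy, one message after a successful restart, and a different message if the restart did not help.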
[Nagios-users] Status display colors
Is there any way to change the colors generated by status.cgi? My boss is complaining that it is really hard for him to read the dark blue hostname on the dark red background you get when a host is down. Thanks!

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
Re: [Nagios-users] Host searching
On Oct 4, 2007, at 10:56 PM, hindrek murdsalu wrote:

> Hey,
>
> I gather the 'freenasxxx' servers are somehow connected to each other (they're in same farm or provide same services), if so you could just define a hostgroup named 'freenas servers' or whichever name you desire and add the servers as members to it so you'd be able to list them all in 1 window.
> This, of course, isn't a solution to search rather than being an alternative.
>
> Hindrek

Thanks for the response. Yeah, with that example a hostgroup could be a relatively adequate solution. I'll have to consider it further for organizational sake, if nothing else. The problem though is that this is only one example of where I would want a real search, and others may not work as well with this solution. I can't count the number of times I have wanted to get the IP for a given device, but don't remember exactly what I named it (such as freenasbrw or brw-freenas) or else have some other device with a similar name that comes up first.

Or, perhaps for a better example, I know I have a printer out in Nome, but I can't remember if that was the xerox F12 or M15. To work with the hostgroup solution, I would need to have that in a "Nome" hostgroup. But I would also want it in a "printers" host group for logic sake, and perhaps a "F12" or "M15" hostgroup as well, for situations when I want to see where in the network I have a given type of printer. Can I put things in multiple host groups? I'll have to look into that. And no, with about a dozen different locations around the state, and probably at least as many different types of printers around the company, I'm not quite yet to the point where I can remember them all off the top of my head :)

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster
> Sent: October 5, 2007 0:00
> To: Nagios Users mailinglist
> Subject: [Nagios-users] Host searching
>
> Is there a way to search for a host in Nagios? For example, I have a number of hosts set up named freenasxxx, where xxx is the three-letter designator for the location. I would like to be able to type freenas and get a list of all of them. Or if I named one xxx-freenas for some reason, but don't remember that, I would like to be able to do the same thing and get that one as well. The "show host" box in the default sidebar only shows the first result for a partial match, which is almost useless. I did find <http://www.nagiosexchange.org/Misc.36.0.html?&tx_netnagext_pi1[p_view]=553>, which almost works, but has the problem of inserting two tabs before the hostname when you choose one of the options, which makes the search return no results. You can delete the tabs, of course, but this brings up the choice list again with the first item selected, so when you hit enter you get the tabs back. It also shows each host twice for some reason (I'm using the php script version). Any suggestions would be appreciated. Thanks!
>
> ---
> Israel Brewster
> Computer Support Technician
> Frontier Flying Service Inc.
> 5245 Airport Industrial Rd
> Fairbanks, AK 99709
> (907) 450-7250 x293
> ---
>
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/ > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -- > --- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a > browser. > Download your FREE copy of Splunk no