Re: [Nagios-users] Dynamic warning/critical thresholds
As indicated, write a plugin that collects the data, and set its thresholds so high that they're never likely to ring the red bell. In addition, I'd set it up with something like nagiosgraph to generate graphs that you can watch. This saves and displays historical data and shows you the norm, so you may wish to set a threshold at a later date based on this empirical data.

If you don't want to go to the trouble of nagiosgraph, your plugin can still email you when the rate reaches a threshold you define (again, without setting off Nagios warning/critical alarms).

On Tue, Jul 10, 2012 at 10:28 AM, Jonathan Gazeley jonathan.gaze...@bristol.ac.uk wrote:
> On 10/07/12 14:47, C. Bensend wrote:
>> On 22/06/12 15:11, Jonathan Gazeley wrote:
>>> I've got a bunch of Nagios plugins that monitor things like DNS/HTTP/RADIUS hits per second. I've set what I believe to be sensible max/min warning thresholds, but what I really want is dynamic thresholds. If some quantity suddenly doubles or halves, I'd like an alert. For example, if I usually serve 10 DNS lookups per second, and suddenly it is doing 20 per second, that isn't a fault but I would like to know about it, because it might mean there is a problem with the network in general. Is there a way of doing this? Any ideas?
>>
>> You've already received two replies, both stating that you'll likely have to write some code to do it. I'm not aware of any common plugins out there that calculate rates of change and alert appropriately. Maybe they exist, but I don't recall seeing any of them. Have you tried any of the plugin sites?
>
> Oh, I didn't receive any replies. Presumably the mails got lost in the ether. I'm happy to write code - I just wondered if there was a built-in way of doing this.
>
> Thanks for your response,
> Jonathan

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
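
The "alert when the rate doubles or halves" idea from this thread could be sketched roughly as follows. This is only a minimal illustration, not an existing plugin: the function names, the factor of 2 and the smoothing weight are all assumptions. The exit codes are the usual Nagios plugin ones (0 OK, 1 WARNING).

```python
# Hypothetical sketch: names, factor and smoothing weight are illustrative assumptions.
OK, WARNING = 0, 1  # standard Nagios plugin exit codes


def classify_rate(current, baseline, factor=2.0):
    """Return WARNING when current is at least `factor` times the baseline,
    or at most baseline / factor; otherwise return OK."""
    if baseline <= 0:
        return OK  # no usable baseline yet, so stay quiet
    if current >= baseline * factor or current <= baseline / factor:
        return WARNING
    return OK


def update_baseline(baseline, current, weight=0.1):
    """Exponentially weighted moving average, so the norm follows slow drift."""
    return (1 - weight) * baseline + weight * current
```

A plugin built on this would read the previous baseline from a state file, call classify_rate() with the freshly measured rate, store update_baseline() for the next run, and exit with the returned code. Where the state is kept is left open here.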
Re: [Nagios-users] Hostgroup Members
Actually, that raises an interesting tangential question. Can/should your Nagios server handle mail delivery to many, many people? Unless it's tuned for higher-capacity email delivery, I'd expect SMTP functions to negatively impact Nagios checks. In other words, why not just use a single address that points to a list server that can better handle delivery to larger groups of people?

On Fri, Sep 23, 2011 at 11:51 AM, Brandon Phelps bphe...@gls.com wrote:
> Hello,
>
> We are using a fairly old version of Nagios (1.4.1) which has been running great for years and is in production on 100+ servers, so we are a bit hesitant to update. If it ain't broke don't fix it, right?
>
> Anyway, one minor problem is that in the Nagios configuration, the members directive for a hostgroup can only support a certain number of entries. The members directive takes a comma-delimited list of members, and that list, it seems, can only be a maximum of 2000ish (I think, I don't recall offhand) characters. Like:
>
>     hostgroup {
>         ...
>         members = Member1,Member2,Member3,...,Member200, Member201, Member202
>     }
>
> My question is, do newer versions of Nagios remove this limitation? It isn't really a huge deal since we can simply create additional hostgroups when we reach the limit on one; however, if this is fixed in a newer version then that, for us, would be a good reason to upgrade.
>
> Thanks,
> Brandon
Re: [Nagios-users] Single host, multiple HTTP services
One way might be to use the check_http plug-in, but have it look for a specific string in each web page being hosted, and set the warning/critical response-time thresholds empirically.

On Thu, Jul 21, 2011 at 9:02 AM, af.at.w...@gmail.com wrote:
> Hey gang,
>
> I am having some difficulty getting a single host set up that would have multiple HTTP services attached to it. The scenario is a load-balanced group of web servers, and I am looking to monitor numerous public-facing web sites that would be bound to a single load-balanced IP address.
>
> Any help would be most appreciated!
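
As a rough sketch of the check_http idea above, the helper below builds one argument list per site, all pointed at the same load-balanced address. The helper name, plugin path, IP address, host names and thresholds are assumptions for illustration; -I, -H, -u, -s, -w and -c are the check_http options for address, virtual host, URL, expected string, and warning/critical response time.

```python
# Hypothetical helper; the plugin path and the example values are assumptions.
def check_http_args(vip, vhost, expected, warn_s, crit_s,
                    plugin="/usr/local/nagios/libexec/check_http"):
    """Build a check_http argument list for one virtual host behind a shared IP."""
    return [
        plugin,
        "-I", vip,          # connect to the load-balanced IP address
        "-H", vhost,        # send this name as the HTTP Host header
        "-u", "/",          # page to fetch
        "-s", expected,     # string that must appear in the response
        "-w", str(warn_s),  # warning response time, in seconds
        "-c", str(crit_s),  # critical response time, in seconds
    ]


if __name__ == "__main__":
    for site, text in (("www.example.com", "Welcome"), ("shop.example.com", "Cart")):
        print(" ".join(check_http_args("192.0.2.10", site, text, 2, 5)))
```

In a Nagios config, the same arguments would normally live in one command definition, with one service per web site attached to the single host.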
Re: [Nagios-users] Better way to check Cisco switches?
Nagios is a phenomenal tool but isn't necessarily a silver bullet. Nagios does try a certain number of times before a soft failure becomes a hard failure. This can be adjusted with max_check_attempts. Also, many people have replaced a very tight check_ping with check_icmp and looser round-trip thresholds.

But in general, you might wish to monitor a manageable switch with something else. Indeed, SNMP is an excellent way to go. You'll then need to figure out some way to manage the SNMP messages, though. One way might be to search for interface-down messages where the interface doesn't come back within a period of time. This can be done with logsurfer, Splunk and probably a number of other tools.

On Tue, Jul 5, 2011 at 9:39 AM, Bailey, Damian S. baile...@lcps.k12.va.us wrote:
> Good morning!
>
> After posting a bit ago, I found that other Nagios users experienced the same issue I did in relying on the check_ping routine when monitoring Cisco switches - they would occasionally return a false critical; however, the switches were actually still online and would immediately recover.
>
> So this leads me to the obvious question - how do you all use Nagios to effectively monitor Cisco switches? Is there a better way to check these with SNMP? Something else?
>
> Thanks!
> Damian Bailey
> Lead Technician | LCPS Technology
>
> From: Bailey, Damian S.
> Sent: Monday, June 27, 2011 11:55 AM
> To: 'nagios-users@lists.sourceforge.net'
> Subject: new to Nagios - known issue w/ Cisco switch host checks?
>
>> We’re new to using Nagios but I’ve grown to love it! In using the product to perform host checks on our Cisco switches, I find that they “randomly” will fail, then recover almost immediately. I think the host checks use ping to verify that the switches are active. Is there something I am likely doing wrong, or should I look at our network as a possible cause? Our network itself isn’t perfect… but I don’t want to go looking for problems if it’s a Nagios issue.
>>
>> Thanks for any help. I’ll be glad to provide more info if needed.
>>
>> Damian Bailey
>> Lead Technician | LCPS Technology
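
The "interface down and not back within a period of time" search mentioned in the reply could look something like the sketch below. It assumes the SNMP or syslog messages have already been parsed into (timestamp, interface, state) tuples. That event format, the function name and the 60-second grace period are illustrative assumptions, not the behaviour of logsurfer or Splunk.

```python
# Hypothetical sketch; the event format and the 60-second grace period are assumptions.
def unrecovered_links(events, now, grace=60):
    """events: iterable of (timestamp, interface, state) sorted by timestamp,
    where state is "down" or "up". Return the interfaces that are still down
    and have been down for longer than `grace` seconds as of `now`."""
    down_since = {}
    for timestamp, interface, state in events:
        if state == "down":
            # remember the first time the link went down
            down_since.setdefault(interface, timestamp)
        elif state == "up":
            # the link recovered, so forget the earlier failure
            down_since.pop(interface, None)
    return sorted(i for i, t in down_since.items() if now - t > grace)
```

A link that flaps and recovers straight away never shows up in the result, which is the point: only interfaces that stay down past the grace period are worth an alert.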
Re: [Nagios-users] Reloading configuration on-the-fly
To get all checks to run in a shorter amount of time, take a look at reducing max_service_check_spread and max_host_check_spread. You might also be interested in the fast startup options in the documentation. This will also point you to such parameters as use_large_installation_tweaks.

On Fri, Jul 1, 2011 at 11:22 AM, nag ios nagiost...@gmail.com wrote:
> Can you let me know the on-the-fly process in detail?
>
> On Fri, Jul 1, 2011 at 3:58 PM, Andreas Ericsson a...@op5.se wrote:
>> On 07/01/2011 11:48 AM, Marc-André Doll wrote:
>>> Hi list,
>>>
>>> I'm starting to have a fairly big perimeter (well, probably not as big as some of yours) on one Nagios, and it's getting quite delicate to restart/reload it, as it takes between 6 and 10 minutes to start scheduling checks again. Is there a way/module/tool to reload the configuration without those dead windows in my monitoring?
>>
>> First of all, make sure you're running a recent enough version of Nagios to have Jean Gabès' patch for speeding up circular host/parent paths.
>>
>> Secondly, precache the configuration before reloading and use the precached version of the object config when doing the actual restart. This will bring your downtime down from 6-10 minutes to perhaps 1-5 seconds. You'll still lose the full 6-10 minutes for recently added objects, but the previously existing ones will keep being monitored.
>>
>> --
>> Andreas Ericsson andreas.erics...@op5.se
>> OP5 AB www.op5.se
>> Tel: +46 8-230225 Fax: +46 8-230231
>>
>> Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] Question about check_disk -L option
Yueh-Hung Liu wrote:
> it's not an error, you specify to check /usr/local only and it's a remote fs, so no local fs will be checked and the -L option just tests the access to nfs.

I should have included that the check_disk plugin using the -L option returns the value 3, which indicates unknown status. I would expect check_disk -L to return a 0 status if the remote file system is accessible.

--
David Wallis
Principal Computer Engineer
Information Technology Group
Advanced Photon Source
Argonne National Laboratory
[Nagios-users] acknowledge triggers a script
We have an interesting need. When a particular service goes red on our Nagios 3.2.1 server, we'd like to be able to click on "Acknowledge this service problem" and have that activate a local script. Anyone have any idea how this can be accomplished?

TIA
[Nagios-users] Question about check_disk -L option
On Red Hat 6 clients, I need to monitor some NFS mount points for conditions like stale NFS file handles. The check_disk plugin describes the -L (--stat-remote-fs) option as providing this functionality. However, either I'm not understanding how the option works, or it's not working properly. Here is an example of what I'm seeing:

    # df -h /usr/local
    Filesystem              Size  Used Avail Use% Mounted on
    oxygennfs48:/usr/local  250G  214G   37G  86% /usr/local
    #
    # ./check_disk -w 20% -c 10% -p /usr/local
    DISK WARNING - free space: /usr/local 37846 MB (14% inode=98%);| /usr/local=218154MB;204800;230400;0;256000
    #
    # ./check_disk -w 20% -c 10% -p /usr/local -L
    DISK UNKNOWN - free space:|
    #

Does anyone have any experience using this option, or have any guesses why it's generating an error?

--
David Wallis
Principal Computer Engineer
Information Technology Group
Advanced Photon Source
Argonne National Laboratory
Re: [Nagios-users] Monitoring who is connected to Samba share
I don't know of any plug-in, but you can roll your own. If you're already running Samba on a Linux box, you can mess with the output of net status shares, assuming users pick up (mount) their homes when they log in to Windows. I believe this also shows login times.

For a more generic solution (if you don't want to run Samba and join the domain), I've had some luck with the Sysinternals tool psloggedon. I set up a scheduled job on a Windows host that searches all Windows hosts for logins and dumps the output to a share I can see on the Linux side. A Linux host then parses the crap out of this and makes a nice web page. Ugly, but it works. I bet a good VBS programmer could do it all on the Windows side, and this could be tied in to a Linux Nagios server via NRPE, NSClient++, NSCA or whatever.

On Wed, Mar 2, 2011 at 9:04 AM, Kaplan, Andrew H. ahkap...@partners.org wrote:
> Hi there --
>
> Is there a plug-in that will provide a list of users and/or computers that are currently connected to a Samba share?
>
> Thanks.
Re: [Nagios-users] Monitoring unmounted partition
Write a plugin. It could search the output of the mount command.

On Tue, Feb 8, 2011 at 6:49 AM, Javier Rivera jriv...@skisa.com wrote:
> Good morning,
>
> I'm having a problem with check_nrpe. I'm monitoring a partition, /mnt/2 for example. If this partition isn't mounted, the check just returns the values for / without reporting any error. How can I get an alert when the partition isn't mounted?
>
> Thanks in advance.
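
A plugin along the lines suggested above might be sketched like this. It assumes mount prints lines of the form "DEVICE on MOUNTPOINT ..." (as on Linux), and it does not handle mount points containing spaces. The function names are made up for the example; the exit codes follow the usual Nagios plugin convention (0 OK, 2 CRITICAL).

```python
# Hypothetical sketch; function names are illustrative, exit codes are 0 OK / 2 CRITICAL.
import subprocess


def is_mounted(mount_output, mount_point):
    """Search `mount` output lines shaped like 'DEVICE on MOUNTPOINT ...'."""
    for line in mount_output.splitlines():
        fields = line.split()
        if len(fields) >= 3 and fields[1] == "on" and fields[2] == mount_point:
            return True
    return False


def check_mount(mount_output, mount_point):
    """Return (exit_code, message) in the style of a Nagios plugin."""
    if is_mounted(mount_output, mount_point):
        return 0, "OK - %s is mounted" % mount_point
    return 2, "CRITICAL - %s is not mounted" % mount_point


def read_mount_output():
    """Run the system `mount` command and return its text output."""
    return subprocess.run(["mount"], capture_output=True, text=True).stdout
```

The script would print the message from check_mount(read_mount_output(), "/mnt/2") and exit with the returned code. Run through NRPE, that gives a CRITICAL instead of silently reporting the values for /.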
Re: [Nagios-users] What happened during night for services that failed and timeperiod was off
I'd try to write a wrapper for an event handler that checks the time and keeps a log. A morning cron job could then check for the presence of the log and, if it exists, email it to you.

On Mon, Jan 31, 2011 at 9:46 AM, Tntteam tntt...@free.fr wrote:
> Hello there,
>
> I'm wondering if there is a way to make Nagios send notifications for services which failed outside the notification timeperiod. I mean, if I define a service with a timeperiod for notifications from 7am to 7pm, and the service fails at 4am, there will be no notification sent unless I set up a resend notification interval. There are services that don't need a resend, plus I want all missed notifications to be sent at 7am. Like a night notification digest: what happened from 7pm to 7am is sent at 7am, when the service enters the notification period.
>
> Any ideas?
>
> Thanks
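
The event-handler wrapper described in the reply could be sketched as below. The 07:00 to 19:00 window comes from the question; the function names and the log line format are assumptions for illustration.

```python
# Hypothetical sketch; function names and the log format are assumptions.
from datetime import datetime, time


def outside_window(moment, start=time(7, 0), end=time(19, 0)):
    """True when `moment` falls outside the daily notification window."""
    return not (start <= moment.time() < end)


def log_if_quiet_hours(log_path, moment, host, service, state):
    """Append one line to the digest log when the event happens at night.
    Returns True if a line was written, False otherwise."""
    if not outside_window(moment):
        return False
    with open(log_path, "a") as log:
        log.write("%s %s/%s is %s\n" % (moment.isoformat(), host, service, state))
    return True
```

The event handler would call log_if_quiet_hours() with datetime.now() and the values of the $HOSTNAME$, $SERVICEDESC$ and $SERVICESTATE$ macros. A cron job at 7am then mails the file if it exists and removes it.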
Re: [Nagios-users] Checking multiple TCP ports for a single status?
Wrapper?

    $NAGIOS/libexec/check_tcp -H myhost -p 135 && $NAGIOS/libexec/check_tcp -H myhost -p 445

On Wed, Dec 29, 2010 at 1:39 PM, stan st...@panix.com wrote:
> I think I need to verify that both port 135 and port 445 are available on some Windows machines. As I understand it, both of these need to be up. I'd like to make this a single check.
>
> Looks like check_tcp will only accept a single -p argument. Is this correct? If so, is there a way I can AND two different check_tcp runs, and report a single status back to Nagios?
>
> --
> A: Because it messes up the order in which people normally read text.
> Q: Why is top-posting such a bad thing?
> A: Top-posting.
> Q: What is the most annoying thing in e-mail?
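
A slightly fuller version of the wrapper idea is sketched below: run check_tcp once per port and report the worst result, so the combined check is OK only if every port is OK. The plugin path, function names and the ranking of UNKNOWN between WARNING and CRITICAL are assumptions. The -H and -p options are the ones used in the thread.

```python
# Hypothetical wrapper sketch; the plugin path and severity ranking are assumptions.
import subprocess

OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3  # standard Nagios plugin exit codes
SEVERITY = {OK: 0, WARNING: 1, UNKNOWN: 2, CRITICAL: 3}


def worst_status(codes):
    """AND several results: OK only if every check returned OK.
    Unexpected exit codes are treated as UNKNOWN."""
    known = [code if code in SEVERITY else UNKNOWN for code in codes]
    return max(known, key=SEVERITY.get, default=UNKNOWN)


def check_ports(host, ports, plugin="/usr/local/nagios/libexec/check_tcp"):
    """Run check_tcp once per port and return the combined exit code."""
    codes = []
    for port in ports:
        result = subprocess.run([plugin, "-H", host, "-p", str(port)])
        codes.append(result.returncode)
    return worst_status(codes)
```

The script would finish with sys.exit(check_ports("myhost", [135, 445])), which gives Nagios a single status for both ports.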
Re: [Nagios-users] Nagios configuraion best practice
I think this is more a personal (or site) preference. But using host groups can be very useful to, e.g., ensure that all Windows machines get thus-and-such service checks, all RedHat machines get thus-and-such service checks, etc. In other words, a wildcard or hostgroup tag for a service check can substantially reduce the size of your config, make it more readable and ensure uniformity.

From a grouping perspective, i.e. if you tend to display by hostgroups, it's often helpful to list all hosts once and only once. However, in other environments, some find it very useful to list some (or all) machines in multiple groups. This would allow you to easily view all RHEL9 hosts, SLES10 hosts that have not yet been updated, hosts in room 123, etc. The downside of this is that when hosts go red, you'll see a lot more red...

On Wed, Dec 8, 2010 at 7:52 AM, Kenneth Holter kenneho@gmail.com wrote:
> Hi all,
>
> I'm picking up this old thread, as I'm about to start restructuring parts of my Nagios configuration.
>
> In my current configuration I've created host groups to which I've linked both hosts and services. Example: I have a host group called linux-servers, to which all Linux servers are linked. A number of service checks common to all Linux servers are also linked to this host group. When adding new Linux boxes I simply link them to the linux-servers host group. In a similar fashion I have a host group called application-servers, to which all application servers and related service checks are linked. When adding a new application server, I make sure it links to both linux-servers and application-servers.
>
> This setup works pretty well, but it really clutters up the host groups web page. Is this the way others have structured their Nagios configuration?
>
> Regards,
> Kenneth
>
> On Wed, Sep 17, 2008 at 2:55 PM, Kenneth Holter kenneho@gmail.com wrote:
>> I should have been more precise regarding the cluttering of the display - it was the hostgroup display I was referring to. :/
>>
>> Would it be an idea to use, as much as possible, this configuration method: hosts --- hostgroups --- services? To elaborate some: hosts are always connected to host groups, and the same goes for services. This way one will avoid linking services to specific hosts, making the service definitions cleaner.
>>
>> Also, I'm thinking about having a host group called, for example, linux-servers, to which all Linux servers are linked. A number of service checks common to all Linux servers are also linked to this host group. When adding new Linux boxes I simply link them to the linux-servers host group, and add extra service checks where needed. In a similar fashion I would create a host group called, for example, dell-servers, to which all Dell hosts and related service checks are linked. New Dell servers are then linked to this host group.
>>
>> I'm thinking this may be a good idea, but would like to hear how others have structured their configuration and if there may be some hidden pitfalls in my proposed configuration method.
>>
>> On 9/16/08, dave stern - e-mail.pluribus.unum dit.d...@gmail.com wrote:
>>> Not quite sure what you're trying to do. If you monitor a service, it's going to be on the Nagios service details page regardless. That's the whole point. There are some tricks to slim down some other displays, e.g. I define a whole bunch of services under a host without explicitly defining the hosts these services run on. I just feed the IP address as part of the check_command. This reduced the hostgroup grid page. I don't think that will work for you.
>>>
>>> If you're asking how to most efficiently code this in your configs, the timesaving tips page mentions ganging together services, i.e. you don't need a service stanza for each host you want to run this on. Rather, use a wildcard or group.
>>>
>>>     define service{
>>>         use             generic
>>>         host_name       hosta,hostb
>>>         hostgroup_name  special-ones,!webguy
>>>         ...
>>>     }
>>>
>>> This runs the service on hosts hosta and hostb and on all hosts in the hostgroup special-ones, but not on webguy.
>>>
>>> One other thing you should consider: where will the plugins live? The easy answer is to put them on an NFS-mounted partition that all hosts can access and that you can easily update. This is a bad practise. What happens when NFS breaks? You should really copy the plugins to each host so that they run locally.
>>>
>>> On Tue, Sep 16, 2008 at 8:05 AM, Kenneth Holter kenneho@gmail.com wrote:
>>>> Thanks for the reply. I'll read the documents you listed.
>>>>
>>>> Meanwhile, I have a design question: What's the best way to set up a service check to be executed on a selected few (or maybe all, for that matter) nodes? Say I need to monitor a particular process (let's call this service A) on a number of systems; how would I implement this? I guess one way would be to link service A to a hostgroup, say hostgroup A, and add the selected systems to that hostgroup. This, however, will somewhat
Re: [Nagios-users] is there a way to pass contact username thru url to status.cgi for authentication?
Yes:

    http://nagiosuser:nagiospassw...@nagioshost.domain/nagios/cgi-bin/status.cgi\?host=server...

Note the backslash before the question mark. BTW, I think IE blocks this for what MicroSloth calls security reasons.

On Thu, Oct 7, 2010 at 1:48 AM, Steve Kemp st...@dedicatedserversaustralia.com.au wrote:
> I'm looking for a way to allow remote access to Nagios based on specific clients (contacts) in Nagios without using HTTP authentication. The remote access is being displayed in another site using object tags. I don’t, however, want them to have access to any data not relating to their specific servers.
>
> So for example:
>
>     http://www.domain.com/nagios/cgi-bin/status.cgi?host=server1.clientsdomain.com.au&style=detail
>
> is what I would like to load, but the CGIs should only show the data relating to the contact's servers.
>
> In Firefox and Chrome I can pass authentication through the browser via http://user:pass@ but this isn’t allowed in IE, and isn’t ideal or secure anyway.
>
> Is there a way to pass the contact variable through to the browser, i.e.
>
>     http://www.domain.com/nagios/cgi-bin/status.cgi?host=server1.clientsdomain.com.au&style=detail&contact=xyz
>
> so that while the user isn’t given a popup login box they are still restricted to only seeing data relating to their server or servers?
>
> I have hunted high and low for a way around this and have had no luck so far…
>
> Regards
> Steve Kemp
> Dedicated Servers Australia
> www.dedicatedserversaustralia.com.au
> Tel: +61 7 30187567
> Fax: +61 7 38476684
> Support: https://accounts.dedicatedserversaustralia.com.au
> Sales Email: dedaus.sa...@dedicatedserversaustralia.com.au
Re: [Nagios-users] Mail sending issue.
Sounds like a postfix error. You DID configure postfix mail, right? Take a look at http://www.postfix.org/BASIC_CONFIGURATION_README.html
This isn't a Nagios question.
On Mon, Aug 16, 2010 at 3:10 PM, i...@toonz it.to...@gmail.com wrote:
> Hi all, We are testing with Fully Automated Nagios. We have installed it and started monitoring 2 Windows 2003 servers, but we can't get notification to work. When we certify the Nagios installation we get the following error:
> send-mail: fatal: config variable inet_interfaces: host not found: local host
> What does notify-host-by-mail mean? Is the notification sent to the host only? How can we make FAN send mails? We have an internal mail server, and are trying to send notifications to an ID there. Connectivity is fine, and we have entered the ID in the contacts.cfg file.
> We have installed NSClient++ on these servers, but the host server shows connection refused, and there is no firewall! Please help!!
> Ananth.
Re: [Nagios-users] Executing a script based on text from a web page
We've had a lot of luck with check_http with the params -u and -s. -u points to the particular path on the web server and -s is the string it expects to find in the response. And then there's the hacky way of doing it: wget the page and search for the presence/absence of a particular string.
On Fri, Aug 6, 2010 at 10:16 AM, Joseph Devaney jdeva...@blackducksoftware.com wrote:
> I know that Nagios can monitor pages for certain text on that page to verify that it's there. In our organization we have several monitors like this currently set up and they work well for us. I want to take this capability and apply it to a webpage that only appears when there is a server error and have it kick off a script to restart a service.
> This is for a website within the company. When there are too many mysql connections, the site itself will not be displayed but will display a friendly error asking us to perform a flush of the hosts. This error doesn't occur often, and when it does occur, it doesn't always happen when someone is awake to address it.
> Basically what I want to happen is for Nagios to monitor a website and if/when this error appears (I will set the check_command to look for the right text) Nagios will execute a script to restart that service. 99% of the time, the site will be up and functioning as it should be, so I am not sure how to write this one up without giving a false positive on our monitors. Any thoughts on how to implement this?
> Joseph Devaney | Jr. Linux Systems Administrator | Black Duck Software
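The "fetch the page and search for a string" approach mentioned above can be sketched as a small plugin. This is only an illustration, not anything posted to the list: the function names are hypothetical, and it assumes the check should go CRITICAL when the error text is present (the inverse of the usual check_http -s usage), using the standard plugin exit codes 0, 2 and 3.

```python
import sys
import urllib.error
import urllib.request

# Standard Nagios plugin exit codes.
OK, CRITICAL, UNKNOWN = 0, 2, 3


def classify(page_text, error_marker):
    """Return (exit_code, message); CRITICAL when the error text is present."""
    if error_marker in page_text:
        return CRITICAL, "CRITICAL - found %r in page" % error_marker
    return OK, "OK - error text not present"


def check_url(url, error_marker, timeout=10):
    """Fetch the page (hypothetical helper) and classify its body."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            text = resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as exc:
        # An error page may arrive with a 5xx status; its body is still readable.
        text = exc.read().decode("utf-8", errors="replace")
    except OSError as exc:
        return UNKNOWN, "UNKNOWN - could not fetch %s: %s" % (url, exc)
    return classify(text, error_marker)


if __name__ == "__main__" and len(sys.argv) == 3:
    code, message = check_url(sys.argv[1], sys.argv[2])
    print(message)
    sys.exit(code)
```

Because the check is OK whenever the error text is absent, it stays green the 99% of the time the site is healthy; restarting the service would then be a separate event handler attached to this check.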
Re: [Nagios-users] checks per host
As pointed out, normal depends on your environment. Some sites only check a handful of things per host; some check more. The bigger issue is whether your server can handle the number of checks, and in a timely fashion. Look at the Nagios parameters max_service_check_spread and max_host_check_spread, as well as the various params for individual services such as check_interval.
If you have a check that takes a long time to complete, you'll probably want to run it less frequently. If the client were Unix, not Windows, you'd have the luxury of being able to run the check on the host in question. This can be done either via cron, having it generate a status file that Nagios then checks, or using NSCA.
On Mon, Jul 12, 2010 at 1:39 PM, Joel Brooks jbro...@oddelement.com wrote:
> hey gang, I'm trying to get a sense of what's normal for the number of checks per host. I'm pushing nagios to a number of servers and the list of things I want to monitor keeps growing. For some servers, I've got 30 checks - some 50. what is normal out there? is there a practical limit?
> i'm using nagios 3 on centos 5 to monitor windows with nsclient++ 3.8.
> cheers, J
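The cron-plus-status-file idea above could look roughly like the following. This is a sketch under assumptions: the "code|message" file format, the function names and the staleness limit are all made up for illustration. The cron job would call write_status after the slow check, and the Nagios-side plugin would call read_status.

```python
import os
import time

UNKNOWN = 3  # standard plugin exit code for "cannot determine state"


def write_status(path, code, message):
    """Called from cron: record the result atomically as 'code|message'."""
    tmp_path = path + ".tmp"
    with open(tmp_path, "w") as fh:
        fh.write("%d|%s\n" % (code, message))
    os.replace(tmp_path, path)  # readers never see a half-written file


def read_status(path, max_age_seconds, now=None):
    """Called by the Nagios check: return (exit_code, message)."""
    now = time.time() if now is None else now
    try:
        age = now - os.path.getmtime(path)
        with open(path) as fh:
            code_text, message = fh.readline().rstrip("\n").split("|", 1)
        code = int(code_text)
    except (OSError, ValueError) as exc:
        return UNKNOWN, "UNKNOWN - cannot read %s: %s" % (path, exc)
    if age > max_age_seconds:
        # A stale file means the cron job stopped running; do not trust it.
        return UNKNOWN, "UNKNOWN - status file is %d seconds old" % age
    return code, message
```

The staleness test matters: without it, a dead cron job would leave the last good result in place and the check would stay green indefinitely.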
[Nagios-users] Are NSClient++ and NRPE mutually exclusive?
I've got NSClient++ installed on a number of Windows XP and W2K servers reporting on such things as disk utilization, memory and load. This all works fine. I'd like to take it to the next level by defining my own external scripts, but the examples all show using the check_nrpe command on the Linux/Nagios side. Can't they be run under check_nt, with something like
$USER1$/check_nt -p 12489 -H $HOSTADDRESS$ -s mysecret -v check_something
Or must I also install check_nrpe on the Linux side? And if so, do I need to install the Windows NRPE as well?
On the Windows side, the config file is basically the default with the following additions to the [External Scripts] section:
check_something=C:\myscripts\check_something.vbs
check_test=C:\myscripts\psloggedon.exe
TIA
[Nagios-users] monitoring radius on windows
We're interested in monitoring RADIUS with Nagios 3.2.1. We have IAS installed on two Windows boxes providing separate RADIUS AAA authentication. I tried the check_radius.pl plugin as well as another compilable one, but they don't appear to work despite providing them with user, password and secret. I suspect the way Windows handles RADIUS through IAS may be different.
In the case of one of the RADIUS boxes, I just installed NSClient++ and monitored for the service. But the other box is a Windows 2008 server and doesn't use a service for IAS but roles. Anyone have any idea how to monitor it?
TIA
[Nagios-users] check load oddity
This is making no sense to me at all. It is obvious to me that my load is less than the critical threshold, why is the plugin reporting a critical state?
[r...@monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5
CRITICAL - load average: 1.96, 1.01, 0.75|load1=1.960;0.000;0.000;0; load5=1.010;0.000;0.000;0; load15=0.750;0.000;0.000;0;
Usage:check_load [-r] -w WLOAD1,WLOAD5,WLOAD15 -c CLOAD1,CLOAD5,CLOAD15
Options:
 -h, --help
    Print detailed help screen
 -V, --version
    Print version information
 -w, --warning=WLOAD1,WLOAD5,WLOAD15
    Exit with WARNING status if load average exceeds WLOADn
 -c, --critical=CLOAD1,CLOAD5,CLOAD15
    Exit with CRITICAL status if load average exceed CLOADn
    the load average format is the same used by uptime and w
 -r, --percpu
    Divide the load averages by the number of CPUs (when possible)
Completely confused with this one.
DAve
--
Posterity, you will know how much it cost the present generation to preserve your freedom. I hope you will make good use of it. If you do not, I shall repent in heaven that ever I took half the pains to preserve it. John Adams
http://appleseedinfo.org
Re: [Nagios-users] check load oddity
Morris, Patrick wrote:
> DAve wrote:
>> This is making no sense to me at all. It is obvious to me that my load is less than the critical threshold, why is the plugin reporting a critical state?
>> [r...@monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5
>> CRITICAL - load average: 1.96, 1.01, 0.75|load1=1.960;0.000;0.000;0; load5=1.010;0.000;0.000;0; load15=0.750;0.000;0.000;0;
> Re-read the output of the syntax help the plugin is giving you. You are not passing the plugin valid parameters.
The preloaded command in NagiosQL is wrong and I never even checked it. Coffee needed... stares at ceiling, sigh...
[r...@monitor1 plugins]# ./check_load -w 2.0,1.8,1.5 -c 3.0,2.8,2.5
OK - load average: 1.76, 0.86, 0.72|load1=1.760;2.000;3.000;0; load5=0.860;1.800;2.800;0; load15=0.720;1.500;2.500;0;
DAve
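Why the first invocation went CRITICAL can be illustrated with a small sketch. It is not the plugin's source: it assumes, based only on the perfdata above (warn and crit shown as 0.000), that thresholds which are never passed with -w/-c end up as zero, and that a load "exceeds" a threshold when it is strictly greater. The function names are hypothetical.

```python
def parse_triple(text):
    """Parse 'L1,L5,L15' into three floats, as the -w/-c arguments expect."""
    parts = [float(piece) for piece in text.split(",")]
    if len(parts) != 3:
        raise ValueError("expected three comma-separated values: %r" % text)
    return parts


def evaluate_load(loads, warn=None, crit=None):
    """Compare 1, 5 and 15 minute loads against warning/critical triples."""
    # Assumption: thresholds that were never supplied default to 0.0.
    warn_t = parse_triple(warn) if warn else [0.0, 0.0, 0.0]
    crit_t = parse_triple(crit) if crit else [0.0, 0.0, 0.0]
    if any(load > limit for load, limit in zip(loads, crit_t)):
        return "CRITICAL"
    if any(load > limit for load, limit in zip(loads, warn_t)):
        return "WARNING"
    return "OK"


# Bare positional numbers never reach -w/-c, so any non-zero load is CRITICAL.
print(evaluate_load([1.96, 1.01, 0.75]))
# With real thresholds the same kind of load is fine.
print(evaluate_load([1.76, 0.86, 0.72], "2.0,1.8,1.5", "3.0,2.8,2.5"))
```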
[Nagios-users] nagios output as clickable link
This may be stretching Nagios beyond its designed intention but... I'm writing a plugin that could produce many, many lines of output. Since this obviously won't fit on a status page, what I'd like to do is, if the plugin's exit status is not 0, produce a web page and, in the status information field of the service, have a clickable link to take you to that page. Has anyone successfully tried this before?
TIA
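One way the idea above might be sketched: write the long output to an HTML file and return a one-line status containing an anchor tag. Everything here is an assumption for illustration (function name, report directory, base URL). It also assumes the web interface is configured to show HTML in plugin output; in Nagios 3.x that is governed by the escape_html_tags option in cgi.cfg, which would have to be 0 for the link to be clickable.

```python
import html
import os


def report_with_link(lines, exit_code, report_dir, base_url, name):
    """Return the one-line status text; write a full report page on failure."""
    if exit_code == 0:
        return "OK - nothing to report"
    # Hypothetical layout: one HTML file per service under a web-served dir.
    path = os.path.join(report_dir, name + ".html")
    body = "\n".join(html.escape(line) for line in lines)
    with open(path, "w") as fh:
        fh.write("<html><body><pre>%s</pre></body></html>\n" % body)
    url = "%s/%s.html" % (base_url, name)
    return 'PROBLEM - %d lines of detail, <a href="%s">full report</a>' % (
        len(lines), url)
```

The plugin would print the returned string as its first line of output and exit with the original status code.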
[Nagios-users] tailoring a notification
I'd like to specify an additional string within the notification message for just a few hosts that will be sent to the contactgroup for those hosts and only those hosts. The string will be static. I presume this can be done with macros somehow?
TIA
[Nagios-users] read-only nagios
We're looking for a mechanism to allow others access to our Nagios server for viewing only. The kicker is they want to be able to reach it without any htpasswd login, so multiple htpasswd accounts and authorized_for_system_commands are not an option.
I'm thinking the only practical method to accomplish this is to somehow mirror the install elsewhere. One idea was a cronjob that ran a wget hitting the status CGI and dumping it somewhere on the same machine. This still tends to have authentication queries embedded in it. Another option might be another install on the same server, on a different port, with no authentication, and perhaps rsync'ing data from the master.
Has anyone set up anything like this before?
TIA
[Nagios-users] Summary: read-only nagios
It seems that all options will run up against the fact that we have enabled web authentication and are not willing to turn it off. I've come up with a couple of ideas, however.
1/ Create an htpasswd account called guest with no password. At least that limits the necessary typing/authentication.
2/ You can encode a username/password combination into a URL. This can also work with a redirect. So if I have an htpasswd account named open, password open, I can point people to a page containing this:
<META HTTP-EQUIV="refresh" CONTENT="1;URL=http://open:o...@mynagios.com/nagios">
This works perfectly for most browsers. IE, in its infinite wisdom, won't allow this because of security concerns.
Thanks to everyone for their help
[Nagios-users] monitor nfs share on windows
I'm trying to check the size of an NFS share that is accessible from a Windows host. The Windows host is running NSClient++, but there are a number of problems here.
First, the share isn't persistently mapped, so at a minimum I'd have to write a .bat wrapper that first mapped it to a drive letter and then ran the disk check.
Second, to test this, I mapped the drive to M: and then ran on my Nagios server
check_nt -H ... -p 12489 -s ... -v USEDDISKSPACE -l M -w 80 -c 90
but I got a segfault.
So I tried another tactic. I installed and turned on SNMP on the Windows host, mapped the drive to H and tried
check_win_snmp_disk.pl $HOSTADDRESS$ $community 6 80 90
I did get information back. The % usage appears correct but the sizes are way off (gigabytes vs terabytes). Can I trust the percentages from the above? Is the above script limited in size to gigabytes?
Is there a better/easier way to pull down this info? I tried an snmpwalk but couldn't find anything useful. Otherwise, I'd just call it directly with an snmpget.
TIA
Re: [Nagios-users] can nagios take some pro-active actions?
OK, everyone agrees an event handler can take action to fix a problem, but bear in mind that this comes with caveats. Effectively, a Nagios event handler is treating a symptom; the disease goes merrily on its way. If a service stops, WHY did it stop in the first place? Most good sysadmins would tackle the problem from the system end to ensure that the service would never fail again.
Furthermore, let's say a service failed for a reason, e.g. out of disk space. What good would it do to restart the service again? And if you build smarts into the event handler to look for and fix such a condition, is that the ONLY condition that could occur to stop this service?
Having said all this, event handlers do have their place. We in fact use them to shut down hosts if the temperature gets too hot. You can imagine the testing we went through before rolling out something like this.
On Thu, Sep 3, 2009 at 7:44 AM, Leonardo Carneiro lscarne...@veltrac.com.br wrote:
> hello everyone. I started to play with Nagios a few days ago and I'm very excited with it. I have a very small setup (2 Linux servers being monitored via NRPE by a third Linux server) and I wrote some bash scripts to monitor some of the services that we run on those servers (proprietary services, not standard ones like ssh, apache and that stuff).
> I know Nagios can send SMS, email and other things to warn administrators about problems, but can Nagios take any action to fix the problem, like restart the service if it reaches a critical state, or restart the service if it stays critical for more than 5 minutes? If yes, can someone just point me in the direction I should go? :)
> Tks in advance, and sorry about my poor English. I'm from Brazil.
> --
> Leonardo de Souza Carneiro
> Veltrac - Tecnologia em Logística.
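For readers following the thread, the decision logic of a restart-style event handler might be sketched like this. It assumes the handler is defined so that Nagios passes $SERVICESTATE$, $SERVICESTATETYPE$ and $SERVICEATTEMPT$ as arguments; the function names, the "third soft attempt" policy and the restart command are illustrative choices, not a recommendation from the posters above.

```python
import subprocess


def should_restart(state, state_type, attempt, soft_attempt=3):
    """Decide whether to act, given the three macro values as strings."""
    if state != "CRITICAL":
        return False  # OK, WARNING and UNKNOWN are left alone
    if state_type == "HARD":
        return True   # retries are exhausted, the problem is confirmed
    # Optionally act once, late in the soft retries, before anyone is notified.
    return state_type == "SOFT" and int(attempt) == soft_attempt


def handle_event(state, state_type, attempt, restart_cmd, run=subprocess.run):
    """Run restart_cmd (a hypothetical argv list) only when warranted."""
    if not should_restart(state, state_type, attempt):
        return False
    run(restart_cmd, check=False)
    return True
```

Keeping the decision in a pure function makes it possible to test the policy thoroughly before the handler is ever allowed to touch a production service, which is the kind of testing the reply above alludes to.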
[Nagios-users] hostgroups causing hairloss
I am at a point that I am looking for gremlins, no other cause can be found. I am replacing a very old Nagios install with Nagios 3.0.6, no importing, a fresh install, new hardware, a network audit and entering all hosts and services manually.
It seems that no matter what I do I keep getting some hosts added to a host group, even though I clearly did not define them as part of the host group.
My hostgroups.cfg,
###
define hostgroup {
    hostgroup_name  AUTH_DNS
    alias           Authoritative DNS
    members         CDC-Ns1auth,CDC-NSMS,IDC-Ns2auth,SDC-Ns3auth
}
define hostgroup {
    hostgroup_name  CACHE_DNS
    alias           Caching DNS
    members         CDC-Dnscache,CDC-Ns1,IDC-Ns2,SDC-Ns3
}
My hosts/CDC-Ns1auth.cfg,
###
define host {
    host_name                  CDC-Ns1auth
    alias                      ns1auth.tls.net
    display_name               TLS DNS server
    address                    65.124.104.30
    parents                    null
    hostgroups                 null
    check_command              check-host-alive
    use                        generic-host
    initial_state              o
    max_check_attempts         3
    check_interval             15
    retry_interval             1
    check_period               24x7
    process_perf_data          1
    retain_status_information  1
    contact_groups             host-oncall
    notification_interval      15
    notification_period        24x7
    notification_options       d,u,r
    register                   1
}
Everything looks fine, but when I start Nagios and look at View Config-Object Type-Host Groups, I see this!
Group Name   Description           Host Members
CACHE_DNS    Caching DNS service   CDC-Dnscache , CDC-Ns1 , CDC-Ns1auth , IDC-Ns2 , IDC-Ns2auth , SDC-Ns3 , SDC-Ns3auth
The CACHE_DNS host group should only contain CDC-Ns1, IDC-Ns2, and SDC-Ns3. Why are the other hosts in there?
I have tried renaming, I have tried starting fresh with new config files, new retention data, new object cache. Yet the hostgroup CACHE_DNS still contains hosts I did not define to be there.
Any help is appreciated, clues, clue bats, links, (yes I looked at the change log and searched for an answer, no luck).
Thanks,
DAve
Re: [Nagios-users] hostgroups causing hairloss
David Rosenstrauch wrote:
> DAve wrote:
>> Everything looks fine, but when I start Nagios and look at View Config-Object Type-Host Groups, I see this!
>> Group Name   Description           Host Members
>> CACHE_DNS    Caching DNS service   CDC-Dnscache , CDC-Ns1 , CDC-Ns1auth , IDC-Ns2 , IDC-Ns2auth , SDC-Ns3 , SDC-Ns3auth
>> The CACHE_DNS host group should only contain CDC-Ns1, IDC-Ns2, and SDC-Ns3.
> Probably not a coincidence that CDC-Ns1auth starts with CDC-Ns1. Sounds like it's using CDC-Ns1 as a prefix to match hostnames, rather than as an actual hostname.
> Looks like there's a config setting in 3.0 called use_regexp_matching which might control that. Perhaps in your old config file that was somehow turned on. (Or perhaps not defined at all, and on is the default.) Who knows what other old settings you're inheriting too.
> It's probably best for you to scrap your old config, and start fresh with a new 3.0 config file, then adapt it to your needs (i.e., point it to whatever custom definitions/definition-files you've created).
As I did, this is an all new install, everything is being re-done after a complete host and service audit of our network. There are no old configs in use.
Matching, hmm, that sticks in my mind somewhere. Time to go read the docs yet again for what I may have missed.
DAve
Re: [Nagios-users] hostgroups causing hairloss
Martin Melin wrote:
> Something that immediately comes to mind: What are your regexp matching settings? use_regexp_matching and use_true_regexp_matching
Ahhh! You are correct sir, thank you. I totally and completely missed that new feature. Thank you.
DAve
> Regards
> Martin
> On Wed, Jul 8, 2009 at 6:15 PM, DAve dave.l...@pixelhammer.com wrote:
>> I am at a point that I am looking for gremlins, no other cause can be found. I am replacing a very old Nagios install with Nagios 3.0.6, no importing, a fresh install, new hardware, a network audit and entering all hosts and services manually.
>> It seems that no matter what I do I keep getting some hosts added to a host group, even though I clearly did not define them as part of the host group.
>> My hostgroups.cfg,
>> ###
>> define hostgroup {
>>     hostgroup_name  AUTH_DNS
>>     alias           Authoritative DNS
>>     members         CDC-Ns1auth,CDC-NSMS,IDC-Ns2auth,SDC-Ns3auth
>> }
>> define hostgroup {
>>     hostgroup_name  CACHE_DNS
>>     alias           Caching DNS
>>     members         CDC-Dnscache,CDC-Ns1,IDC-Ns2,SDC-Ns3
>> }
>> My hosts/CDC-Ns1auth.cfg,
>> ###
>> define host {
>>     host_name                  CDC-Ns1auth
>>     alias                      ns1auth.tls.net
>>     display_name               TLS DNS server
>>     address                    65.124.104.30
>>     parents                    null
>>     hostgroups                 null
>>     check_command              check-host-alive
>>     use                        generic-host
>>     initial_state              o
>>     max_check_attempts         3
>>     check_interval             15
>>     retry_interval             1
>>     check_period               24x7
>>     process_perf_data          1
>>     retain_status_information  1
>>     contact_groups             host-oncall
>>     notification_interval      15
>>     notification_period        24x7
>>     notification_options       d,u,r
>>     register                   1
>> }
>> Everything looks fine, but when I start Nagios and look at View Config-Object Type-Host Groups, I see this!
>> Group Name   Description           Host Members
>> CACHE_DNS    Caching DNS service   CDC-Dnscache , CDC-Ns1 , CDC-Ns1auth , IDC-Ns2 , IDC-Ns2auth , SDC-Ns3 , SDC-Ns3auth
>> The CACHE_DNS host group should only contain CDC-Ns1, IDC-Ns2, and SDC-Ns3. Why are the other hosts in there?
>> I have tried renaming, I have tried starting fresh with new config files, new retention data, new object cache. Yet the hostgroup CACHE_DNS still contains hosts I did not define to be there.
>> Any help is appreciated, clues, clue bats, links, (yes I looked at the change log and searched for an answer, no luck).
>> Thanks,
>> DAve
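The effect diagnosed in this thread can be reproduced in a few lines. This sketch is not Nagios code: it assumes that with true regexp matching enabled every name in a members list is treated as an unanchored regular expression, and it uses Python's re.search as a stand-in for that behaviour. The function name is hypothetical; the host names are the ones from the post.

```python
import re

ALL_HOSTS = [
    "CDC-Dnscache", "CDC-Ns1", "CDC-Ns1auth", "CDC-NSMS",
    "IDC-Ns2", "IDC-Ns2auth", "SDC-Ns3", "SDC-Ns3auth",
]


def expand_members(members, all_hosts, true_regexp):
    """Return the sorted host names a members list would expand to."""
    selected = set()
    for name in members:
        if true_regexp:
            # Unanchored search: 'CDC-Ns1' also matches 'CDC-Ns1auth'.
            selected.update(h for h in all_hosts if re.search(name, h))
        elif name in all_hosts:
            selected.add(name)
    return sorted(selected)


cache_dns = ["CDC-Dnscache", "CDC-Ns1", "IDC-Ns2", "SDC-Ns3"]
print(expand_members(cache_dns, ALL_HOSTS, true_regexp=True))
print(expand_members(cache_dns, ALL_HOSTS, true_regexp=False))
```

Under these assumptions the first call yields the same seven hosts DAve saw in the CACHE_DNS group, and the second yields only the four that were listed. Anchoring a pattern (for example ^CDC-Ns1$) would be another way to get exact matches while leaving regexp matching on.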
Re: [Nagios-users] Simplified Nagios user interface for end users/help desk
Adam Forsyth wrote:
> I'm monitoring lots of different services on various hosts with Nagios. Many of these things are very useful for me to monitor as the sys admin, but when alerts occur with them, they do not represent downtime for our users; they represent problems I should fix proactively before they result in user-noticeable downtime.
> For example, if Nagios notices a fan failure on one of my Procurve switches, I as the admin want to know about the problem, and probably want to replace the failing fan during the next scheduled maintenance time. The switch is still running just fine, however, and there is no effect on service to users. Currently when I get such a notification, I'd acknowledge the problem, and it would stay in critical state until I've fixed the problem.
> What I'd like to create is a more end-user-targeted display of Nagios data. It would display OK or Alert status based only on whether the particular service is up or down from the user perspective, and wouldn't show any of the proactive, nice-for-the-sys-admin-to-know-about details. So in the case of the Procurve switch, as long as the fan failure hasn't made the entire switch crash (we can still ping it) it would remain in an OK state.
> The only way I can think of to accomplish this would be to make a second installation of Nagios. It would be a lot of duplicate configuration, but many of the services would be left out. I think that would create this second end user display as I'm imagining it, but it would come at the expense of having to maintain 2 sets of configuration files, and the server would have to do duplicate checking of lots of the services and hosts.
> Can anyone think of a better way to accomplish this that wouldn't need to involve duplication of checks?
We have installed two of these for clients with Nagios in their networks. The best way we found, YMMV, was to use NagioSL. We wrote a flash app to poll NagioSL for devices to display on a pretty, non-threatening map. We then wrote a new alert script that updates the flash map with any alerts/recoveries. It's rough around the edges but it works.
DAve
[Nagios-users] disk IO for windows?
Anyone know of a plug-in or mechanism to log local disk I/O on windows? My nagios server is currently using check_nt to connect to windows hosts via nsclient++. I was hoping perhaps COUNTER has something buried within it to pull down this info. TIA -- Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] number of acknowledgments
I'm looking for a way to see whether there are any unacknowledged host/service problems within nagios. I've written my own gosh-awful ugly code that parses the nagios logs to find hosts that went down, haven't come back up, are still in the nagios config and have not been acknowledged. Although the code appears to work, I'm wondering if there's an easier/cleaner way to do it, i.e. some plugin that searches for ACKs more directly. TIA
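One alternative to parsing the logs is to read the current state directly from Nagios' status file. The sketch below is only an illustration, not an existing plugin: it assumes the Nagios 3 style status.dat layout (blocks named hoststatus and servicestatus containing key=value lines such as current_state and problem_has_been_acknowledged). Older releases name the blocks differently, so check your own file first. The function name is made up for this example.

```python
import re


def unacknowledged_problems(status_text):
    """Return (host, service) pairs that are in a problem state and not acknowledged.

    The service element is None for host-level problems.
    Assumes Nagios 3 style "hoststatus { ... }" / "servicestatus { ... }" blocks.
    """
    problems = []
    block_re = re.compile(
        r"^\s*(hoststatus|servicestatus)\s*\{(.*?)^\s*\}", re.S | re.M
    )
    for _kind, body in block_re.findall(status_text):
        # Turn the "key=value" lines of one block into a dict.
        fields = dict(
            line.strip().split("=", 1)
            for line in body.splitlines()
            if "=" in line
        )
        if fields.get("current_state", "0") == "0":
            continue  # state 0 is UP (hosts) or OK (services)
        if fields.get("problem_has_been_acknowledged", "0") != "0":
            continue  # somebody already acknowledged it
        problems.append(
            (fields.get("host_name"), fields.get("service_description"))
        )
    return problems
```

Feeding it the contents of the status file (its location is set by status_file in nagios.cfg) gives a list you can count or print; an empty list means nothing is waiting for an ACK.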
Re: [Nagios-users] configure Nagios to monitor the temperature of the server room
We've had great success with devices from avtech eg http://avtech.com/LP/LP_RMA_1.htm Added benefit; they also sell industrial grade light bars (sorta like traffic lights) that can be triggered by nagios to show overall status http://avtech.com/Products/Environment_Monitors/Room_Alert_Signal_Tower_A.htm On Thu, Apr 2, 2009 at 7:56 AM, Jim Avery j...@jimavery.me.uk wrote: 2009/4/1 ben amar benamar.moha...@free.fr: How to configure Nagios to monitor the temperature of the server room? What plugin? I can't say I've tried it myself, but this one is advertised on www.nagios.org :- http://www.nagios.org/products/environmental/esensors/em01b.php hth, Jim -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check windows file?
I'm looking for a plugin that I can run from the unix side to connect to a windows host and at least test for the existence of a file, although checking its contents would be even better. I couldn't find anything at nagiosexchange, unless my search string wasn't specific enough. Anyone? Thanks
[Nagios-users] hierarchical nagios
We have a primary nagios install that monitors hosts/services spread across multiple labs (political domains). In one of the labs, I've built a nagios install monitoring all of its local hosts and services. It would be interesting to tie the two together. Ultimately what I'd like to see is a 30,000 foot view monitoring just the distributed servers, with some kind of link to drill down into the individual labs. The top view would not change color if a host/service in a lab went down unless it were defined as a server. Is this possible? Has anyone done anything like this? TIA
Re: [Nagios-users] check_ping can't interpret output
Hugo van der Kooij wrote:

Hmm. Please read the various comments on the lifespan of fedora releases on this and other mailing lists. Is this what you want in a corporate environment?

Totally irrelevant.

I am a bit puzzled. Plugins are run as the nagios user and not as the apache user, so why test as the apache user? Unless your nagios setup deviates quite a bit in this regard, but I find that rather unlikely at this moment.

Yes, my error. However, running as user nagios behaved exactly the same as user apache.

Marc Powell wrote:

Beyond Hugo's comments about running as the user nagios, this would seem to indicate that /bin/ping is not SUID root. Is it? If not, chmod u+s /bin/ping.

Bingo! That was the problem. Thanks.

Dave Close
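Since the root cause here was a missing SUID bit on /bin/ping, a quick way to compare the dozen systems is to test that bit programmatically. This is a minimal sketch using only the Python standard library; the function name is invented for the example, and it only inspects the mode bits (it does not check that the file is owned by root).

```python
import os
import stat


def has_setuid_bit(path):
    """Return True if the file at `path` has the set-user-ID bit set."""
    mode = os.stat(path).st_mode
    return bool(mode & stat.S_ISUID)
```

Calling has_setuid_bit("/bin/ping") on each system and comparing the answers would have pointed at the odd one out; `ls -l /bin/ping` shows the same information as an `s` in the owner's execute position.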
[Nagios-users] check_ping can't interpret output
I have the same version of Nagios installed on a dozen systems (not machines, full systems with several hundred machines each). Except for the specific list of machines, the configuration is identical between them. All are installed by yum from the Fedora 7 RPMs, version 2.10-3.fc7. All the plugins were installed the same way and are version 1.4.11-2.fc7.

On one system only, check_host_alive (which calls check_ping) reports that all machines are down, while at the same time Nagios reports that most services on those machines are up. Running check_ping as root from the command line gets a successful response. Digging further, I have enabled user apache, which is the user httpd runs as, then tried the command as user apache. Here is the result.

  # /usr/lib/nagios/plugins/check_ping -H machine -w 3000.0,80% -c 5000.0,100% -p 1
  PING OK - Packet loss = 0%, RTA = 0.16 ms
  # su - apache -c "/usr/lib/nagios/plugins/check_ping -H machine -w 3000.0,80% -c 5000.0,100% -p 1"
  /bin/ping -n -U -w 10 -c 1 machine
  CRITICAL - Could not interpret output from ping command

Some online references appear to suggest this could be caused by a LANG environment variable. I temporarily replaced the check_ping command with a wrapper script that saved the environment to a file before calling the actual command. There was no difference in LANG and the only other differences had to do with the name of the last machine tested. All systems report LANG=en_US.UTF-8.

I have compared every configuration file for Nagios and Apache between systems and not found any unexpected discrepancy. I have verified the installation of all relevant packages (rpm -V). I can't find anything which explains the different behavior on the one system. Other suggestions are welcome.

-- Dave Close
[Nagios-users] nagios plugins on 32 bit
I'm trying to build nagios plugins (1.4.12) on several 32 bit hosts (suse 10.1). The configs complete but throw a lot of warnings about 32 vs 64 bit. Running the plug-ins generate segmentation faults. I tried setting LD_LIBRARY_PATH but that didn't make a difference. Google shows me one potential solution about making some (fake?) link but this sounds like an ugly hack to me. Is there a configure switch for 32 bit? TIA - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Thoughts about a custom plugin
This can be done without NRPE. We have a cluster on a private net. The head node is visible from nagios, so I wrote a plugin that lives on the head node (the dual-homed machine, in your case). On the nagios side, a plugin ssh'es to the head node and runs the script, which talks to hosts on the private net and returns the output/status.

On Sun, Jul 20, 2008 at 11:17 AM, stan [EMAIL PROTECTED] wrote:

On Sun, Jul 20, 2008 at 10:46:39AM -0400, Paulus, Jake wrote:

You should really look into NRPE (Nagios Remote Plugin Execution) http://nagios.sourceforge.net/docs/3_0/addons.html#nrpe There are alternate versions of NRPE that run on Windows, etc. if you must do that. Check out NagiosExchange.org for those. You should find that you can use normal plugins you already use elsewhere with NRPE to do what you need.

I'm not certain this is intended for exactly the application I have. Perhaps it is, and I am misunderstanding it. Let me clarify my application a bit with an ASCII diagram:

  +--------+     +---------+      +--------+
  | NAGIOS |-----|  DUAL   |--+---| TARGET |
  |  Host  |     |  HOMED  |  |   +--------+
  |        |     | Machine |  |
  +--------+     +---------+  |   +--------+
                              +---| TARGET |
                                  +--------+

The Nagios host can get to the dual homed machine, and I can use all the standard plugins to get status from it. What I want to do is get status from the TARGETS. Are you suggesting using NRPE running on the dual homed machines? And somehow using this to query the targets? The web page you pointed out makes me believe that NRPE is intended to allow me to execute checks on a reachable machine, by using local scripts there, rather than having the NAGIOS machine use the more traditional check_* plugins. Am I confused?

-- One of the main causes of the fall of the roman empire was that, lacking zero, they had no way to indicate successful termination of their C programs.
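The ssh-through-the-head-node approach described at the top of this message can be sketched roughly as follows. This is not the plugin mentioned in the message, just an illustration of the idea: the host name, the remote script path and the function names are all hypothetical, and it assumes passwordless key authentication is already set up for the nagios user. Exit codes outside the usual plugin range 0-3 (for example ssh's own failure code) are reported as UNKNOWN.

```python
import subprocess

NAGIOS_UNKNOWN = 3


def build_ssh_command(head_node, remote_script, target):
    """Build the ssh argv that asks the head node to check one private host."""
    # BatchMode makes ssh fail instead of prompting for a password.
    return ["ssh", "-o", "BatchMode=yes", head_node, remote_script, target]


def run_check(argv, timeout=30):
    """Run a command and return (nagios_exit_code, first_output_text)."""
    try:
        proc = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return NAGIOS_UNKNOWN, "UNKNOWN - could not run check: %s" % exc
    output = proc.stdout.strip() or proc.stderr.strip()
    # Anything outside 0..3 is not a valid plugin status.
    code = proc.returncode if proc.returncode in (0, 1, 2, 3) else NAGIOS_UNKNOWN
    return code, output
```

A wrapper plugin on the Nagios host would call run_check(build_ssh_command("headnode", "/usr/local/bin/check_private_host", "target1")), print the returned text and exit with the returned code, so Nagios sees the target's status as if it had been checked directly.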
[Nagios-users] monitor status of legato backups
We're looking for some method to have nagios check the status of our many backups run off a backup server. Currently, we have a perl script search for just one or two specific lines and pass the results to send_nsca. It's crude and doesn't cover all instances. The problem is the nsr logfile has so much info in it, it seems to REQUIRE human perusal to determine status of all backups. Has anyone spent any time on a plugin/script for this before I code what might already exist? Thanks - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios, on a remote server
Good afternoon, We use Nagios here and we would like to have a Nagios install outside of our network. We are in the midwest with our fiber coming out of Chicago. Ideally we would like to get a BSD/Linux host serviced by the west or east coast to monitor our public services. We can certainly find a hosting provider to use, but I am interested in any recommendations. We are fine with a virtual server, but we need full control. The server will only host a Nagios install polling back to our server's public interfaces. Thanks, DAve -- In 50 years, our descendants will look back on the early years of the internet, and much like we now look back on men with rockets on their back and feathers glued to their arms, marvel that we had the intelligence to wipe the drool from our chins. - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_snmp
I've got two nagios installs with snmp giving different responses. Details:

Host1 is an operational nagios 2.9 install on a SuSE 10.1 64-bit machine.
Host2 is a development machine, SuSE 10.2 64-bit, with newly installed nagios 3.02 and nagios-plugins-1.4.12. I have all the perl modules necessary (Net SNMP, Crypt DES, Digest HMAC) as well as libgd.

Running

  check_snmp -H SOMEHOST -o 1.3.6.1.4.1.674.10892.1.700.20.1.6.1.1 -C Nmystring% -P 1 -c 1:250 -u degC*10 -l 'Front side I/O Temp: '

returns, on the operational host,

  Front side I/O Temp: OK - 200 degC*10 | iso.3.6.1.4.1.674.10892.1.700.20.1.6.1.1=200

But from the development host, I get

  Front side I/O Temp: problem - No data received from host
  CMD: /usr/bin/snmpget -t 1 -r 5 -m '' -v 1 [authpriv] SOMEHOST:161 1.3.6.1.4.1.674.10892.1.700.20.1.6.1.1

Both resolve the hostname the same, and other plugins to the same host (ping, MySQL) work fine. Strace of the command shows it finding all the same files until it times out. Where else can I look for differences? TIA
[Nagios-users] analog nagios transmitter
Anyone know of a dirt-simple analog device that can take ANY red status from nagios and transmit the signal to some visible device? Details: We have a medium nagios install on a closed network and wish to have it transmit a "something is wrong" message to some device not on the network. We are further constrained in what kind of signal it can be: it must be just a 1 or 0. A 1 lights up a red light indicating we need to go to another building and look at nagios, i.e. ANY host/service being down should trigger this. TIA
Re: [Nagios-users] External nagios monitoring
DAve wrote:

Good morning,

We are writing an app for a client that will display Nagios status in a way that doesn't strike fear into the minds of the PHBs. Currently the designers have a nice Flash based app with lots of pretty pictures and dancing monkey icons, all the glitter a CIO could want. I need to stuff data into a DB for the designer to use for updating the app. I've looked at event handlers but I am leaning towards using the notification script to also update the DB. I don't think the event handler will execute for an acknowledgment, correct?

My question: has anyone ever done this? I need to send very little info:

1) the date
2) the host/service
3) the notification type
4) the admin name/string who acks the notification

I have access to #1 through #3 for the notification handler obviously, but I don't see where I have access to #4. Will I have to parse comments.dat for that information or can I grab it somewhere else? Running Nagios 2.5

Never mind, sorry for the luser moment. I just looked at the correct Nagios version and found $HOSTACKAUTHOR$ and $SERVICEACKAUTHOR$.

DAve

Thanks, DAve -- In 50 years, our descendants will look back on the early years of the internet, and much like we now look back on men with rockets on their back and feathers glued to their arms, marvel that we had the intelligence to wipe the drool from our chins.
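The notification-script-writes-to-a-DB idea discussed above could look something like the following. It is a sketch under several assumptions, not the script actually used: SQLite stands in for whatever database the designers read from, the table and function names are invented, and the four values are expected to be passed in by the notification command definition (for instance from the $LONGDATETIME$, $HOSTNAME$ / $SERVICEDESC$, $NOTIFICATIONTYPE$ and $SERVICEACKAUTHOR$ macros).

```python
import sqlite3


def open_db(path):
    """Open (and if needed initialise) the notification log database."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS notifications ("
        "logged_at TEXT, host TEXT, service TEXT, "
        "notification_type TEXT, ack_author TEXT)"
    )
    return conn


def record_notification(conn, logged_at, host, service,
                        notification_type, ack_author=""):
    """Insert one notification row; ack_author is empty unless it is an ACK."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO notifications VALUES (?, ?, ?, ?, ?)",
            (logged_at, host, service, notification_type, ack_author),
        )
```

The existing notification command would keep sending its email and additionally invoke a small script that calls record_notification() with the macro values as arguments; the display app then only needs read access to the table.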
[Nagios-users] External nagios monitoring
Good morning,

We are writing an app for a client that will display Nagios status in a way that doesn't strike fear into the minds of the PHBs. Currently the designers have a nice Flash based app with lots of pretty pictures and dancing monkey icons, all the glitter a CIO could want. I need to stuff data into a DB for the designer to use for updating the app. I've looked at event handlers but I am leaning towards using the notification script to also update the DB. I don't think the event handler will execute for an acknowledgment, correct?

My question: has anyone ever done this? I need to send very little info:

1) the date
2) the host/service
3) the notification type
4) the admin name/string who acks the notification

I have access to #1 through #3 for the notification handler obviously, but I don't see where I have access to #4. Will I have to parse comments.dat for that information or can I grab it somewhere else? Running Nagios 2.5

Thanks, DAve -- In 50 years, our descendants will look back on the early years of the internet, and much like we now look back on men with rockets on their back and feathers glued to their arms, marvel that we had the intelligence to wipe the drool from our chins.
Re: [Nagios-users] No Email Notifications
Your command line should look like this:

  /usr/bin/printf %b * Nagios 2.6 *\n\nNotification Type: PROBLEM\nHost: SMTP\nState: DOWN\nAddress: IP ADDRESS\nInfo: PING CRITICAL - Packet loss = 100%\n\nDate/Time: Tue Feb 5 17:14:02 EST 2008\n | /usr/bin/mail -s Host DOWN alert for SMTP! EMAIL ADDRESS

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Gregory Wong
Sent: Tuesday, February 05, 2008 5:34 PM
To: Patrick Morris
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] No Email Notifications

In /usr/bin there is mail and printf. Should I be using /usr/bin/mail or /usr/bin/printf before the notification string?

On 2/5/08 5:24 PM, Patrick Morris [EMAIL PROTECTED] wrote:

On Tue, 05 Feb 2008, Gregory Wong wrote:

Hi Everyone, I am having some problems getting email notifications on a new Nagios server I just built. I have only set up PING and SMTP checks to see if everything is working properly. The Event Log on the web interface shows the following message:

  [02-05-2008 17:14:02] Warning: Attempting to execute the command /usr/bin/mail/printf %b * Nagios 2.6 *\n\nNotification Type: PROBLEM\nHost: SMTP\nState: DOWN\nAddress: IP ADDRESS\nInfo: PING CRITICAL - Packet loss = 100%\n\nDate/Time: Tue Feb 5 17:14:02 EST 2008\n | -s Host DOWN alert for SMTP! EMAIL ADDRESS resulted in a return code of 127. Make sure the script or binary you are trying to execute actually exists...

Have you done what the error message suggests, and looked to see if /usr/bin/mail/printf actually exists? Betchya it doesn't.
Re: [Nagios-users] Acking an alert by replying to an email
fetchmail might do the job for you. Because of a particular arrangement of our mail server, I was unable to use procmail for a similar purpose (not nagios but email front-end for rt). It depends on whether notes is willing to speak to a client with a protocol that fetchmail knows. (POP worked for me, not sure what notes likes.) Added advantages over procmail - definitely simpler, probably consumes fewer resources, possibly easier to keep secure (especially due to the complexity issue). I am unfamiliar with notes, doesn't it have some sort of native pre-processing available? Dave On Jan 4, 2008 6:31 AM, [EMAIL PROTECTED] wrote: It's the procmail part that will be the political hot potato. Thou Shalt Have No Other Email Platform Before Me is the edict from on high, regarding notes. Not withstanding that the nagios server has to have a mailing agent of some sort to work of course. I'll see what I can squeak past the powers that be. Petersen, Mark [EMAIL PROTECTED] Sent by: [EMAIL PROTECTED] To [EMAIL PROTECTED], nagios-users@lists.sourceforge.net cc 01/04/2008 09:22 AM Subject Re: [Nagios-users] Acking an alert by replying to an email I think the easiest solution would be to have Lotus Notes forward mail for [EMAIL PROTECTED] (or whatever from address you use) to [EMAIL PROTECTED] and run procmail on the nagios server. Is this compatible with your setup? I don't really know much about Notes, but I assume it's not running it on the nagios box. mark - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by: Microsoft Defy all challenges. 
Re: [Nagios-users] restricting unknown user
arpwatch does something like that: it keeps an eye on which MAC addresses are active using which IP on the local subnet, and sends an alert when a new MAC address appears or some change occurs in the active MAC/IP pairs. Currently there is no integration between arpwatch and nagios. arpwatch sends email alerts, and maybe outputs to the log or console? I don't think it is a good fit, since nagios assumes that all the hosts it cares about are pre-defined in its config file. There might be a web front end available for arpwatch that I am unfamiliar with; if so, that sounds more like what you want. HTH Dave

On Dec 9, 2007 2:56 AM, sachin kumar [EMAIL PROTECTED] wrote:

Hi list, I want to configure nagios in such a way that if an unknown MAC address enters the network to access resources, that system will be displayed in nagios. I want to create a list of the MAC addresses which are in our network and link it with nagios, so that if any other MAC address (not in this list) connects to the network, it will be displayed in nagios.

- sachin kumar (sachin1361)

The mailing list archive is found here: http://www.nagiosexchange.org/nagios-users.34.0.html
[Nagios-users] Need help using check_by_ssh with check_disk
Oops, sent from wrong return address.

-- Forwarded message --
From: Dave Burns [EMAIL PROTECTED]
Date: Nov 29, 2007 11:12 AM
Subject: Re: [Nagios-users] Need help using check_by_ssh with check_disk
To: nagios-users@lists.sourceforge.net

On Nov 29, 2007 6:09 AM, Sean Bowers [EMAIL PROTECTED] wrote:

From the command line, both hosts work fine and 'echo $?' shows 0 for both.

In a similar situation, I found it helpful to break my command line tests into steps. First ssh directly into the remote as nagios and execute the disk check, as you seem to have done. Then

  sudo -u nagios ssh -i /var/spool/nagios/.ssh/id_dsa [EMAIL PROTECTED] /usr/lib/nagios/plugins/check_disk -w 10% -c 5% -e -m -p /

to make sure the key is working and the args and path are correct. Then

  sudo -u nagios /usr/lib/nagios/plugins/check_by_ssh -H linoleum -i /var/spool/nagios/.ssh/id_dsa -C "/usr/lib/nagios/plugins/check_disk -w 10% -c 5% -e -m -p /"

If that works, the regular nagios process should work. I once had a sticky problem where I could get the above working, but nagios kept telling me unknown. It turned out I had a subtle little typo in my config that made it different from my command line tests. Headache!

HTH, Dave

Here are my defs

  define command{
          command_name    check_slash_free
          command_line    $USER1$/check_by_ssh -i /var/spool/nagios/.ssh/id_dsa -H $HOSTADDRESS$ -C '$USER1$/check_disk -w $ARG1$ -c $ARG2$ -e -p \/ -m '
          }

and the service

  define service {
          host_name               hostname
          service_description     check slash
          check_command           check_slash_free!30%!20%
          max_check_attempts      3
          retry_check_interval    1
          normal_check_interval   5
          check_period            24x7
          notification_interval   120
          notification_period     24x7
          notification_options    w,u,c,r,f
          contact_groups          localadmins
          }
[Nagios-users] putting limits on check_by_ssh
I've been RTFMing SSH. For background authentication like nagios uses, the book I'm reading recommends using the user's ssh config file to restrict each passwordless key to a single task. So if you want nagios to be able to do 3 kinds of checks without a password, you put 3 keys in the nagios user's .ssh/authorized_keys file, each with its own command= option. Then no matter what nagios thinks it is asking for (the parameter of check_by_ssh), it gets whatever is configured for that key.

I'm just wondering if anyone has taken this approach. It seems a bit complicated, spreading some of the nagios config info around to each monitored system, but it sort of appeals to me. Then you know that even if someone manages to get your key, all they can do is check_disk or something else boring. Am I missing something?

In addition to limiting key authentication to specific tasks, I also put an '*' in the nagios user's password field in /etc/passwd, which prevents logging in as that user by password.

Thanks,
Dave
Re: [Nagios-users] putting limits on check_by_ssh
Thanks to the two Toms for their helpful responses.

On Nov 16, 2007 8:26 AM, Thomas Guyot-Sionnest [EMAIL PROTECTED] wrote:
> While I use command-restricted keys for all passwordless auth (usually cronjobs), this is the reason why I never looked into check_by_ssh and use NRPE instead.

NRPE makes me a bit nervous because I suppose (without any data to back it up) that relatively few people use it (at least compared to ssh). What criteria did you use in evaluating check_by_ssh vs. NRPE? SSH is tempting because I don't have to install and configure much new stuff or learn much new stuff, or at least what I learn has broad applications. And though the configuration may be a bit long-winded, it seems pretty clear. And I *think* I can nail it down pretty well, so that even if my private keys on the nagios server got compromised, nothing much bad could happen to the other hosts.

> You can also set the shell to /bin/false and set a non existent home directory (ex. /nonexistent).

Doesn't the nagios user need a shell and a home dir to run the daemon? Maybe I was just thinking that it needed a home dir to put the ssh keys in, but they can be located anywhere. Slowly understanding blooms.

On Nov 16, 2007 8:51 AM, Tom Throckmorton [EMAIL PROTECTED] wrote:
> I do something similar, though also add a 'from' restriction, in the event the private key is compromised - here are a few examples:

Thanks for the examples and suggestions, they help.

> I usually only allow a single command per host - on hosts which I want to execute multiple commands, rather than having a keypair-per-command, I make the command a script which sanitizes the input and checks the command against a list of predetermined allowed commands.

Interesting. I hadn't gotten to the details of the forced commands part of the ssh book yet, so I didn't know about $SSH_ORIGINAL_COMMAND. Hmmm. Flexible, complicated. A bit harder to be sure I haven't left any cracks. Stuff to think about.

Dave
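For anyone reading this thread later, the forced-command script Tom describes might look roughly like the sketch below. This is only an illustration under assumptions: the wrapper path, the whitelist entries and the `from=` host name are made up for the example, and the key material in the comment is elided. The `command=`, `from=` and `no-*` options are standard authorized_keys options, and sshd places the client's requested command in `SSH_ORIGINAL_COMMAND`.

```shell
#!/bin/sh
# Hypothetical forced-command wrapper, e.g. /usr/local/bin/nagios-ssh-wrapper,
# referenced from the nagios user's authorized_keys on one line (key elided):
#   command="/usr/local/bin/nagios-ssh-wrapper",from="nagios.example.org",no-port-forwarding,no-X11-forwarding,no-agent-forwarding,no-pty ssh-dss AAAA... nagios@server

PLUGIN_DIR=/usr/lib/nagios/plugins   # assumed plugin location

# Succeeds only if the requested command line is on the whitelist.
is_allowed() {
    # Strip every permitted character; anything left over (;, |, $, `, ...) is rejected.
    leftover=$(printf '%s' "$1" | tr -d 'A-Za-z0-9_./% -')
    [ -z "$leftover" ] || return 1
    case "$1" in
        "$PLUGIN_DIR/check_disk "*|"$PLUGIN_DIR/check_load "*) return 0 ;;
        *) return 1 ;;
    esac
}

# With no requested command (an interactive login attempt) the script does nothing.
if [ -n "${SSH_ORIGINAL_COMMAND:-}" ]; then
    if is_allowed "$SSH_ORIGINAL_COMMAND"; then
        # Unquoted on purpose: word splitting turns the string into argv.
        exec $SSH_ORIGINAL_COMMAND
    else
        echo "UNKNOWN: command not permitted"
        exit 3
    fi
fi
```

The whitelist is deliberately strict (exact plugin paths, a small set of permitted characters), which addresses the "harder to be sure I haven't left any cracks" worry at the cost of having to edit the script whenever a new check is added.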
Re: [Nagios-users] putting limits on check_by_ssh
My problems with NRPE are:
1) I know nothing about it.
2) I think it requires me to open up another port on my machine.

My problems with SSH are:
1) Configuration is a bit tedious.

My positives for SSH are:
1) It is very well known and respected.
2) Lots of people use it for lots of things, not just nagios.

My positives for NRPE are:
1) I was hoping you guys would tell me what helped you decide to go with NRPE. I just don't know much about it.

Since I feel fairly confident that I could seal up an SSH configuration, but NRPE is an unknown factor for me, it is very tempting to stick with SSH.

Thanks,
Dave
Re: [Nagios-users] Can Nagios implement with Expect to automate tasks when issues arise?
Event handlers can take care of any common task, but you need to be careful with them. Why does the task need to be done so frequently? Shouldn't we take care of the cause rather than the symptom? Also, scripts that fix known problems can be very dangerous if they inadvertently run at times when a service/host looks critical but "critical" wasn't defined well enough, particularly if you're not using parent dependencies. So if you do go this root (pun intended), add in a LOT of checks to the expect script.

On Nov 14, 2007 11:27 AM, Patrick M. [EMAIL PROTECTED] wrote:
> Hi all,
>
> I just learned what Expect was and wrote a few scripts to automate certain tasks around the office, and now I'm wondering if Expect can be used in conjunction with Nagios. Is this possible?
>
> Example: One of our servers goes down at least twice a day, and we know it's problematic. We connect to the power strip and recycle the power on the port that machine is plugged into. I'd like to use Expect to automate this when Nagios notices it is down.
>
> Has anyone else implemented something similar to what I'm describing? Thanks in advance.
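As a rough illustration of the "LOT of checks" advice, a host event handler could be wrapped in a guard like the one below. It is a sketch under assumptions: the handler path, the stamp-file location, the one-hour cooldown and the `power_cycle.exp` Expect script are all hypothetical. The only Nagios-specific pieces relied on are the standard `$HOSTSTATE$`, `$HOSTSTATETYPE$` and `$HOSTADDRESS$` macros passed in as arguments.

```shell
#!/bin/sh
# Guard for a host event handler. Assumed command_line:
#   $USER1$/eventhandlers/power_cycle_guard $HOSTSTATE$ $HOSTSTATETYPE$ $HOSTADDRESS$

# Act only on a confirmed outage: DOWN (not UNREACHABLE) and HARD (not SOFT).
should_power_cycle() {
    state=$1; state_type=$2
    [ "$state" = "DOWN" ] && [ "$state_type" = "HARD" ]
}

# True if no action was recorded within the last <cooldown> seconds.
cooldown_elapsed() {
    stamp=$1; cooldown=$2; now=$3
    if [ ! -f "$stamp" ]; then
        return 0
    fi
    last=$(cat "$stamp")
    [ $((now - last)) -ge "$cooldown" ]
}

if [ $# -ge 3 ]; then
    stamp="/var/tmp/power_cycle.$3.stamp"   # hypothetical location
    now=$(date +%s)
    if should_power_cycle "$1" "$2" && cooldown_elapsed "$stamp" 3600 "$now"; then
        echo "$now" > "$stamp"
        # Hypothetical Expect script that logs in to the power strip.
        /usr/local/nagios/libexec/eventhandlers/power_cycle.exp "$3"
    fi
fi
```

Ignoring UNREACHABLE is what makes parent dependencies matter here: if the switch in front of the server dies, the server should show as UNREACHABLE rather than DOWN, and the handler stays quiet.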
[Nagios-users] Email Notifications
Hi list,

I've successfully installed nagios 2.9 and I'm monitoring hosts OK, but notifications are not being sent out. The event log and notification log show that notifications should be going out, but they are not being received and I cannot find any trace of an email.

Best Regards - Dave Roberts (davros)

--- The mailing list archive is found here: http://www.nagiosexchange.org/nagios-users.34.0.html
Re: [Nagios-users] Nagios-users Digest, Vol 15, Issue 10
On 8/7/07, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Send Nagios-users mailing list submissions to nagios-users@lists.sourceforge.net To subscribe or unsubscribe via the World Wide Web, visit https://lists.sourceforge.net/lists/listinfo/nagios-users or, via email, send a message with subject or body 'help' to [EMAIL PROTECTED] You can reach the person managing the list at [EMAIL PROTECTED] When replying, please edit your Subject line so it is more specific than Re: Contents of Nagios-users digest... Today's Topics: 1. check_nt default port (sujith h) 2. Re: check_nt default port (Frost, Mark {PBG}) 3. Re: two notifications everytime (Marc Powell) 4. Re: two notifications everytime (Lalita Drolia) 5. Re: two notifications everytime (Lalita Drolia) 6. Re: two notifications everytime (Marc Powell) 7. N2rrd problem (Mattia) 8. CGIs path to css is wrong (Marcus Rej?s) 9. Re: dont want notification when machine reboots (Jim Avery) 10. Re: dont want notification when machine reboots (Marc Powell) 11. Re: N2rrd problem (Dani) 12. Re: Trap Handling Scheme (Jim Avery) 13. Re: check_nt default port (Hugo van der Kooij) 14. Redudant checks (but also two notifies) (GJP) -- Message: 1 Date: Mon, 6 Aug 2007 12:45:48 +0530 From: sujith h [EMAIL PROTECTED] Subject: [Nagios-users] check_nt default port To: Nagios Users mailinglist nagios-users@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=utf-8 Can any one help me to find the default port used by nagios for the plugin check_nt(not NSClient). Cheers, Sujith Haridasan -- ?? ??? -- next part -- An HTML attachment was scrubbed... 
-- Message: 2 Date: Mon, 6 Aug 2007 03:22:33 -0400 From: Frost, Mark {PBG} [EMAIL PROTECTED] Subject: Re: [Nagios-users] check_nt default port To: sujith h [EMAIL PROTECTED] Cc: Nagios Users mailinglist nagios-users@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=utf-8 Sujith, If you run check_nt --help you'll see a bunch of output including the following: -p, --port=INTEGER Optional port number (default: 1248) Mark Sujith, try port 5667 From: [EMAIL PROTECTED] [mailto: [EMAIL PROTECTED] On Behalf Of sujith h Sent: Monday, August 06, 2007 3:16 AM To: Nagios Users mailinglist Subject: [Nagios-users] check_nt default port Can any one help me to find the default port used by nagios for the plugin check_nt(not NSClient). Cheers, Sujith Haridasan -- ?? ??? -- next part -- An HTML attachment was scrubbed... -- Message: 3 Date: Mon, 6 Aug 2007 07:09:17 -0500 From: Marc Powell [EMAIL PROTECTED] Subject: Re: [Nagios-users] two notifications everytime To: nagios-users@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=US-ASCII -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Lalita Drolia Sent: Monday, August 06, 2007 12:49 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] two notifications everytime Hi, I have been using nagios for a long time now. But suddenly since a couple of days, all email notifications are being sent out twice. Sounds like you may have 2 copies of nagios running at the same time. -- Marc -- Message: 4 Date: Mon, 6 Aug 2007 05:14:28 -0700 From: Lalita Drolia [EMAIL PROTECTED] Subject: Re: [Nagios-users] two notifications everytime To: Marc Powell [EMAIL PROTECTED], nagios-users@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=us-ascii I didn't know that was possible. If it is, then that must be the case. Please tell me how can I check and stop that? 
Regards, Lalita -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Marc Powell Sent: Monday, August 06, 2007 5:39 PM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] two notifications everytime -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Lalita Drolia Sent: Monday, August 06, 2007 12:49 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] two notifications everytime Hi, I have been using nagios for a long time now. But suddenly since a couple of days, all email notifications are being sent out twice. Sounds like you may have 2 copies of nagios running at the same time. -- Marc - This
[Nagios-users] check_apc_temp
Hi, can you please tell me where to get the check_apc_temp plugin? Thanks
[Nagios-users] nagiosgraph restart, aggregate.cgi
I had a linux-based nagios 2.9 install with nagiosgraph 0.8. The server recently shut down and upon startup, while nagios ran fine and nagiosgraph.log and perfdata.log both showed the correct type of activity, new graphs simply weren't generated. The classic "Illegal attempt to update" message goes into the log. I took this to be just a warning that one of the epoch values is incorrect. I base this assumption on the fact that higher-level debug shows RRD inserts.

I tried running insert.pl manually and moving the RRDs aside to make new ones, but still couldn't get the graphs generated, so I backed up my setup and upgraded to nagiosgraph 0.9. Now I am generating graphs; however, nagiosgraph-0.9 has an update.pl which moves the RRDs into hostname subdirectories. I installed aggregate.cgi with the previous version to create a single webpage with all the graphs, and this no longer works due to the RRD format or location.

Does anyone have a patch or newer version of aggregate.cgi (http://www.nebrwesleyan.edu/people/stpierre/aggregate.cgi)?

Also, I notice that the perfdata.log file seems to get rolled on its own every few minutes, and running nagiosgraph with warning-level debugging produces a "perfdata not recognized" warning. I presume these two events are related.

Thanks
[Nagios-users] wild cards with exceptions?
I'm trying to streamline my nagios config using wildcards. Unfortunately, not all the services I wish to define via wildcard follow a clean set of rules. Is it possible to define a service with a host list of something like

    *,!linux1, !linux2

I suspect the answer is no and what I'd need to do is use a combination of hostgroups and hosts, e.g.

    define service {
        hostgroup unix, ultrix, sco
        service_description
    }
    define service {
        host_name host1, host2, host3, host4
        ...
    }

Anyone find a way around this?

TIA
Re: [Nagios-users] Just using SNMP
You can do the NSclient installs remotely by staging it on a common share and using a .cmd file, something like this:

    mkdir c:\nsclient
    @if %ERRORLEVEL%==1 goto :EXIT
    copy C:\\temp\nsclient\Win_2k_XP_Bin\* c:\nsclient
    @if %ERRORLEVEL%==1 echo error occurred while copying files
    c:\nsclient\pNSClient.exe /install
    @if %ERRORLEVEL%==1 echo error running pNSClient
    net start nsclient
    @if %ERRORLEVEL%==1 echo error occurred while starting nsclient service
    :EXIT

The above can be run remotely through a variety of mechanisms, even by submitting a remote scheduler job to run once. Then just write a script to install on all your servers once it's been tested. It's worth it in the long run.
[Nagios-users] nagfs
Hiya.

nagfs is an event broker module which implements a filesystem interface to the current status of a running Nagios daemon. In a nutshell, this means that you can query the status of a given service or set of services using everyday shell tools like find and grep.

Each host is represented as a directory in the filesystem. Each host directory contains a file for each service nagios is monitoring on that host. Each file contains a single number representing the state of that service. For example, once nagfs is running, you can list all the services currently in a critical state by typing:

    grep -rl 2 /usr/share/nagios/status/local

The numbers are what you'd expect (0: OK, 1: Warning, 2: Critical, 3: Unknown). If the service is in a soft state, a servicename.soft file is created with the soft state status number. Each host directory also contains a HOST file which contains the current status of the host itself. As a second example, you could get a list of all the services Nagios is checking with:

    find /usr/share/nagios/status/local -type f

This is a handy thing, and much easier than scraping html for the same info.

nagfs registers for SERVICE_STATUS_DATA and HOST_STATUS_DATA updates from the Nagios server, which means that the nagfs filesystem is immediately brought up to date upon Nagios startup, and is always perfectly in sync with Nagios's memory state.

It's released under the BSD license. The tarball is available on nagios exchange, and on my blog http://www.skeptech.org

Nagfs was originally the brainchild of Jeremy Blosser, who implemented it as a collection of shell scripts called via global event handlers. I found it so useful, and became so attached to it, that I eventually forgot it wasn't actually part of the core Nagios package. I later re-implemented it as an event broker module as an object lesson for the Addison-Wesley nagios book I authored, and refined it as part of a larger nagios clustering project called nagimesh, which is not yet complete. However, I find nagfs so indispensable that I decided to release the source as a stand-alone NEB module, in the hope that you too may someday forget the time you've already spent without it.

Check INSTALL in the tarball for installation instructions. It is assumed that you have a working Nagios implementation which has been compiled with the --enable-event-broker configure switch. I have several Nagios servers running versions of Nagios from 2.0alpha1 to 2.8, and nagfs works with all of them, so it ought to work for you if you haven't ventured into 3.0 territory yet. I haven't taken a look at the 3.0 event broker code yet, so for all I know it could work (if you try it and it works for you, let me know).

Have fun!

-dave josephsen.
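Building on the grep and find examples in the announcement, a small helper along these lines could list only hard-state services in a given state. It is a sketch that assumes the layout described above (one directory per host, one file per service holding a single number, plus HOST and servicename.soft files); the function name is made up, and whether real nagfs files end in a newline has not been checked.

```shell
#!/bin/sh
# List service files whose entire content is the given state number,
# skipping the per-host HOST file and the *.soft soft-state files.
nagfs_services_in_state() {
    root=$1; state=$2
    find "$root" -type f ! -name HOST ! -name '*.soft' \
        -exec grep -lx "$state" {} + | sort
}
```

For example, `nagfs_services_in_state /usr/share/nagios/status/local 2` would print one path per critical service. Unlike the plain `grep -rl 2`, the `-x` flag only matches files whose line is exactly the state number.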
[Nagios-users] list remote logged on users
Anyone know of a plugin (NRPE, nsclient) that will report back to nagios on linux a list of users logged into win2000 or winXP? I can do this on the Windows side through scripting and running psloggedon from the resource kit on each host. My users would find it useful to see which machines are available.

TIA
[Nagios-users] two sets of host groups
Is it possible to have hosts in two sets of groups? Currently I've divided hosts by hostgroup names associated with their physical location. But it would be quite useful to also have hostgroups based on operating systems. Anyone have a way to accomplish this and allow multiple views?

TIA
[Nagios-users] plugins with rsh
We have a legacy setup with tru64 hosts that are currently configured for rsh, not ssh. I've installed the latest and greatest nagios on a relatively recent fedora host and have used plugins with ssh in the past.

The problem is that rsh doesn't return status codes properly. Rather, it returns only the status code of rsh itself, not the command. A search implies you just need to echo $? with rsh. With rsh from linux to linux, I get this:

    linux1 rsh linux1 'date ; echo $?'
    Wed Jan 3 15:15:07 EST 2007

and trying to generate an error:

    linux1 rsh linux1 'bogus ; echo $?'
    bogus; Command not found.
    1

So that looks OK, but the first problem is when I put this into the nagios command definition:

    rsh $HOSTADDRESS$ 'mycommand; echo $?'

The web page shows (No output!). Is it possible it swallows the quotes?

Further, rsh behaves differently on tru64:

    linux1 rsh tru64-host 'date;echo $?'
    Variable syntax.

Any ideas for these problems?

Thanks
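One way to approach both problems is to move the rsh call into a small wrapper plugin, so the `$?` never appears in the Nagios command definition (where `$` introduces a macro and a literal dollar sign normally has to be written `$$`), and to force a Bourne shell on the remote side. "Variable syntax." looks like a csh complaint about `$?`, though that is a guess about the tru64 login shell. The sketch below is untested against real tru64 hosts; the script name, the `RC=` marker and the usage are all assumptions.

```shell
#!/bin/sh
# check_by_rsh (hypothetical name): run a plugin over rsh and recover its exit code.
# Assumed usage: check_by_rsh <host> <remote command line without single quotes>

# Reads the remote output on stdin; the last line is expected to be "RC=<0-3>".
# Prints the other lines and returns that code. If the marker is missing or
# outside 0-3, prints everything and returns 3 (UNKNOWN).
relay_status() {
    output=$(cat)
    last=$(printf '%s\n' "$output" | tail -n 1)
    case "$last" in
        RC=[0-3]) ;;
        *) printf '%s\n' "$output"; return 3 ;;
    esac
    printf '%s\n' "$output" | sed '$d'
    return "${last#RC=}"
}

if [ $# -ge 2 ]; then
    host=$1; shift
    # sh -c '...' keeps a csh login shell from trying to expand $? itself.
    rsh "$host" "sh -c '$*; echo RC=\$?'" | relay_status
    exit $?
fi
```

The Nagios command line would then be something like `$USER1$/check_by_rsh $HOSTADDRESS$ /path/to/plugin args`, with no dollar signs or nested quotes for Nagios to trip over.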
Re: [Nagios-users] small alternative frontend
On 10/10/06, Chris Waters [EMAIL PROTECTED] wrote:
> Robin,
>
> I have a perl cgi (stoplight.pl) that does a redlight/greenlight page, based on whether all problems are handled or not. It currently does not do warning states or display the problem hosts/services but could be extended to do so. Might be a good place for you to start if you want to go down that path. I would be happy to send you the perl if you want it. It's a quick hack I did a while ago, so there might be things you want to change/fix.
>
> I am working on a version that does exactly what you are asking for, but my time is limited.
>
> Chris Waters
> Information Systems - Networks Group
> JELD-WEN, Inc.
> [EMAIL PROTECTED]
>
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Robin Ericsson
> Sent: Friday, October 06, 2006 4:56 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] small alternative frontend
>
> Hi,
>
> I'm looking for a small front-end cgi that should be extremely simple. If everything is OK, it should display a green background and say that everything is OK. If warning, it should display an orange background and list services/hosts that are warned. If critical, it should display a red background and list services/hosts with errors and warnings.
>
> Is there something like this available? Otherwise I guess it could be wrapped out pretty quick if I get some tips on how :)
>
> --
> regards, Robin

Is this something that has been posted online and is available for download? I'm interested in seeing it in action.

Thanks,
Dave
[Nagios-users] host details time vs service details time
We have an interesting problem that might be quite simple. Listing service details shows all hosts and their services with correct last-check times. Listing host details shows older last-check times (perhaps up to a month old). If the host goes down, the last-check date is correct. If it comes back, the date is correct and it seems to keep that date. In other words, the last-checked field on the host details page is only updated on a state change. I'm using active and passive tests on nagios 2.3. Here's the output from nagios -s. Note that the total scheduled hosts is 0.

Thanks

    HOST SCHEDULING INFORMATION
    ---
    Total hosts:                      696
    Total scheduled hosts:            0
    Host inter-check delay method:    SMART
    Average host check interval:      0.00 sec
    Host inter-check delay:           0.00 sec
    Max host check spread:            10 min
    First scheduled check:            N/A
    Last scheduled check:             N/A

    SERVICE SCHEDULING INFORMATION
    ---
    Total services:                   2120
    Total scheduled services:         2120
    Service inter-check delay method: SMART
    Average service check interval:   300.00 sec
    Inter-check delay:                0.14 sec
    Interleave factor method:         SMART
    Average services per host:        3.05
    Service interleave factor:        4
    Max service check spread:         8 min
    First scheduled check:            Mon Aug 14 11:25:06 2006
    Last scheduled check:             Mon Aug 14 11:30:05 2006

    CHECK PROCESSING INFORMATION
    Service check reaper interval:    10 sec
    Max concurrent service checks:    Unlimited

    PERFORMANCE SUGGESTIONS
    ---
    I have no suggestions - things look okay.

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern
University of Maryland Institute for Advanced Computer Studies
Re: [Nagios-users] how to string commands?
> On Fri, 28 Jul 2006, Thomas Slutyer wrote:
>> I am defining a template for my linux machines. I'd like the check_command for that template to have it check ping, local drive, pop, httpd, etc. How can I string them together? Would something like this work:
>>
>>     check_command check-host-alive; check-pop; etc..
>>
>> or do I need to use commas, or is this completely illegal?
>
> It won't work. Each check command can only have one result. Whereas Nagios' syntax will not allow for such trickery, you -could- however make a composite check script yourself. Just create a shell script that calls all of the checks you want to perform and let the script decide on the exit code and alarm message, based on the output of the other checks.
>
> This way I've created real service checks for our SMTP environment, for example. Why? Because a mail environment is more than just a running SMTP daemon. My script checks:
>
> * Are the main SMTP VIPs (Virtual IPs) available in DNS?
> * Is the load balancer running the VIP properly?
> * How many SMTP servers out of our complete pool are up and available for SMTP?
> * Are we able to connect to send a message?
>
> That way you can safely say that service X works properly :)

Great idea for single services such as smtp, but not so good for overall system health, as that is difficult to quantify. For smtp, you might write a script to

- see that smtp is running
- see that the spool area has free space
- see that system load is reasonable (queue load average; sendmail might otherwise start refusing new messages)

and finally, if all is well, send back a good status; if not, decide on a status (warning or critical) and output an appropriate message.

But even here, the problem is that the different things you're checking are not necessarily reliant on each other, so several of the above could have problems, and if so, what should the overall status and output message be? So by extension, writing a single script to test many unrelated items defining the overall health of a host can result in confusing output.

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern
University of Maryland Institute for Advanced Computer Studies
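To make the composite-script idea concrete, here is a minimal sketch of one possible policy for the "what should the overall status be" question: report the worst state seen and concatenate the first line of every sub-check's output. The severity ordering (CRITICAL over UNKNOWN over WARNING over OK), the `--run` switch, the plugin directory and the thresholds are all choices made for this example, not anything from the posts above.

```shell
#!/bin/sh
# Composite check sketch: run several plugins, exit with the worst state.

worst=0       # exit code of the most severe result so far
summary=""    # "[label: first output line] ..." for every sub-check

# Map a Nagios exit code to a severity rank: OK < WARNING < UNKNOWN < CRITICAL.
rank() {
    case "$1" in
        0) echo 0 ;;
        1) echo 1 ;;
        2) echo 3 ;;
        *) echo 2 ;;
    esac
}

# run_check <label> <command> [args...]
run_check() {
    label=$1; shift
    rc=0
    out=$("$@" 2>&1) || rc=$?
    if [ "$rc" -gt 3 ]; then rc=3; fi     # anything outside 0-3 counts as UNKNOWN
    first=$(printf '%s\n' "$out" | head -n 1)
    if [ "$(rank "$rc")" -gt "$(rank "$worst")" ]; then worst=$rc; fi
    summary="$summary[$label: $first] "
}

if [ "${1:-}" = "--run" ]; then
    P=/usr/lib/nagios/plugins             # assumed plugin directory
    run_check smtp  "$P/check_smtp" -H localhost
    run_check spool "$P/check_disk" -w 10% -c 5% -p /var/spool
    run_check load  "$P/check_load" -w 5,4,3 -c 10,8,6
    echo "$summary"
    exit "$worst"
fi
```

Because every sub-check's message ends up in the output, two unrelated failures remain visible, but the single status line does get long, which is the confusion David describes once the items stop being related.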
[Nagios-users] nagios speed to refresh
I have a nagios 2.3.1 install that is acting laggy. Updates can be as old as half an hour for the ~2200 services I monitor. I'm currently using active and passive checks, generally with 10-minute check intervals, although some have intervals of 30 minutes so as not to kill those hosts.

I intend to reduce the check interval and am considering changing inter_check_delay_method from smart to something else. By default, inter_check_delay_method=s is commented out of nagios.cfg, and just uncommenting it makes nagios -v complain with UNKNOWN VARIABLE, so I can't even change it to n or d or some small number of seconds. I'm guessing the config I have may need to be converted(?)

I'm looking to get information no older than 10 minutes (assuming the nagios server can handle it). Has anyone had any experience with this?

Here are some values of interest:

    command_check_interval=-1
    freshness_check_interval=60
    status_update_interval=30

    % nagios -s nagios.cfg
    ...
    HOST SCHEDULING INFORMATION
    ---
    Total hosts:                      700
    Total scheduled hosts:            0
    Host inter-check delay method:    SMART
    Average host check interval:      0.00 sec
    Host inter-check delay:           0.00 sec
    Max host check spread:            30 min
    First scheduled check:            N/A
    Last scheduled check:             N/A

    SERVICE SCHEDULING INFORMATION
    ---
    Total services:                   2244
    Total scheduled services:         2244
    Service inter-check delay method: SMART
    Average service check interval:   1782.35 sec
    Inter-check delay:                0.79 sec
    Interleave factor method:         SMART
    Average services per host:        3.21
    Service interleave factor:        4
    Max service check spread:         30 min
    ...
    CHECK PROCESSING INFORMATION
    Service check reaper interval:    10 sec
    Max concurrent service checks:    Unlimited

TIA

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern
University of Maryland Institute for Advanced Computer Studies
[Nagios-users] compiling nagios-2.3.1 on solaris
I've downloaded Nagios 2.3.1 for a number of architectures after the CGI buffer overflow exploit described in http://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2006-2162

The compile for our server (RHEL3) went fine, but to keep things clean I want to replace the structure on other OSes, including Solaris 8. Here's what I'm getting. The line indicated appears to be new code to check for the buffer overflow. Has anyone had any problems building this on Solaris?

    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c -o perfdata-base.o perfdata.c
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c -o xperfdata-base.o ../xdata/xpddefault.c
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c -o downtime-base.o ../common/downtime.c
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -c -o xdowntime-base.o ../xdata/xdddefault.c
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -o nagios nagios.c broker.o nebmods.o checks.o config.o commands.o events.o flapping.o logging.o notifications.o sehandlers.o utils.o retention-base.o xretention-base.o comments-base.o xcomments-base.o objects-base.o xobjects-base.o statusdata-base.o xstatusdata-base.o perfdata-base.o xperfdata-base.o downtime-base.o xdowntime-base.o -Wl,-export-dynamic -lm -lpthread -lltdl -lrt
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCORE -o nagiostats nagiostats.c -lm -lrt
    make[1]: Leaving directory `/tmp/nagios-2.3.1/base'
    cd ./cgi && make
    make[1]: Entering directory `/tmp/nagios-2.3.1/cgi'
    gcc -g -O2 -DHAVE_CONFIG_H -DNSCGI -c -o getcgi.o getcgi.c
    getcgi.c: In function `getcgivars':
    getcgi.c:174: `INT_MAX' undeclared (first use in this function)
    getcgi.c:174: (Each undeclared identifier is reported only once
    getcgi.c:174: for each function it appears in.)
    make[1]: *** [getcgi.o] Error 1
    make[1]: Leaving directory `/tmp/nagios-2.3.1/cgi'
    make: *** [all] Error 2

Here are the relevant lines of getcgi.c, with line numbers:

    173  /* suspicious content length */
    174  if((content_length<0) || (content_length>=INT_MAX-1)){
    175      printf("getcgivars(): Suspicious Content-Length was sent with the POST request.\n");
    176      exit(1);
    177  }

Fortunately, we only use the plugins on Solaris, but I'd like to get the whole structure built anyhow.

TIA

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern    University of Maryland Institute for Advanced Computer Studies
[Nagios-users] Status map circular markup background bug
I have noticed the following bug and am not sure where to report it.

We have a number of hosts with multiple parent links, and this seems to confuse the circular markup status map. It draws the green (or pink) background colors in BOTH places where the relevant nodes could be, even though the nodes are only drawn in one of the places.

Also, it sometimes seems to crowd things together a little too much (icons and text overlapping other icons and text), even when there is room to spread things out more. I suspect this may be related: when there are two parent links, so that the icons could be drawn either in the 3rd outer circle in one position or in the 7th outer circle in the other, it seems to draw the icons in the 3rd outer circle but with the spacing for the 7th outer circle, or vice versa.

I couldn't find any other reports of these particular status map problems. I can provide an example image of the status map showing these problems if necessary, if someone tells me where to put or email it.
Re: [Nagios-users] nagios not notifying by email
On Thu, 23 Feb 2006, Toto Capuccino wrote:

Hi, did you check the timeperiods.cfg file? As you are using workhours notification for services, you won't receive notifications outside of this period.

2006/2/23, Dave Stern [EMAIL PROTECTED]:

I didn't include timeperiods.cfg because this was the generic one. We're using workhours and 24x7 for all hosts. Basically, notification_period is 24x7 for all, and {service|host}_notification_period is either 24x7 or workhours. I presume it's not a problem if a host is set to notification_period=work in contacts but 24x7 in hosts.cfg and services.cfg.

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern    University of Maryland Institute for Advanced Computer Studies
Re: [Nagios-users] nagios not notifying by email
On Thu, 23 Feb 2006, Mrutyunjaya Dash wrote:

Hi, you can check your services.cfg file to verify that notification_options is set to w c u r. This might be one of the reasons if you copied services.cfg from the sample config file.

services.cfg has notification options currently set to w,u,c. I added recovery.

One hint: on the web page for a host's availability, it says "Notifications are disabled" in the upper left corner. This is in contrast to the fact that enable_notifications=1 is set in nagios.cfg (and cgi.cfg says main_config_file=/some/path/nagios.cfg).

I just tried disabling all state retention (it was previously on) and bounced Nagios, and finally got a message, so perhaps state retention was set incorrectly. What I'd like is for it to retain state when I bounce Nagios, so that on restart I don't get messages about the hosts I already know are down. Here's what I have:

    nagios.cfg:
        log_initial_states=0
        retain_state_information=0
        state_retention_file=/usr/local/stow/nagios/var/status.sav
        use_retained_program_state=0

    host.cfg:
        retain_status_information       0
        retain_nonstatus_information    0

    services.cfg:
        retain_status_information       0
        retain_nonstatus_information    0

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern    University of Maryland Institute for Advanced Computer Studies
[Nagios-users] nagios not notifying by email
We have two iterations at our site, one running the old NetSaint (which works flawlessly) and one running Nagios 1.2 on RHEL E3-AS. The latter rebuilds its config files each night based on various site-specific files detailing what hosts we have, what to monitor on them, etc. The Linux version is not notifying when hosts/services go down. I suspect this is nothing more than the notify-by-email command.

On NetSaint, it's

    command[notify-by-email]=/usr/bin/printf '%s\n%s\n%s\n%s' '* NetSaint 0.0.7 * $SERVICESTATE$ alert/notification' 'Host: $HOSTNAME$ Address: $HOSTADDRESS$', 'Additional Info: $OUTPUT$' | /usr/bin/mailx -s 'Service alert: $HOSTNAME$ is $SERVICESTATE$ **, $SERVICEDESC$' $CONTACTEMAIL$

and on Linux it's

    # 'notify-by-email' command definition
    define command{
        command_name    notify-by-email
        command_line    /usr/bin/printf "%b" "* Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $DATETIME$\n\nAdditional Info:\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$
        }

Does anyone know what the command should look like and how to test the above?

Thanks

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern    University of Maryland Institute for Advanced Computer Studies
[Nagios-users] service dependancies
I've been running Nagios for a while and love it. One of the plugins I wrote ssh'es to various hosts and checks automounts by trying to read a file on each of our critical automount hosts. If a host doesn't respond to ssh, the web page obviously shows ssh and all the various automounts as down. It would be nice not to have to decode the web page that way, i.e. if it reported only ssh as down because of the dependency. I was hoping that servicedependency could do this, with something like

    define servicedependency{
        host_name                       either specific hostnames or wildcard
        dependent_host_name
        dependent_service_description   ssh
        service_description             nfs*
        }

Can this be done using the above format?

TIA

=-=-=-=-=-=-=-=-=-=- generated by /dev/dave -=-=-=-=-=-=-=-=-=-=-=-=
David Stern    University of Maryland Institute for Advanced Computer Studies