Re: [Nagios-users] Multiple parents: *any* or *all*?
If any are up. It's a way of modelling a resilient network.

On a large network it can become a bit messy (pairs of routers all cross-connected with virtual IPs, dynamic rerouting, etc.). In those cases I tend to just add the virtual router IPs, or even skip the switches completely until getting to the end links (which, for the hosts I care about, are usually single-homed international offices).

-----Original Message-----
From: Jack Bates [mailto:1j1...@nottheoilrig.com]
Sent: 26 June 2012 10:28
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Multiple parents: *any* or *all*?

When you list multiple other hosts in the parents directive of a host, does it mean that this host should be reachable if *any* of these parents are up, or only if *all* of these parents are up?

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats.
http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this.
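To make the "any parent" rule in the thread above concrete, here is a minimal Python sketch of how a failed host check could be classified from the states of its parents. It is an illustration of the behaviour described in the reply, not Nagios's own code; the function name `classify_host` and the state strings are assumptions made for this example.

```python
from typing import Iterable


def classify_host(check_ok: bool, parent_states: Iterable[str]) -> str:
    """Classify a host from its own check result and its parents' states.

    Hypothetical helper: a host whose check fails is DOWN if at least one
    parent is UP (the problem is the host itself), and UNREACHABLE only
    if every parent is down or unreachable.
    """
    if check_ok:
        return "UP"
    parents = list(parent_states)
    # A host with no parents is treated as directly attached to the monitor.
    if not parents or any(state == "UP" for state in parents):
        return "DOWN"
    return "UNREACHABLE"
```

With two redundant routers as parents, the host only becomes UNREACHABLE when both routers are down, which is why listing both models a resilient link.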
Re: [Nagios-users] Large scale installation
Doesn't it depend on how often you're performing the checks too? 1000 checks every 10 seconds is harder than 10,000 checks every hour.

We have 589 hosts/3619 services on a 2-CPU 2.8GHz Xeon with 1GB of RAM, which does other things too. The machine is about 8-10 years old. The checks are scheduled every 4 minutes; however, only 45% have run in the last 4 minutes, and 95% have run in the last 15. This is with Nagios 2, though, which has issues like blocking when hosts are down (32 currently are).

-----Original Message-----
From: Randal, Phil [mailto:phil.ran...@hoopleltd.co.uk]
Sent: Mon 11/06/2012 4:54 PM
To: Nagios Users List
Subject: Re: [Nagios-users] Large scale installation

What's the spec of your nagios server?

We're checking (around) 500 hosts, 4500 active and 5000 passive services with Nagios 3.4.1 in a CentOS 5.8 VM with 2GB RAM and 4 vCPUs, without problems, with the help of check_mk/mk_livestatus (http://mathias-kettner.de/check_mk.html).

Also using pnp4nagios and rrdcached, and a ramdisk for checkresults.

Large installation config tweaks, and tuning the check result reaper frequency, all help even out the load.

Cheers,
Phil

--
Phil Randal
Infrastructure Engineer
Hoople Ltd | Thorn Office Centre | Hereford HR2 6JT
Tel: 01432 260415 | Email: phil.ran...@hoopleltd.co.uk

From: Andreas Brandino [mailto:ampra...@gmail.com]
Sent: 10 June 2012 13:39
To: Nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Large scale installation

Hi all,

my nagios installation currently has 400+ hosts and around 1400 checks. As the server load grows, delays are appearing. Is there any way to move a part of the active checks to a second nagios server? And in that case, how will these two nagios servers exchange data? If this is feasible, can you point me to some documentation?

Thank you

Any opinion expressed in this e-mail or any attached files are those of the individual and not necessarily those of Hoople Ltd. You should be aware that Hoople Ltd. monitors its email service.
This e-mail and any attached files are confidential and intended solely for the use of the addressee. This communication may contain material protected by law from being passed on. If you are not the intended recipient and have received this e-mail in error, you are advised that any use, dissemination, forwarding, printing or copying of this e-mail is strictly prohibited. If you have received this e-mail in error please contact the sender immediately and destroy all copies of it.
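The point about check frequency in the thread above is easy to put into numbers. The following Python sketch just divides the number of checks by the scheduling interval; the helper name `checks_per_second` is made up for this illustration, and treating hosts and services as one pool of checks is a simplifying assumption.

```python
def checks_per_second(num_checks: int, interval_seconds: float) -> float:
    """Average number of checks the scheduler must start each second."""
    return num_checks / interval_seconds


# The two cases compared in the reply.
fast = checks_per_second(1_000, 10)       # 100.0 checks/s
slow = checks_per_second(10_000, 3_600)   # roughly 2.8 checks/s

# The installation described in the reply: 589 hosts + 3619 services
# scheduled every 4 minutes (assumption: one check per host and service).
site = checks_per_second(589 + 3_619, 4 * 60)

print(f"{fast:.1f} {slow:.2f} {site:.1f}")
```

The first case needs about 36 times the sustained rate of the second, even though it has a tenth of the checks.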
Re: [Nagios-users] check jboss
> There was a guy that configured it to be monitored by the TCP port; however, occasionally jboss became very slow and nagios did not report anything because the port was still listening. Does anyone know a plugin to check a specific instance of jboss and the port or other things together?

What part of jboss became slow?

You can use check_http to pull the jmx console from the port (8082?) and ensure that
1) The page returns within a given timeout
2) The page returns a 200 OK message
3) The page is long enough

Personally I couldn't care less if my jboss server is working; I'm much more concerned that the applications and websites deployed on it are working, so I check them instead (including using check_http to pull a static .gif, and using check_http to pull a database-generated page, which tells me if it's the webserver that's down, or if it's the database pool).
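The three conditions listed above (timeout, 200 OK, page long enough) can be sketched as a small decision function. This is a Python illustration of the logic only, not how check_http is implemented; the function name `evaluate_page` and the default thresholds are assumptions chosen for the example.

```python
OK, WARNING, CRITICAL = 0, 1, 2  # standard Nagios plugin exit codes


def evaluate_page(status_code: int, body_length: int, elapsed_seconds: float,
                  max_seconds: float = 10.0, min_length: int = 1000):
    """Return (exit_code, message) for one fetched page.

    The thresholds are placeholders; pick values that match the page
    you are actually pulling.
    """
    if elapsed_seconds > max_seconds:
        return CRITICAL, f"response took {elapsed_seconds:.1f}s (limit {max_seconds:.1f}s)"
    if status_code != 200:
        return CRITICAL, f"HTTP status {status_code}"
    if body_length < min_length:
        return WARNING, f"page is only {body_length} bytes (expected at least {min_length})"
    return OK, f"HTTP 200, {body_length} bytes in {elapsed_seconds:.1f}s"
```

A slow but listening jboss fails the first test, which is exactly the case a plain TCP port check misses.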
Re: [Nagios-users] Correct email use [WAS Confusion on nrpe]
When you upgrade to a threaded mail reader you will understand :-)

Actually, replying to another thread to start your own is considered 'hijacking'

There is probably a bit of snobbery with mail agents going on. Many people are stuck with abominations like Outlook, which are often set up to ignore traditional ways of handling emails. I personally haven't used mutt for about 6 years thanks to work pressures.

However most people have used google and come across archives of email threads. If the OP were to visit http://www.mail-archive.com/nagios-users@lists.sourceforge.net/thrd2.html#37996, he'll see the kind of threading problem his methodology can cause, rather than having to take other people's word that their mailing clients are better than his.
Re: [Nagios-users] Map
> I have a question that I hope someone can give me some guidance on. How do I view the map? Whatever I try, it still looks all jumbled together.

> The map is built automatically based on the parent/child relationship you have set in your host definitions.

The default automap is fairly unusable once you hit a non-trivial number of hosts. We have 386 on ours, and nobody bothers. It doesn't help if you want to map certain services instead of hosts either.

If you're keen on a map, you need to look at something like nagvis to do that. We have a nice map of the country/world showing connections to various overseas bureaus. Not too useful from a support point of view (most of it is green for most of the time), but it does give management something nice to look at.

From: Macleod, Chris [mailto:cmacl...@edrington.co.uk]
Sent: 11 November 2010 15:36
To: Nagios Users List
Subject: Re: [Nagios-users] Map

The map is built automatically based on the parent/child relationship you have set in your host definitions.

For instance, if you make Switch1 the parent of Server1 and Switch1's parent is localhost, you will have a map of Server1-Switch1=Localhost

This also means that if Switch1 goes down, nagios knows Server1 will be unreachable - helps reduce alerts.

From: Edwin Zoeller [mailto:edwin.zoel...@ama-assn.org]
Sent: 11 November 2010 15:20
To: Nagios Users List
Subject: [Nagios-users] Map

I have a question that I hope someone can give me some guidance on. How do I view the map? Whatever I try, it still looks all jumbled together.

Thanks,
Ed
Re: [Nagios-users] IP and hostname mapping control
We have a service that checks DNS addresses. We run it against some of our internal DNS servers (we're checking that the DNS server is resolving, so the check belongs on the DNS server rather than the host you're interested in).

define service{
        use                     infrastructure-dns-service
        host_name               dc1025,dc1026,dc1030
        service_description     DNS-myserver
        check_command           check_dns!myserver.com!4.5.6.7
        }

define command{
        command_name    check_dns
        command_line    $USER1$/check_dns -H $ARG1$ -s $HOSTADDRESS$ -a $ARG2$
        }

(I believe check_dns is a standard plugin)

So you'd have:

define service {
        use                     local-service
        host_name               local.dns.server
        service_description     DNS resolving foo.mylan.com
        check_command           check_dns!foo.mylan.com!192.168.0.1
        }

--
The probability of someone watching you is proportional to the stupidity of your action.
Paul Weaver
Systems Development Engineer
BBC FMT BETG TDD SDT

From: Network Operation Center FMC Luxemburg [mailto:n...@eurofmc.com]
Sent: 29 June 2010 06:40
To: Nagios Users List
Subject: [Nagios-users] IP and hostname mapping control

Hi everybody,

I'm looking for a way to check the mapping between a hostname and an IP address.

Example: IP 192.168.0.1 exists, and if the hostname foo.mylan.com is not associated with this IP, I would like to have an alarm.

Indeed the script below returns no alarm:

define host {
        use             unix-server
        host_name       foo.mylan.com
        display_name    foo
        address         192.168.0.1
        check_command   check_http
        }

define service {
        use                     local-service
        host_name               foo.mylan.com
        service_description     HTTP local
        check_command           check_http
        }

Any idea?

Thanks a lot
François
--
Network Operation Center
LUXEMBURG
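The check described in the thread above boils down to "resolve the name and compare the answer with the address you expect". Here is a minimal Python sketch of that comparison. It is not the check_dns plugin: it asks whatever resolver the local machine uses rather than a named DNS server, and the function names `resolve_ipv4` and `check_mapping` are invented for this example.

```python
import socket
from typing import Callable, List


def resolve_ipv4(hostname: str) -> List[str]:
    """Resolve a hostname through the local system resolver."""
    _name, _aliases, addresses = socket.gethostbyname_ex(hostname)
    return addresses


def check_mapping(hostname: str, expected_ip: str,
                  resolver: Callable[[str], List[str]] = resolve_ipv4):
    """Return a Nagios-style (exit_code, message) pair for one mapping."""
    try:
        addresses = resolver(hostname)
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        return 2, f"DNS CRITICAL: {hostname} did not resolve ({exc})"
    if expected_ip in addresses:
        return 0, f"DNS OK: {hostname} resolves to {expected_ip}"
    found = ", ".join(addresses) or "nothing"
    return 2, f"DNS CRITICAL: {hostname} resolves to {found}, expected {expected_ip}"
```

The resolver is passed in as a parameter so the comparison can be exercised without touching the network; in real use you would leave the default in place.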
Re: [Nagios-users] non sticky passive checks
We use the following check to ensure that a backup has completed. The backup job calls in success (or fail) at the end, and runs every day (86,400 seconds). The freshness_threshold is set slightly longer than that (93,600 seconds). The backupFailed check command contains the following.

Note that normally active checks are disabled; however, check_freshness and freshness_threshold will trigger an active check once no result has arrived for n time intervals (93,600 * 1 second in my case). This is nagios 2, so YMMV.

define service{
        use                     defaultservice
        host_name               myhost
        service_description     Nightly Backup
        active_checks_enabled   0
        passive_checks_enabled  1
        check_freshness         1
        freshness_threshold     93600
        max_check_attempts      1
        check_command           backupFailed
        }

define command{
        command_name    backupFailed
        command_line    $USER1$/reportError 2 "The backup on $HOSTADDRESS$ failed to report in time"
        }

With reportError being a bash script:

#!/bin/bash
# Print the message (second argument) and exit with the given state (first argument)
echo "$2"
exit $1

--
What's with all the carrots? What do they need such good eyesight for anyway?
Paul Weaver
Systems Development Engineer
BBC FMT BETG TDD SDT
Work: 020 8225 8109
Mobile: 07736 481006
Room 1244, BBC Television Centre, Wood Lane, London, W12 7RJ

From: Pankaj [mailto:penguinh...@gmail.com]
Sent: 08 January 2010 07:30
To: Nagios Mailinglist
Subject: [Nagios-users] non sticky passive checks

Hi,

I've configured passive service checks and am reporting back the results using nsca. I would like the alerts to turn critical after 5 minutes if the monitored servers fail to send any results. Currently the check remains in its last state. How can I achieve this?
service_common.cfg

define service{
        use                     generic_service
        name                    passive_service
        active_checks_enabled   0
        passive_checks_enabled  1
        flap_detection_enabled  0
        register                0
        is_volatile             0
        check_period            24x7
        max_check_attempts      1
        normal_check_interval   5
        retry_check_interval    1
        check_freshness         0
        contact_groups          admins
        check_command           check_dummy!0
        notification_interval   120
        notification_period     24x7
        notification_options    w,u,c,r
        stalking_options        w,c,u
        }

define service {
        use                     passive_service
        service_description     HW Sensors
        hostgroup_name          hgp_webservers, hgp_db, hgp_search, hgp_image
        check_command           check_dummy
        active_checks_enabled   0
        passive_checks_enabled  1
        }

define command{
        command_name    check_dummy
        command_line    $USER1$/check_dummy $ARG1$
        }

--
( 2b || ! 2b )
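The freshness idea in the reply above is just a comparison between "how long since the last passive result" and a threshold. A minimal Python sketch of that comparison follows; it illustrates the arithmetic only and is not Nagios's scheduler. The names `is_stale`, `GRACE_SECONDS` and the two-hour grace figure are assumptions inferred from 93,600 = 86,400 + 7,200.

```python
DAY_SECONDS = 86_400           # the backup reports in once a day
GRACE_SECONDS = 2 * 3_600      # assumed slack for a slow backup run
FRESHNESS_THRESHOLD = DAY_SECONDS + GRACE_SECONDS  # 93_600, as in the config


def is_stale(last_result_time: float, now: float,
             threshold: float = FRESHNESS_THRESHOLD) -> bool:
    """True when no passive result has arrived within the threshold.

    When this is true, the monitor would run the service's check_command
    (backupFailed in the reply), which always reports a failure.
    """
    return (now - last_result_time) > threshold
```

For the five-minute case in the original question, the same test applies with a threshold of 300 seconds, provided check_freshness is switched on for the service.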
Re: [Nagios-users] meaningful map?
> I'm trying to have a meaningful map like I have found on
> http://nagios.sourceforge.net/images/screens/big/statusmap.jpg
> I thought it had something to do with dependency but that doesn't seem to work. My question is how do I do this?

The statusmap is only really of use if you have a small number of hosts. The layout in that example is controlled by the parents part of the config:

http://nagios.sourceforge.net/docs/3_0/networkreachability.html
Re: [Nagios-users] NSClient++ - Anyone else having difficulty reaching the site?
http://isitup.org/nsclient.org

Yes, it's working.

--
If we can't fix it, it ain't broke.
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

From: Anthony [mailto:anthony-nag...@hogan.id.au]
Sent: 10 September 2009 07:39
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] NSClient++ - Anyone else having difficulty reaching the site?

Regarding NSClient++

Anyone else having issues reaching its homepage at nsclient.org?

Still able to reach files at http://sourceforge.net/projects/nscplus/
Re: [Nagios-users] searching howto use snmp
From: Menard, Chris [mailto:chris.men...@aspect.com]
> nrpe is a small agent that is installed on the remote server. The main nagios process makes requests to nrpe to collect remote stats.

Many network devices will have SNMP running already; for some (especially nat-in-a-box devices) it's the only way. Many corporations will prefer that monitoring systems use SNMP rather than their own agents.

> can anyone point me WHERE can I found a non-cryptographic information on how to get info on disk usage on remote hosts?

Using SNMP and Nagios, you could run the following plugin:

http://www.monitoringexchange.org/cgi-bin/page.cgi?g=Detailed%2F2045.html;d=1

(We've made a few additions) http://195.50.87.86/check_hd.txt

However, as Jim has said, getting a good book on nagios (like the Wolfgang book) is a good start if you don't understand how the nagios system works. Once you understand the basics, you can add in plugins from monitoringexchange.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] Unable to Login to Nagios
Hmm, I'm sure you're really not Kevin Mitnick.

You'll probably find a .htaccess file somewhere on your system. The exact configuration will vary depending on how you initially installed nagios. This file will contain a username; a .htpasswd file will contain the (encrypted) password.

http://httpd.apache.org/docs/2.3/misc/password_encryptions.html may give you some pointers.

If you care about understanding why you have a problem, and how *you* can fix it in future, people on this list will respond to polite emails. If you only care about results, I'm sure one of many people on this list will be happy to take a credit card number at $100/hour.

--
In a world without walls and fences, who needs windows and gates?
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

From: Kevin Mitnikc [mailto:teckad...@gmail.com]
Sent: 14 July 2009 22:55
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Unable to Login to Nagios

I am unable to login to the Nagios web GUI. When I open a browser to access Nagios, I am unable to authenticate. I haven't made any changes or addons since yesterday.

What do I need to do to reset the password and login? Am I able to do this somehow in the terminal?

Cheers,
Kevin
Re: [Nagios-users] monitor windows remote desktop from linux
> Yeah, I understand what you mean. The problem is that before you can monitor, you have to identify what exactly needs to be monitored. And that's where the troubleshooting comes in.

I disagree. The OP would like to monitor remote desktop logins. As an admin, I care that RDP is broken, not (necessarily) that the clock is out of sync.

> From then on, it could be anything from an authentication issue to a protocol issue to an encryption issue, clock skew (I'm not sure if the encryption in RDP is time-sensitive. I don't think it is, but wanted to mention it), and the monitoring needs to reflect the specific problem you are monitoring for. If you don't know what exactly is going wrong, there is a high likelihood that you are monitoring the wrong thing and don't actually detect when the users are experiencing a problem - just as you saw with the sockets.

An ideal check would be for some meatware to sit in front of a computer, log in with a few user names, check that the login is successful, and push a button which sets the nagios state to green. If they have a problem, push a different button to set the light to red.

You may be able to get a perl module that can control remote desktop, have a system log in, run a command, and log out. A quick google doesn't show anything up, and experimentation with

rdesktop -u USER -d DOMAIN -p PASS -s SOMETHING.EXE $HOSTADDRESS$

seems to leave the session logged in. You could create a SOMETHING.EXE which made a call back to nagios to say "I'm working", but you'd need to get windows to log out afterwards.
A bit of windows scripting would probably do that for you, so you would have something like this (pseudocode; the helper functions are placeholders):

nagiosside

    $ok = 0;
    $port = 2345;
    # Find a free port to listen on for the callback
    while ($ok == 0) {
        $port++;
        $ok = listenOnPort($port);
    }
    run_in_foreground("rdesktop -u USER -d DOMAIN -p PASS -s remote_check.wsh $nagiosaddress $port $HOSTADDRESS$");
    my ($status, $info) = receivedMsg();
    print $info;
    exit $status;

windowsside

    my ($nagiosaddress, $port) = @ARGV;
    open_socket("$nagiosaddress:$port");
    print_socket("OK,Remote Desktop OK for user $USER");
    close_socket();
    logout();

For already-written plugins, check_x224 will give a better check than check_tcp.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
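The Nagios side of the callback idea above has to turn whatever the remote session sends back into a plugin exit code. Here is a small Python sketch of just that step, assuming the "STATE,message" format used in the pseudocode. The function names and the choice to treat a missing callback as CRITICAL are assumptions for this example, and the socket handling and the rdesktop call are deliberately left out.

```python
from typing import Optional, Tuple

# Standard Nagios plugin exit codes.
STATES = {"OK": 0, "WARNING": 1, "CRITICAL": 2, "UNKNOWN": 3}


def parse_callback(message: str) -> Tuple[int, str]:
    """Split 'OK,Remote Desktop OK for user bob' into (0, 'Remote Desktop ...')."""
    label, _sep, info = message.strip().partition(",")
    status = STATES.get(label.upper(), STATES["UNKNOWN"])
    return status, info or "no detail supplied"


def result_from_session(message: Optional[str]) -> Tuple[int, str]:
    """Treat a session that never called back as a failed login."""
    if message is None:
        return STATES["CRITICAL"], "Remote Desktop session did not call back"
    return parse_callback(message)
```

Treating silence as CRITICAL matters here because the interesting failures (login hangs, authentication errors) are exactly the ones where the remote script never runs.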
Re: [Nagios-users] Monitoring traffic on switch port via Nagios/Nagiosgrapher
Our Ciscos respond to the standard IF-MIB::ifSpeed OID (I think it's .1.3.6.1.2.1.2.2.1.5). We have a perl script wrapped around that which calculates traffic over a 20 second period, but you could use a temporary file to record the traffic if you were interested in a longer time.

Be warned though: at 1Gbit, a 32-bit SNMP counter could wrap around roughly every 30 seconds. Once you look at measuring 10Gbit, you really need a switch and client that are capable of doing 64-bit SNMP counters.

The output is pumped into nagiosgrapher, with traffic out of the port multiplied by -1.

The plugin might say something like

OK: Port GigabitEthernet9/15 on 10.129.138.1 transferred 241211 bits/s in (0.02% utilisation) and 684 bits/s out (0% utilisation)

We only bother measuring the inter-switch ports.

-----Original Message-----
From: Stefan-Michael Guenther [mailto:nagio...@in-put.de]
Sent: Fri 6/26/2009 7:44 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Monitoring traffic on switch port via Nagios/Nagiosgrapher

Hello,

I'd like to monitor the traffic on different HP switches via Nagios and Nagiosgrapher.

Does anyone have a plugin that calculates the incoming and outgoing traffic on every single port? Or do you know the corresponding OIDs?

I have checked the MIBs for the HP ProCurve 2824 / 2810 but wasn't really successful in finding the right ones.

Thanks for any suggestions or hints,

Stefan
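The wrap-around warning above can be checked with a few lines of arithmetic. The Python sketch below assumes the script samples octet counters (such as the IF-MIB ifInOctets/ifOutOctets columns) twice and works out the rate from the difference; that is an assumption about how such a plugin is usually built, not a description of the author's perl script, and all function names are invented for the example.

```python
def counter_delta(previous: int, current: int, bits: int = 32) -> int:
    """Difference between two samples of a wrapping counter.

    Correct as long as the counter wrapped at most once between samples.
    """
    return (current - previous) % (1 << bits)


def bits_per_second(prev_octets: int, cur_octets: int,
                    seconds: float, bits: int = 32) -> float:
    """Average traffic rate between two octet-counter samples."""
    return counter_delta(prev_octets, cur_octets, bits) * 8 / seconds


def seconds_to_wrap(link_bits_per_second: float, bits: int = 32) -> float:
    """How long a fully loaded link takes to wrap an octet counter."""
    return (1 << bits) * 8 / link_bits_per_second


print(round(seconds_to_wrap(1e9), 1))   # about 34 seconds at 1 Gbit/s
print(round(seconds_to_wrap(1e10), 1))  # about 3.4 seconds at 10 Gbit/s
```

At 10 Gbit/s a 32-bit counter can wrap several times inside a 20 second sampling window, and the modulo trick can no longer tell how many times, which is why 64-bit counters are needed there.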
Re: [Nagios-users] Nagios Reboot Notification
Sure it will. uptime is a counter that increases from 0 to infinity (essentially) and resets on reboot. The snmp check I use says that if that counter is less than 50,000 (SNMP uptime is counted in hundredths of a second, so ~8.3 minutes), show a WARNING. If you check every minute, you'd see a WARNING status for 7 or 8 minutes, then a reset to OK.

You're pretty lucky if your box reboots inside a minute -- it takes about 2 minutes for most of our boxes to get through POST!

> them) or that it's checking _very_ frequently. There's no magic 'some box just rebooted' protocol...

Hmm, a trap coming from the PSU (sudden current change), or switch (link going down then up)?

Most monitoring systems I've seen use uptime, keep track of the last value and the time it was read, then alert if that value resets to close to zero (or even if it returns outside of a predicted range). The reason you keep track of the value is to avoid bogus alerts when the snmp counter rolls over.
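The "keep the last value and compare against a predicted range" approach described above can be sketched as follows. This Python example assumes the uptime value is an SNMP TimeTicks counter (hundredths of a second, 32 bits wide); the function name `detect_reboot` and the two-minute tolerance are assumptions made for the illustration.

```python
TICKS_PER_SECOND = 100   # SNMP TimeTicks are hundredths of a second
WRAP = 1 << 32           # a 32-bit TimeTicks counter rolls over after ~497 days


def detect_reboot(prev_ticks: int, prev_time: float,
                  cur_ticks: int, cur_time: float,
                  tolerance_seconds: float = 120.0) -> bool:
    """True if the uptime counter is far from where it should be by now.

    The predicted value is taken modulo the counter width, so a normal
    rollover is not mistaken for a reboot.
    """
    elapsed_ticks = int((cur_time - prev_time) * TICKS_PER_SECOND)
    predicted = (prev_ticks + elapsed_ticks) % WRAP
    diff = abs(cur_ticks - predicted)
    diff = min(diff, WRAP - diff)  # shortest distance on the circular counter
    return diff > tolerance_seconds * TICKS_PER_SECOND
```

One limitation of this sketch: a box that reboots again when its previous uptime was already shorter than the tolerance will go unnoticed, so the simple "uptime below a threshold" test from the thread is still worth keeping alongside it.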
[Nagios-users] WMI/WQL querying of HP boxes with Insight Manager
While it's not specifically a Nagios question, it's related to plugins.

After an afternoon of tearing my hair out, I've confirmed that I really hate Windows. However, after finding a nice little package to query WMI from Linux (wmi-client, which is tied into Samba), I had hopes of interrogating the HP Insight Manager we have installed on many boxes.

Unfortunately I can't find any examples of using WQL to query it, and while the information I can get out of Windows (such as the number of sticks of physical memory) is great for our CMDB, it's not something I really want to monitor with Nagios.

Has anyone got any experience with using WMI, specifically in talking to HP? (I want to monitor some boxes that I can't install anything on, not even SNMP, let alone NRPE or our own agents.)
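As a starting point for turning wmi-client output into something a plugin or CMDB import can use, here is a small Python sketch. It assumes the wmic tool from wmi-client prints a "CLASS:" line, then a pipe-delimited header, then one pipe-delimited row per object; the sample text below is made up, and the query it imagines (SELECT Capacity, DeviceLocator FROM Win32_PhysicalMemory) covers only the plain Windows memory information mentioned above, not anything specific to HP Insight Manager.

```python
def parse_wmic_output(text, delimiter="|"):
    """Turn wmic-style text into a list of dicts, one per returned row."""
    rows = []
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("CLASS:"):
            header = None          # the next non-empty line names the columns
            continue
        if header is None:
            header = line.split(delimiter)
            continue
        rows.append(dict(zip(header, line.split(delimiter))))
    return rows


# Made-up sample in the assumed output format, for illustration only.
SAMPLE = """CLASS: Win32_PhysicalMemory
Capacity|DeviceLocator
2147483648|DIMM 1
2147483648|DIMM 2
"""

sticks = parse_wmic_output(SAMPLE)
total_gib = sum(int(row["Capacity"]) for row in sticks) / 2 ** 30
print(len(sticks), "sticks,", total_gib, "GiB")
```

If the real output differs (a different delimiter, extra banner lines), only the parsing function needs adjusting; the rest of a check would stay the same.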
Re: [Nagios-users] HTTP and Web Form based Auth
> How are folks checking site functionality with sites that use auth *other* than basic (apache) auth? I didn't find any plugins that support populating forms and posting.

I use a perl script and WWW::Mechanize.

Here's a snippet of a check which works on a site that stores your username as a cookie, having entered it on a form. This check will keep that cookie, and only log in if the cookie gets invalidated (the login process can be very slow, but I want to know how quickly the rest of the site is). I have a similar check that always logs in, without using a cookie.

Cookies are stored in /tmp/pluginname.hostname.cookies.dat

    use WWW::Mechanize;
    use HTTP::Cookies;

    my $mech = WWW::Mechanize->new( autocheck => 1 );

    # Name the cookie jar after this plugin and the host being checked
    my $jar = $0;
    $jar =~ s/.*\///g;
    $jar .= ".$HOST.cookies.dat";
    $mech->cookie_jar(HTTP::Cookies->new(file => "/tmp/$jar", autosave => 1));

    foreach my $WIRE (split(/,/, $WIRES)) {
        my $url = "http://$HOST/wbmain/HHwires.aspx?Wire=$WIRE&view=2";
        $mech->get($url);

        # If we need to log in, do so
        if ($mech->uri() =~ /loginp.aspx/) {
            $mech->form_number(1);
            $mech->field("txtname", "user.name");
            $mech->field("txtPassword", "password");
            $mech->tick("remembermachine");
            $mech->click();
            $logins++;
        }

        # Count the lines of the page that link to a wire detail page
        my $content = $mech->content;
        my $result = 0;
        foreach (split(/\n/, $content)) {
            $result++ if (/hhviewwiredetail.aspx/);
        }
        $results->{$WIRE} = $result;
    }

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
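For readers who prefer Python, the same idea (keep a cookie jar on disk between runs, and only log in when the site bounces you to the login page) might look roughly like this using only the standard library. The helper names are made up for illustration; the login-page pattern and the link pattern are taken from the Perl snippet above, and the form submission itself is left out because it depends on the site.

```python
import http.cookiejar
import os
import re
import urllib.request


def cookie_jar_path(plugin_path, host, directory="/tmp"):
    """Build <directory>/<pluginname>.<hostname>.cookies.dat, as the Perl check does."""
    name = f"{os.path.basename(plugin_path)}.{host}.cookies.dat"
    return os.path.join(directory, name)


def make_opener(jar_file):
    """Return (opener, jar); cookies saved by an earlier run are loaded if present."""
    jar = http.cookiejar.LWPCookieJar(jar_file)
    if os.path.exists(jar_file):
        jar.load(ignore_discard=True)
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def needs_login(final_url):
    """True when the request ended up on the login page."""
    return re.search(r"loginp\.aspx", final_url) is not None


def count_wire_links(html):
    """Count the lines of the page that mention the wire detail page."""
    return sum(1 for line in html.splitlines() if "hhviewwiredetail.aspx" in line)
```

In use, a check would call opener.open(url), pass response.geturl() to needs_login(), post the login form if required, and finish with jar.save(ignore_discard=True) so the next run can skip the slow login.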
Re: [Nagios-users] Monitoring switches via SNMP --
> checking for init_snmp in -lnetsnmp... no
> configure: error: snmp library not found
>
> I looked through the Ubuntu package website, and tried installing any package I could find that's related to SNMP, but already have them all installed.

It's probably a -dev package, which you need to compile, but not to run, the plugin. Try installing libsnmp-dev

> snmp is already the newest version.
> libsnmp-base is already the newest version.
> libsnmp15 is already the newest version.
> snmpd is already the newest version.
> libsnmp-perl is already the newest version.
>
> Is there an inherent incompatibility between Ubuntu 8.10 and the check_snmp plugin? Do I need to go the all Perl/CPAN route? I can perform snmpwalks, but I don't know of any other way to find out what I'm missing.

No. I personally prefer using perl to write my own plugins, though. You rarely need to use cpan with ubuntu.

--
The probability of someone watching you is proportional to the stupidity of your action.

Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] report : Insufficient Data
You need to keep all of the .log files in (nagios)/var; the only historical data nagios stores is in those files. I use rsync every day to back these files up offsite, as we've got 18 months of data.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: Meyer Jerome [mailto:jerome.me...@baldata.ch]
Sent: 12 May 2009 14:56
To: MAD
Cc: Nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] report : Insufficient Data

Thank you Marc-André for your email!

OK, but does that mean it will be impossible to eliminate all the undetermined data? I'm still in a test phase, and when I start Nagios in production I don't want to have any zombies or unknown data. Is there no solution?

jérôme

-----Original Message-----
From: MAD [mailto:sri.lu...@free.fr]
Sent: Tuesday, 12 May 2009 12:44
To: Meyer Jerome
Cc: Nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] report : Insufficient Data

To create its graphs, Nagios parses its log files. So if you delete them (or they are lost), it will be as if Nagios had not been running for the period missing from the log files.

For example, if the log files from May 2nd to May 4th are missing, you won't have any data on the status, the check results, etc. that Nagios would have produced, and so no graphs either.

Marc-André

----- Original Mail -----
From: Meyer Jerome <jerome.me...@baldata.ch>
To: Nagios-users@lists.sourceforge.net
Sent: Tuesday 12 May 2009 12:26:40 GMT +01:00 Amsterdam / Berlin / Bern / Rome / Stockholm / Vienna
Subject: [Nagios-users] report : Insufficient Data

Hi,

Last Thursday I stopped Nagios, deleted the files under /var/nagios and restarted it. Yesterday Nagios gave me information like this (when I choose report period: Last Week):

    State         Type / Reason      Time            % Total Time  % Known Time
    UP            Total              3d 10h 17m 9s   48.98%        100%
    Undetermined  Insufficient Data  3d 13h 42m 51s  51.02%
    All           Total              7d 0h 0m 0s

What does this "insufficient data" mean? How does Nagios calculate this availability report? Can anyone point me to some reading on how to interpret Nagios graphs? How should we treat these messages?
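The figures in the report quoted above are consistent with simple ratios of time. The short Python sketch below recomputes them from the three durations in the report, on the assumption that "known time" is total time minus undetermined time; that assumption fits these numbers, but it is an inference rather than something stated in the thread.

```python
import re


def to_seconds(duration):
    """Convert a string such as '3d 10h 17m 9s' to a number of seconds."""
    units = {"d": 86400, "h": 3600, "m": 60, "s": 1}
    return sum(int(n) * units[u] for n, u in re.findall(r"(\d+)([dhms])", duration))


up = to_seconds("3d 10h 17m 9s")
undetermined = to_seconds("3d 13h 42m 51s")
total = to_seconds("7d 0h 0m 0s")

known = total - undetermined          # assumed definition of "known time"
pct_total_up = 100 * up / total
pct_total_undetermined = 100 * undetermined / total
pct_known_up = 100 * up / known

print(f"{pct_total_up:.2f}% {pct_total_undetermined:.2f}% {pct_known_up:.0f}%")
```

This prints 48.98% 51.02% 100%, matching the report: the host was UP for every second Nagios has log data for, and the other 51% of the week is simply missing from the logs.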
Re: [Nagios-users] nagios to graph ethernet traffic.. can it ?
We use nagiosgrapher with the following config, which produces some graphs. (The plugin output includes the phrase "transferred nnn bits/s".)

    define ngraph{
        # Tweak service name to fit naming conventions in dev/live environment
        service_name     Cisco Traffic
        graph_log_regex  (\d+) bits/s in
        graph_value      InBitsS
        graph_units      Bits/s
        graph_legend     In Bits/s
        hide             yes
    }

    define ngraph{
        service_name     Cisco Traffic
        type             CDEF
        graph_units      Bits/s
        graph_value      NegInBitsS
        graph_legend     In Bits/s
        graph_calc       InBitsS,-1,*
        rrd_plottype     AREA
        rrd_color        00
        page             Bits/s
        hide             no
    }

    define ngraph{
        service_name     Cisco Traffic
        graph_log_regex  (\d+) bits/s out
        graph_value      OutBitsS
        graph_units      Bits/s
        graph_legend     Out Bits/s
        rrd_plottype     AREA
        rrd_color        ff
        page             Bits/s
    }

-----Original Message-----
From: James Pratt [mailto:jpr...@norwich.edu]
Sent: Wed 4/29/2009 1:09 AM
To: lance raymond; nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] nagios to graph ethernet traffic.. can it ?

Hi Lance,

I could be wrong, but I think most use either nagiosgraph or pnp4nagios (http://www.pnp4nagios.org/pnp/start) for graphing w/rrdtool... (I'm getting there someday, cacti works for now, let us know how it goes though!)

Cheers!
jamie

-----Original Message-----
From: lance raymond [mailto:lraym...@weatherflow.com]
Sent: Tuesday, April 28, 2009 5:04 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] nagios to graph ethernet traffic.. can it ?

OK, I now have (thanks to the group) my server up and currently just watching one box... many more to be added. We also run cacti for ethernet traffic, etc. and I'm wondering whether this can be done as well. I am looking on the nagios exchange site and don't see anything that can take that type of data and then graph it. If it is possible, what requirements do I need (and should I have already)?

I still need to install nagios graph and thought I would just start with my 1st simple machine, then add some ethernet traffic in/out to him, and once he is checking everything I can simply add other hosts...

Thanks as always..
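To make the ngraph definitions in the reply above a little more concrete: each graph_log_regex pulls one number out of the plugin output, and the graph_calc line "InBitsS,-1,*" is reverse Polish notation for "InBitsS times -1", which draws inbound traffic below the axis. The Python sketch below mimics both steps; the sample plugin output is invented, and the tiny RPN evaluator handles only names, numbers and the four basic operators, so it is an illustration rather than a model of what nagiosgrapher or RRDtool do internally.

```python
import operator
import re

IN_REGEX = re.compile(r"(\d+) bits/s in")     # same pattern as graph_log_regex
OUT_REGEX = re.compile(r"(\d+) bits/s out")

OPS = {"+": operator.add, "-": operator.sub,
       "*": operator.mul, "/": operator.truediv}


def eval_rpn(expression, values):
    """Evaluate a comma-separated RPN expression such as 'InBitsS,-1,*'."""
    stack = []
    for token in expression.split(","):
        if token in OPS:
            right = stack.pop()
            left = stack.pop()
            stack.append(OPS[token](left, right))
        elif token in values:
            stack.append(values[token])
        else:
            stack.append(float(token))
    return stack.pop()


# Hypothetical plugin output, for illustration only.
output = "OK - transferred 1200 bits/s in, 3400 bits/s out"

in_bits = int(IN_REGEX.search(output).group(1))
out_bits = int(OUT_REGEX.search(output).group(1))
neg_in = eval_rpn("InBitsS,-1,*", {"InBitsS": in_bits})

print(in_bits, out_bits, neg_in)
```

Plotting OutBitsS as a positive area and NegInBitsS as a negative one gives the familiar mirrored in/out traffic graph.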
Re: [Nagios-users] Check-ping Vs. Mac address table expiration
A workaround would be to run the check every 4 minutes -- the ARP entry won't time out then.

(It takes our boxes less than a millisecond to respond to an ARP request; 397ms would imply to me that something else is going on -- an ISDN link reacquiring, etc.)

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: Drew Weaver [mailto:drew.wea...@thenap.com]
Sent: 27 March 2009 12:25
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Check-ping Vs. Mac address table expiration

A very small number of hosts on my network serve important roles but aren't always in the MAC address table of the switch they're connected to. Their MAC address entry gets timed out by the default (I think it's 5 minutes) because no data has been sent to them in that amount of time.

For whatever reason, no matter what I do it seems like check-ping always thinks there is a problem. The response from the rarely used host looks like:

    Pinging 192.168.1.8 with 32 bytes of data:

    Reply from 192.168.1.8: bytes=32 time=397ms TTL=63
    Reply from 192.168.1.8: bytes=32 time<1ms TTL=63
    Reply from 192.168.1.8: bytes=32 time<1ms TTL=63
    Reply from 192.168.1.8: bytes=32 time<1ms TTL=63

397ms because the switch has to do ARP, etc...

Any suggestions how I can avoid this? So far I've tried adding -p 5 to the check-ping in hopes that it would somehow use the average of the 5 pings.

thanks,
-Drew
Re: [Nagios-users] A group of Nagios users are:
From: Martyn [mailto:mar...@chetnet.co.uk]
Subject: [Nagios-users] A group of Nagios users are:

> Just on a lighter note, what do we call a bunch of Nagios users; Nagiothions?

Relaxed :)

--
The probability of someone watching you is proportional to the stupidity of your action.

Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] Using '!' in check_* argument
-----Original Message-----
From: MAD [mailto:sri.lu...@free.fr]
Sent: 06 March 2009 17:41
To: Marc Powell; Nagios Users
Subject: Re: [Nagios-users] Using '!' in check_* argument

> I'm using Nagios 3.0.6 on CentOS 5.2. I tried escaping the (!) with a (\) but it didn't work. I forgot to mention (and I've just realized it is probably important) that I use NagiosQL to create the service. Is it possible that NagiosQL escaped the (\) rather than the (!) when it wrote the cfg file?

Look at the actual config file generated. You may need to escape the \ as well -- so you'd enter \\\!

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] Nagios authentication with Active directoy
> I want to authenticate my nagios with AD. I read a couple of documents on the net but no luck. If anyone is using this method of authentication, please let me know; I will appreciate your help.

I'm assuming you mean the Nagios UI, in which case you'll want to look for mod_auth_ldap and active directory.

The Nagios UI is just a web page; authentication is done via your web server. Assuming Apache, then mod_auth_ldap or mod_auth_kerberos should be where you are looking.

The way we do it is thus (specifically for ubuntu, YMMV).

Install libapache2-authenntlm-perl:

    # apt-get install libapache2-authenntlm-perl

Edit your apache config thus:

    <Directory /usr/local/nagios/sbin>
        Include /etc/ntlm/htaccess
        Options +ExecCGI
        AllowOverride None
    </Directory>

    Alias /nagios /usr/local/nagios/share
    <Directory /usr/local/nagios/share>
        Include /etc/ntlm/htaccess

        Options +ExecCGI
        AddHandler cgi-script .cgi
    </Directory>

And finally create /etc/ntlm/htaccess thus:

    PerlAuthenHandler Apache2::AuthenNTLM
    AuthType ntlm,basic
    AuthName Nagios
    require valid-user

    PerlAddVar ntdomain "MYDOMAIN dc25 dc26"
    PerlAddVar ntdomain "OTHERDOMAIN dc25 dc26"
    PerlAddVar ntdomain "THIRDDOMAIN dc25 dc26"

    PerlSetVar defaultdomain MYDOMAIN
    PerlSetVar fallbackdomain OTHERDOMAIN
    PerlSetVar splitdomainprefix 1
    PerlSetVar ntlmdebug 0

Replace dc25 with your local domain controller, dc26 with a backup one, MYDOMAIN with your main domain, and OTHERDOMAIN, THIRDDOMAIN, etc. with any other domains.

This does the AD authentication, and also allows machines to use transparent passthrough authentication.

Each user will need a nagios account, unless you have a default_user_name in your cgi.cfg (we do this).
Re: [Nagios-users] alternative Status Map?
-----Original Message-----
From: Tomasz Chmielewski [mailto:man...@wpkg.org]
Sent: 05 March 2009 15:56
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] alternative Status Map?

> Are there alternative Status Maps for Nagios? Current implementation has at least these limitations:
>
> 1. With lots of monitored hosts, the map becomes unreadable / does not fit on the screen.

That's true.

> 2. All nagios users can see the whole network structure for all hosts. Even if a given web user is not authorised to view certain hosts, he/she will see them anyway on the status map (without icons/descriptions). It can even be viewed as a security flaw.

Personally I'm not a fan of maps; there are very few cases where I think they're useful. However, NagVis might be what you want:
http://www.nagvis.org/

This kind of map could be useful for a server room:
http://www.nagvis.org/sites/default/files/screenshots/nagvis-1.3-rack.png

And perhaps this kind if you have a geographically spread system:
http://www.nagvis.org/sites/default/files/screenshots/nagvis_map_2.png

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] lazy nagios und cygwin...
> Hello, we are running nagios 3.0.5 under cygwin quite successfully in a relatively simple setup

That's surprising, but good news. I hope you don't rely on it.

> (most check results come via NRPE from NSCA++ clients on windows machines). Everything is well, but sometimes an awkward thing happens: nagios just stops doing anything.
>
> - It continues to run: the processes are visible in the Windows process table
> - It ceases to execute checks: checks just stop being executed
> - It ceases to write anything to a logfile
> - It also ceases to write anything to a debug file with Debug=16 (last entry is nothing suspicious)

Interesting definition of quite successfully.

> After killing the nagios processes and restarting the windows service (nagios is registered as a windows service using the cygrunsrv facility) everything is back to normal. What I find most annoying is that even in the CGI output everything seems normal - just that the last checks are several hours overdue, but no alarm or other signal is shown. We are not sure if this is cygwin/windows related or if it is a problem which also happens in the normal unix environment

I've never seen anything like it. Our main nagios installation has failed in the past -- the disk has filled up, preventing status updates, and the power has failed -- not the box UPS power, but the Aircon GS power, so we had to shut down as much as possible. If it happens again we won't be shutting down nagios; it's too essential. The aircon should be powered by diesel generators, but they didn't kick in (not nagios monitored, you see!)

> Comments would be very welcome. I will now turn up debugging further in the hope that then we see something interesting.

Cygwin != unix. At the core, windows deals with files and sockets in a very different way to linux and solaris (and probably other unixes). It wouldn't surprise me if something changed with the status file.

If you can't install a linux distro onto the box, run up a virtualbox environment to host nagios in. It wouldn't be perfect, but will probably be better than cygwin.
Re: [Nagios-users] So its running how I like it
We use an ubuntu based PXE builder, but the same principle applies. We've deployed half a dozen nagios instances so far, with another dozen to do by the end of the year.

The config is stored in subversion. Installing nagios hasn't been streamlined, as to be honest we rarely need to install from scratch.

(standard ubuntu-server build, with a few additions like ssh and snmpd)

    apt-get install subversion apache2 expect libbit-vector-perl \
        libcalendar-simple-perl libcarp-clan-perl libconfig-inihash-perl \
        libdate-calc-perl libdate-manip-perl libdbd-mysql-perl libdbi-perl \
        libdigest-hmac-perl libdigest-sha1-perl libgd-gd2-perl \
        libgraphics-magick-perl libhash-case-perl libhash-withdefaults-perl \
        libhtml-parser-perl libhtml-tagset-perl libhtml-tree-perl \
        libio-stringy-perl liblocale-gettext-perl libnagios-object-perl \
        libnet-daemon-perl libnet-dns-perl libnet-ip-perl \
        libnet-ping-external-perl libnet-snmp-perl libnet-socks-perl \
        libnet-telnet-perl libnumber-format-perl libpcre3 libperl5.8 \
        libplrpc-perl librpc-xml-perl librrds-perl libterm-readkey-perl \
        libtext-charwidth-perl libtext-iconv-perl libtext-wrapi18n-perl \
        libtimedate-perl liburi-perl libwww-mechanize-perl libwww-perl \
        libxml-libxml-common-perl libxml-libxml-perl \
        libxml-namespacesupport-perl libxml-parser-perl libxml-sax-perl \
        libxml-simple-perl sqsh

    su -
    cd /
    wget --user=username --password=pass http://svn/config/nagios/nagios-base.tar.gz
    tar -zxvf nagios-base.tar.gz
    rm nagios-base.tar.gz
    cd /usr/local/nagios
    svn --username username --password username co http://svn/config/nagios/libexec
    svn co http://svn/config/bristol etc
    addgroup --system nagios
    adduser --system --home /usr/local/nagios --shell /bin/false --no-create-home nagios
    usermod -a -G nagios npf
    usermod -a -G nagios www-data
    chown -R nagios:nagios /usr/local/nagios
    chmod -R g+rwX /usr/local/nagios
    ln /usr/local/nagios/etc/apacheconfig /etc/apache2/sites-available/nagios
    ln /usr/local/nagios/etc/init.d.nagios /etc/init.d/nagios
    ln -s /etc/apache2/sites-available/nagios /etc/apache2/sites-enabled/
    ln -s /usr/lib/libgd.so.2.0.0 /usr/lib/libgd.so.1
    /etc/init.d/apache2 reload
    update-rc.d nagios defaults
    cd /usr/local/nagios/var
    touch comments.dat downtime.dat nagios.log objects.cache retention.dat status.dat

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: James Pratt [mailto:jpr...@norwich.edu]
Sent: 17 February 2009 23:58
To: Martyn; nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] So its running how I like it

> -----Original Message-----
> From: Martyn [mailto:mar...@chetnet.co.uk]
> Sent: Tuesday, February 17, 2009 4:46 PM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] So its running how I like it
>
> But how can I make an install ISO that will allow me to install afresh to a new server, ask for username, password, IP address and so on, so that once the install is complete I have a running Nagios server?
>
> Thanks
> Martyn

Martyn, no real clue on what you are talking about, but it sounds like you want to build a linux system using kickstart (?) (it's a red hat linux thing, AFAIK):

http://www.google.com/search?q=redhat+kickstart

cheers,
jamie
Re: [Nagios-users] Nagios and tikiwiki integration
More a tikiwiki question. Tell them you want to link directly to an article. There should be a way.

With mediawiki, for example, you can link to
http://wiki.mycorp.co.uk/wiki/index.php?title=mypage#mysection

E.g. to document PING on host Barney, you would link to
http://wiki.mycorp.co.uk/wiki/index.php?title=barney#PING

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: Assaf Flatto [mailto:assaf.fla...@ssp-intl.com]
Sent: 18 February 2009 15:45
To: Nagios User list
Subject: [Nagios-users] Nagios and tikiwiki integration

Hello list,

I'm trying to integrate the nagios notes_url capability with my wiki at the office. Nagios is 3.0.6 (from source) and the tikiwiki is from RPM; each is installed on a separate machine. The tikiwiki requires authentication, using the internal capability of the tikiwiki DB.

I followed some how-tos found on the net, but none seem to work. Has anyone been able to do such a thing?

Thanks

--
Assaf Flatto
SSP Ops Team
Linux System Administrator
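The mediawiki-style link in the reply above (page title from the host, section anchor from the service) is easy to generate mechanically. The Python sketch below builds such a URL; the function name is made up, and replacing spaces with underscores in the anchor follows mediawiki's usual convention, which may not carry over to tikiwiki.

```python
from urllib.parse import quote


def wiki_notes_url(base, host, service=None):
    """Build an index.php?title=<host>#<service> link in the style shown above."""
    url = f"{base}?title={quote(host)}"
    if service:
        # Assumption: section anchors use underscores instead of spaces.
        url += "#" + quote(service.replace(" ", "_"))
    return url


print(wiki_notes_url("http://wiki.mycorp.co.uk/wiki/index.php", "barney", "PING"))
```

Within Nagios itself the same pattern can normally be written once in a template using macros, along the lines of notes_url http://wiki.mycorp.co.uk/wiki/index.php?title=$HOSTNAME$#$SERVICEDESC$; it is worth checking the macro documentation for your version before relying on that.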
Re: [Nagios-users] So its running how I like it
The dependencies, hand-config user-creation, and the creation of the ghost files can all be done with a proper RPM, which then means that two installations (by different people) are able to help each other a bit more easily. It may also help the original OT poster get a canned install with little hand-monkeying, so it is easier to approach a canned ISO or a VM disk image.

Whether it's worth it depends on how much you'll use it. The config should always be separate, and I'd suggest libexec too (we're always adding custom plugins).

If you run in a mixed-mode environment (some 32-bit, some 64-bit), separate your architecture-independent Perl scripts from your binaries by using two libexec directories.

Paul, did you think of using the RPM, and indicating the shortfalls (such as the Perl dependencies that aren't portable to the host's RPM dependencies; those always pop up) or other issues with the RPM?

We have 35 custom .debs in a local repository. One of our Nagios checks runs apt-get update every night for security checking, so we see which machines need upgrades. We haven't written a plugin that needs a Perl module which isn't in the OS repository. It's easier to keep up to date that way, and there are around 1400 Perl modules in the apt repositories.

The testing that needs to go into the installation scripts (to do it right) for something like a Nagios install is much more intensive than most: which files to remove in preparation for upgrade, which to purge, which not to purge, what to do on failed upgrades, etc.

Much easier to run one command after another and check for errors yourself.
Re: [Nagios-users] having different view in nagios interface
> Thank you very much. Does someone have a piece of software that uses Nagios::StatusLog to share?

    perl -e '
      use Nagios::StatusLog;
      my $host        = "hostname";
      my $description = "service description";
      my $dataFile    = "/usr/local/nagios/var/status.dat";
      my $log = Nagios::StatusLog->new( Filename => $dataFile, Version => 2.0 );
      my $obj = $log->service( $host, $description );
      print "Service $description on $host is in state ".$obj->{last_hard_state}."\n";
      print "* This has been acknowledged\n" if $obj->{problem_has_been_acknowledged};
    '
Re: [Nagios-users] monitoring fans
> I am using the check_snmp_fans plugin to monitor fans on some of my ESX hosts on blades, but I can't seem to monitor fans on my actual physical servers and workstations. I get the error "Error: walkoid() returned nothing". Does anyone know why I am getting this error?

The SNMP OID that check_snmp_fans searches for isn't supported on your non-blade machines. Do those machines respond to normal SNMP? Do they have any server SNMP extensions that let you monitor things like PSU health?
Re: [Nagios-users] having different view in nagios interface
> For example, I would like to be able to mask the state of some less important services. A good example is servers needing updates. These are planned each week, so I would like to keep them in Nagios to have a full view of what needs to be done when it's time to do it.

We have two systems we're still playing with (after 18 months).

One is a separate Perl web page we wrote, which prints the services in a hierarchical way and goes red/yellow/green at various points, a bit like a check_cluster-style plugin. Really critical issues filter to the top; other issues are dealt with when people have some free time (at least in theory). It uses the Perl module Nagios::StatusLog to get the status, which is a right pig (it takes 4.5 seconds to import the log), but as it sits there on a plasma automatically refreshing, it doesn't matter too much how long it takes to display.

For our dev system, we use the following link:

    /nagios/cgi-bin/status.cgi?host=all&servicestatustypes=28&hoststatustypes=3&serviceprops=42&sorttype=1&sortoption=6

which only displays services that aren't acknowledged or in downtime, with the most recent at the top.

We have too many users, and they change too often, to have individual contacts. We use an NTLM module to do IE passthrough authentication with Active Directory, and default everyone to the same user, who has visibility of every host and service.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
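The numbers in the status.cgi link above are bit masks. The sketch below rebuilds the same query string from named flags so it is easier to see what is being filtered. The flag values are as I remember them from Nagios 3's cgiutils.h, so treat them as assumptions and check them against your own version; the function name problems_url is invented for this example.

```python
from urllib.parse import urlencode

# Filter bits as recalled from Nagios 3's cgiutils.h (assumption: verify locally).
SERVICE_WARNING, SERVICE_UNKNOWN, SERVICE_CRITICAL = 4, 8, 16
HOST_PENDING, HOST_UP = 1, 2
NO_SCHEDULED_DOWNTIME, UNACKNOWLEDGED, CHECKS_ENABLED = 2, 8, 32


def problems_url(base="/nagios/cgi-bin/status.cgi"):
    """Unhandled service problems on reachable hosts, newest state change first."""
    params = {
        "host": "all",
        "servicestatustypes": SERVICE_WARNING | SERVICE_UNKNOWN | SERVICE_CRITICAL,
        "hoststatustypes": HOST_PENDING | HOST_UP,
        "serviceprops": NO_SCHEDULED_DOWNTIME | UNACKNOWLEDGED | CHECKS_ENABLED,
        "sorttype": 1,    # ascending
        "sortoption": 6,  # sort by state duration
    }
    return f"{base}?{urlencode(params)}"


if __name__ == "__main__":
    print(problems_url())
```

Changing one flag (for example dropping CHECKS_ENABLED) and regenerating the link is less error-prone than editing the magic numbers by hand.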
Re: [Nagios-users] unable to send mail to external id
> Dear Sir, I have configured Nagios on Linux and I am getting email notifications at my local mail ID, but I don't get mail at my external ID (Gmail or Yahoo). Can you please help?

Nagios sounds like it is configured fine; it sounds like your local machine isn't configured to send email, though. Nagios uses the native Linux way of sending email, which often isn't configured. Try the following:

    # echo "Test from nagios" | mail -s "Test" root
    # echo "Test from nagios" | mail -s "Test" u...@yourcompany.com
    # echo "Test from nagios" | mail -s "Test" u...@gmail.com

That should send three emails. I would guess the first one works. If the others don't work, you need to set up your mail transfer agent (exim, sendmail etc.) to point to a smarthost: your company's internal SMTP gateway.

Alternatively, you could do the following:

1) Change your command definition

    define command{
        command_name    notify-by-email
        command_line    ***
        }

   Replace the end of the line, which probably says something like

    | /bin/mail -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$

   with

    | $USER1$/sendEmail.pl $CONTACTEMAIL$ "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **"

   E.g. (this only works for Nagios 2.9):

    /usr/bin/printf "%b" "* Nagios 2.9 *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | $USER1$/sendEmail.pl $CONTACTEMAIL$ "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **"

2) Add sendEmail.pl (http://195.50.87.86/~npf/sendEmail.pl) to your libexec directory, and change the mailServer and from settings to be valid for your local network.

--
Experience is what you get when you were expecting something else.
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
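For anyone who cannot fetch sendEmail.pl, the second approach above (hand the notification straight to an SMTP relay instead of the local MTA) might look roughly like this. It is a sketch under assumptions, not the script from the message: I am assuming the relay accepts plain SMTP on port 25 without authentication, and the names build_notification and send_via_smarthost are invented here.

```python
import smtplib
from email.message import EmailMessage


def build_notification(to_addr, from_addr, notification_type, host_alias,
                       service_desc, service_state, body):
    """Assemble a message with the same subject format as the command line above."""
    msg = EmailMessage()
    msg["From"] = from_addr
    msg["To"] = to_addr
    msg["Subject"] = (f"** {notification_type} alert - "
                      f"{host_alias}/{service_desc} is {service_state} **")
    msg.set_content(body)
    return msg


def send_via_smarthost(msg, mail_server, port=25):
    """Deliver through the internal SMTP gateway (assumed: no auth, no TLS)."""
    with smtplib.SMTP(mail_server, port, timeout=10) as smtp:
        smtp.send_message(msg)
```

Building the message is kept separate from sending it so the subject and body can be checked without a mail server to hand.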
Re: [Nagios-users] Which books are best for Nagios 3
> On Jan 19, 2009, at 11:25 AM, James Miller wrote:
>> Recently I upgraded us from 2.9 to 3.0.6 and I'm looking for recommendations for a book or two on 3.0 to take my knowledge to the next level. I've found the following books available and I'm wondering which ones are the best:
>>
>> Any suggestions would be _greatly_ appreciated!
>
> 5. This list. Try to discover the answer to as many questions as you can yourself by reading the docs and source and by trial. Try to understand the why of an answer you discover or that is provided by someone else. I consider it a much more interesting and varied resource than the few standard examples a book might have; plus it's free =).

Only once you reach a certain level. There are no doubt many lurkers who have trouble getting started. I know I did.

I got a copy of Building a Monitoring Infrastructure with Nagios (Feb07) and found that invaluable, but we're still on Nagios version 2.

A book is nicer to read on the train too.
Re: [Nagios-users] Still cannot access nagios page
I think the .htaccess files will be ignored; your directory stanza

    <Directory "/usr/local/nagios/sbin">

sets

    AllowOverride None

Having said that, it should work. Does the Apache error log show anything? What are the permissions on /usr/local/nagios/etc/htpasswd.users?

--
In a world without walls and fences, who needs windows and gates?
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

From: Khairuzzamri [mailto:k_za...@kannal.com.my]
Sent: 21 January 2009 15:18
To: Nagios Users Mailinglist
Subject: [Nagios-users] Still cannot access nagios page

Hi everyone,

I'm still stuck at the authentication popup. I cannot log in: even if I provide the correct username and password, the authentication popup still reappears.

I have done:

1. 'make install-init'

2. Copied the content of '/etc/httpd/conf.d/nagios.conf' into '/etc/httpd/conf/httpd.conf'

   nagios.conf content:

    ScriptAlias /nagios/cgi-bin "/usr/local/nagios/sbin"

    <Directory "/usr/local/nagios/sbin">
    #  SSLRequireSSL
       Options ExecCGI
       AllowOverride None
       Order allow,deny
       Allow from all
    #  Order deny,allow
    #  Deny from all
    #  Allow from 127.0.0.1
       AuthName "Nagios Access"
       AuthType Basic
       AuthUserFile /usr/local/nagios/etc/htpasswd.users
       Require valid-user
    </Directory>

    Alias /nagios "/usr/local/nagios/share"

    <Directory "/usr/local/nagios/share">
    #  SSLRequireSSL
       Options None
       AllowOverride None
       Order allow,deny
       Allow from all
    #  Order deny,allow
    #  Deny from all
    #  Allow from 127.0.0.1
       AuthName "Nagios Access"
       AuthType Basic
       AuthUserFile /usr/local/nagios/etc/htpasswd.users
       Require valid-user
    </Directory>

3. Created two .htaccess files, in '/usr/local/nagios/sbin' and '/usr/local/nagios/share'

   .htaccess content:

    AuthName "Nagios Access"
    AuthType Basic
    AuthUserFile /usr/local/nagios/etc/htpasswd.users
    require valid-user

I'm using Redhat 9 as the O/S, nagios 3.0.3.tar.gz and nagios-plugins-1.4.12.tar.gz.

Please help identify my problem. Any suggestion is highly appreciated.

Thanks and regards.
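On the permissions question above: the web server's user has to be able to read htpasswd.users, otherwise every login is rejected and the popup keeps coming back. A quick way to see what the file looks like is sketched below; describe_permissions is a made-up helper, and it only reports the mode bits and owner IDs rather than deciding whether your particular Apache user can read the file.

```python
import os
import stat


def describe_permissions(path):
    """Report the mode bits and ownership that decide who can read the file."""
    st = os.stat(path)
    mode = stat.S_IMODE(st.st_mode)
    return {
        "mode": oct(mode),
        "uid": st.st_uid,
        "gid": st.st_gid,
        "group_readable": bool(mode & stat.S_IRGRP),
        "world_readable": bool(mode & stat.S_IROTH),
    }


if __name__ == "__main__":
    # Path from the thread; adjust for your install.
    print(describe_permissions("/usr/local/nagios/etc/htpasswd.users"))
```

If neither flag is set and the file is owned by a user other than the one Apache runs as, that would be worth fixing before looking anywhere else.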
Re: [Nagios-users] Nagios not updating/dead
> Today we had a problem where, more or less, Nagios was not doing anything at all. All the services had last check times that were a couple of hours old. Also I could not force checks through the Web-GUI. I tried restarting Nagios, but that did not work. It wasn't until after I rebooted that machine that things started working again. After the reboot, we did some investigation and we found this in the Apache

Are you sure you didn't simply run out of space on the relevant partition, and a reboot deleted a bunch of temporary files?

Either that, or could the partition have become read-only (some disk fault)?

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
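Both suspicions above (a full partition, or one remounted read-only) can be checked in a few lines before reaching for a reboot. This is a minimal sketch for Unix-like systems; partition_health is an invented name, and the path you pass in would be wherever your Nagios var directory lives.

```python
import os
import shutil


def partition_health(path):
    """Report free space and whether the filesystem holding path is read-only."""
    usage = shutil.disk_usage(path)
    flags = os.statvfs(path).f_flag
    percent_used = round(100 * usage.used / usage.total, 1) if usage.total else 0.0
    return {
        "free_bytes": usage.free,
        "percent_used": percent_used,
        "read_only": bool(flags & os.ST_RDONLY),
    }


if __name__ == "__main__":
    # Path from the default source install; adjust as needed.
    print(partition_health("/usr/local/nagios/var"))
```

Wrapped in a plugin, the same check would have warned about the problem before Nagios stopped writing its results.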
Re: [Nagios-users] Error in Commit
> Hi there, I am getting the following error while running the commit command on Nagios:
>
>     Could not stat command file () Usr/local/Nagios/var/rw/Nagios.cmd

It may also be related to the capitalisation in that path. Does it really state

    Can't Stat Usr/local/Nagios/var/rw/Nagios.cmd

or

    Can't Stat /Usr/local/Nagios/var/rw/Nagios.cmd

It should say

    Can't Stat /usr/local/nagios/var/rw/nagios.cmd

(note all lowercase), unless you have a very strange setup.
Re: [Nagios-users] Failover monitoring and acknowlegements
> I thought I couldn't rsync the files because the running Nagios process on the failover server wouldn't notice the changed files. However, if I can have a script run check_nagios via check_by_ssh from a cron job (I don't know yet; I've never used check_by_ssh before), then Nagios doesn't need to be running all the time on the failover server. The cron job can start the failover Nagios if the primary Nagios isn't running, and stop the failover Nagios when the primary returns. Since the configurations on both servers would be the same, rsync could even keep Nagios itself updated on the failover server without me having to do it manually.

That's the kind of thing I was thinking of. We already use Subversion to keep our libexec directories the same over (currently) 3 sites, and to keep the configs up to date, but the var directory gets a nightly rsync to the backup server.

If you want to do load balancing across Nagios instances though, you should look at the full hog

--
There is no time like the present for postponing what you ought to be doing.
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: Paul Weaver [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, December 03, 2008 11:09 AM
To: Chris Beattie; nagios-users@lists.sourceforge.net
Subject: RE: [Nagios-users] Failover monitoring and acknowlegements

For just a failover installation, are there any issues with rsyncing the files from master to slave every minute?

From: Chris Beattie [mailto:[EMAIL PROTECTED]]
Sent: 03 December 2008 15:22
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Failover monitoring and acknowlegements

Er, both servers are Nagios 3.0.5 running on CentOS 5.2, sorry.

From: Chris Beattie [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, December 03, 2008 9:53 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Failover monitoring and acknowlegements

I'm setting Nagios up in a failover configuration as described in the documentation, with the primary server feeding check results to the failover server via NSCA. That part's working fine, but is there a good way to also keep host and service comments, acknowledgements, and scheduled downtimes up to date as well?
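The cron-driven failover described above boils down to a small decision table, sketched here. This is just the logic, not a tested failover script: the two inputs are assumed to come from elsewhere (for instance the exit code of check_nagios run over check_by_ssh, and a local process check), and the function names are invented.

```python
def primary_is_ok(plugin_exit_code):
    """Nagios plugins exit 0 for OK; anything else counts as 'primary not healthy'."""
    return plugin_exit_code == 0


def failover_action(primary_ok, failover_running):
    """Decide what the cron job on the failover server should do this run."""
    if not primary_ok and not failover_running:
        return "start"   # primary is gone: bring up the standby Nagios
    if primary_ok and failover_running:
        return "stop"    # primary is back: stand down again
    return "none"        # already in the right state


if __name__ == "__main__":
    print(failover_action(primary_is_ok(2), failover_running=False))
```

The rsync of the var directory would run in the same cron job, but only while the action is "none" and the primary is healthy, so a standby that has taken over does not have its state overwritten.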
Re: [Nagios-users] SPAM: : Checking for Apache Errors
I just ran through a 639,000-line gzip-compressed Apache log file in 7 seconds, using

    zcat access.log.gz | awk '{print $9}' | sort | uniq -c

That gives you each return code (404, 302, 200) and the number of times it occurred. Turning that into a percentage is an exercise for the reader, but to look only at the most recent requests (the last 1000 here; adjust to taste) you could start from something like

    tac /var/log/apache/access.log | head -1000 | awk '{print $9}' | sort | uniq -c

--
There is no time like the present for postponing what you ought to be doing.
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

From: Paulus, Jake [mailto:[EMAIL PROTECTED]]
Sent: 03 December 2008 23:59
To: Jonah Horowitz; nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] SPAM: : Checking for Apache Errors

I would really recommend Splunk for this task over Nagios... assuming you don't generate more than 500 MB of logs per day, it's free!

Assuming you really wanted to do it in Nagios, you'd probably be stuck writing your own plugin in C in order to make it fast enough to comb through MB of logs without too much overhead (like the Nagios avail.cgi does...)

-Jake

From: Jonah Horowitz [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, December 03, 2008 6:46 PM
To: nagios-users@lists.sourceforge.net
Subject: SPAM: :[Nagios-users] Checking for Apache Errors

I'm trying to check my Apache logs to make sure there is not more than a reasonable number of 404 errors for any given chunk of time. Does anyone have an Apache log checking plugin? I would prefer something that allowed me to alarm if the number of 404s exceeded a certain percentage.

I didn't see anything on nagiosexchange, and check_log is a bit too generic for what I'm trying to do. If nothing exists, I'll probably write something.

--
Jonah Horowitz
Monitoring Manager
LookSmart - Premium and Performance Advertising Solutions
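The "exercise for the reader" above, turning the per-status counts into a percentage and a Nagios state, could be sketched like this. It assumes the same log layout as the awk command (status code in the ninth whitespace-separated field); the thresholds and the names status_percentages and check_404 are made up for the example.

```python
from collections import Counter


def status_percentages(lines, status_field=8):
    """Share of requests per HTTP status; awk's $9 is index 8 here."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) > status_field:
            counts[fields[status_field]] += 1
    total = sum(counts.values())
    if not total:
        return {}
    return {code: 100.0 * n / total for code, n in counts.items()}


def check_404(lines, warn=5.0, crit=10.0):
    """Return a (Nagios exit code, message) pair based on the 404 percentage."""
    pct = status_percentages(lines).get("404", 0.0)
    if pct >= crit:
        return 2, f"CRITICAL - {pct:.1f}% of requests returned 404"
    if pct >= warn:
        return 1, f"WARNING - {pct:.1f}% of requests returned 404"
    return 0, f"OK - {pct:.1f}% of requests returned 404"
```

Feeding it only the tail of the log (as the tac and head pipeline does) keeps the check cheap and makes the percentage reflect recent traffic rather than the whole day.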
Re: [Nagios-users] Failover monitoring and acknowlegements
For just a failover installation, are there any issues with rsyncing the files from master to slave every minute?

--
Watch out for that tree
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

From: Chris Beattie [mailto:[EMAIL PROTECTED]]
Sent: 03 December 2008 15:22
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Failover monitoring and acknowlegements

Er, both servers are Nagios 3.0.5 running on CentOS 5.2, sorry.

From: Chris Beattie [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, December 03, 2008 9:53 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Failover monitoring and acknowlegements

I'm setting Nagios up in a failover configuration as described in the documentation, with the primary server feeding check results to the failover server via NSCA. That part's working fine, but is there a good way to also keep host and service comments, acknowledgements, and scheduled downtimes up to date as well?
Re: [Nagios-users] check for crash?
> I'm wondering if anyone can give me suggestions as to how to monitor a specific process to make sure it keeps running properly. The specific issue I have is that I have a Java Webstart applet running on OpenBSD 4.4 that, from time to time, randomly crashes. I can't just use check_process (I don't think so, at least) because when it crashes the process never actually exits; it just stops working. I have to go in and manually kill it before I can restart it.

When monitoring something new:

1) How do you personally detect that it's crashed?
2) Can that be automated?
3) If so, write a plugin.
4) If not, look at the things the applet has an effect on.

For example: imagine a web browser pointing at a refreshing web page. You can tell the web browser is up (check_process), but that doesn't tell you it's actually refreshing the page. To check that, you could additionally monitor the Apache log on the web server for the last time the IP address loaded the page. Something like

    tac /var/log/apache2/access.log | grep $IP | head -1 | awk '{print $4 $5}'

Without knowing what the applet does, though, it's difficult to know how to monitor it.
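To turn the browser example above into an actual check, the timestamp that the tac and grep pipeline prints has to be compared with the current time. A rough sketch follows, assuming the usual Apache timestamp format in fields 4 and 5; unlike the grep, it matches the IP against the first field only, and the names last_seen and check_freshness are invented.

```python
from datetime import datetime, timedelta, timezone


def last_seen(lines, ip):
    """Timestamp of the most recent request from ip, or None if there is none."""
    for line in reversed(list(lines)):
        fields = line.split()
        if len(fields) >= 5 and fields[0] == ip:
            # Fields 4 and 5 look like: [03/Dec/2008:23:59:01 +0000]
            stamp = f"{fields[3]} {fields[4]}".strip("[]")
            return datetime.strptime(stamp, "%d/%b/%Y:%H:%M:%S %z")
    return None


def check_freshness(lines, ip, now, max_age=timedelta(minutes=5)):
    """Return (Nagios exit code, message): CRITICAL if the client has gone quiet."""
    seen = last_seen(lines, ip)
    if seen is None:
        return 2, f"CRITICAL - no requests from {ip} in the log"
    age = now - seen
    if age > max_age:
        return 2, f"CRITICAL - {ip} last seen {int(age.total_seconds())}s ago"
    return 0, f"OK - {ip} last seen {int(age.total_seconds())}s ago"
```

In a plugin, now would be datetime.now(timezone.utc) and the lines would come from the tail of the access log. The same pattern works for anything the applet touches that leaves a timestamp behind.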
Re: [Nagios-users] check snmp
I think you should use check_tcp to see if the port is available.

SNMP is UDP based, and won't respond without a valid community string.

You could always do an SNMP check to see if snmpd is running ;)

If you want to use check_snmp to check SNMP itself, just ask for the uptime OID (or any OID, as you say); as long as it replies, the service is available.

#Any old oid, any old oid, any any any old oid?
Re: [Nagios-users] Nagios for client management?
> Nagios seems to be a good client monitor system. I was wondering if anyone also uses it as a client management system? By management, I mean running update scripts and other software on various Linux clients.

The closest thing we do is monitor patches on machines.

For Debian machines, we use a modified check-apt Perl script (originally from http://www.logix.cz/michal/devel/nagios, modified to do things like run apt-get update first).

For Windows machines, we SNMP-poll the server and compare the listed KB patches against an internal list.

For Solaris machines we're looking at check_solaris_pca.
Re: [Nagios-users] several separate status maps with one nagios process- possible?
> -----Original Message-----
> From: Tomasz Chmielewski [mailto:[EMAIL PROTECTED]]
> Sent: 30 October 2008 11:40
>
> Is it possible to set up the web interface so that it shows several *separate* status maps (i.e., in different browser windows on different displays)?
>
> My status map hasn't fit on one screen for some time now, and it's getting worse as I keep adding new hosts (more than 100 now).

Once you get to an installation of your size, you should probably look at other mapping tools; the built-in statusmap isn't that great.

NexSM and NagVis are two candidates.
Re: [Nagios-users] Advanced authentication
> applications. Once they're logged in there, we would pass their username over to Nagios via URL query string or hidden form value, etc. (not going for maximum security).

Indeed not: anyone who wants can pretend to be anyone else. Still, if it's an internal intranet and you capture the IP addresses, you can find out whodunit.

> On the Nagios server we would use PHP to populate the REMOTE_USER Apache ENV variable with the passed username so that Nagios can compare that to the contacts and provide them with their applicable views.

I'm afraid you can't do that. You would set REMOTE_USER (or any other environment variable) for the scope of that PHP page ONLY.

One option would be to edit all the Nagios CGIs to look for the username in the query string, and set the variable at the top of the CGI (in main). Or replace all instances of ENV.REMOTE_USER with a function which

Another option might be to execute the CGIs from a PHP page which pre-sets the username. You could use mod_rewrite to rewrite *.cgi to nagportal.php?cgi=*.cgi.

In both cases you would have to maintain the username throughout the session. If you edit all the links in Nagios on the fly to pass in the username (change status.cgi to status.cgi?username=bob), you would have a problem with people emailing links to others.

One option for this would be to use PHP's session capabilities to store the username. You would set a cookie containing the username and read it on each call. That would work with both the PHP wrapper and the editing-the-CGI approach.

Both of those solutions are a fair amount of effort. A final solution could be using the Apache SetEnvIf directive (mod_setenvif). Something like

    SetEnvIf Cookies:.*loginname=(username);.* REMOTE_USER=\1

But I don't think SetEnvIf will let you do that.

What you're trying to do isn't that easy, as it goes against normal authentication practices.

> Is it necessary to modify the Nagios/cgi config in any way to achieve what we're doing?

Perhaps

> Is it necessary to modify the Apache config?

Perhaps

> Is it necessary to set any other ENV variables besides REMOTE_USER?

No

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
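For the "run the CGI from a wrapper page which pre-sets the username" idea in the Advanced authentication thread above, here is a minimal sketch of the same approach in Python rather than PHP. It assumes the login cookie is called loginname (borrowed from the SetEnvIf example) and that the wrapper is itself invoked as a CGI, so the browser's cookies arrive in HTTP_COOKIE. The function names and the CGI path are hypothetical. As noted in the thread, this is only as trustworthy as the cookie: anyone who can set it can pretend to be anyone else.

```python
import os
import subprocess
from http.cookies import SimpleCookie

COOKIE_NAME = "loginname"  # assumed cookie name, taken from the SetEnvIf example


def env_with_remote_user(environ):
    """Return a copy of environ with REMOTE_USER taken from the login cookie."""
    env = dict(environ)
    cookie = SimpleCookie()
    cookie.load(environ.get("HTTP_COOKIE", ""))
    if COOKIE_NAME in cookie:
        env["REMOTE_USER"] = cookie[COOKIE_NAME].value
    else:
        # No cookie: do not pass a stale or inherited REMOTE_USER through.
        env.pop("REMOTE_USER", None)
    return env


def run_cgi(cgi_path, environ=None):
    """Run a Nagios CGI as a child process with REMOTE_USER pre-set.

    cgi_path is hypothetical, e.g. /usr/local/nagios/sbin/status.cgi.
    Returns the raw CGI output (headers and body) as bytes.
    """
    env = env_with_remote_user(os.environ if environ is None else environ)
    result = subprocess.run([cgi_path], env=env, stdout=subprocess.PIPE, check=True)
    return result.stdout
```

Because the username travels in a cookie rather than in the links, emailing a status.cgi URL to someone else does not hand over the sender's identity.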
Re: [Nagios-users] Unmanaged switch
> On Monday October 20 2008 11:38:04 am Paul Weaver wrote:
>> How about a new command called fakecheck, and apply to a dumbswitch host with a 127.0.0.1 address. It would take a list of machines, and check them all. If any of them are up, then the fakecheck would return OK, otherwise it would return an error state.
>
> Just for understanding, you would check if the hosts which are connected to this dumbswitch are up? But they are tested anyhow..

If you have 10 devices on the unmanaged switch, and all 10 are off, it's likely the switch is broken -- so set the switch to down, and the devices to unreachable.

If 9 are off, and one is on, the switch *can't* be broken.

You could check the current state of each device (check_cluster), but

1) All devices go off, check_cluster goes bad
2) All devices then go unreachable
3) check_cluster stays bad for ever (won't go good until a device goes good, which won't happen if they are unreachable and aren't checked), or it goes unreachable (inheriting the state of the devices), or goes good (if it treats unreachable as OK).

None of those is the correct action.

Pings are cheap.
Re: [Nagios-users] mediawiki - include the nagios status report
Use an iframe to include http://nagios/cgi-bin/status.cgi?servicegroup=all&style=summary -- you'd need a mediawiki plugin to allow the use of iframes.

The use of "actualize" does ask questions..

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Work: 020 822 58109
Room 1244 Television Centre, Wood Lane, London, W12 7RJ

-----Original Message-----
From: Bianca [mailto:[EMAIL PROTECTED]]
Sent: 20 October 2008 10:58
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] mediawiki - include the nagios status report

Hello all,

I'm creating a webpage with mediawiki and I want to include there the Status Summary For All Service Groups. But I want this status report to actualize itself. How can I manage this?
Re: [Nagios-users] Unmanaged switch
How about a new command called fakecheck, and apply to a dumbswitch host with a 127.0.0.1 address. It would take a list of machines, and check them all. If any of them are up, then the fakecheck would return OK, otherwise it would return an error state.

Something like

    #!/usr/bin/perl
    use strict;
    use warnings;

    # Devices plugged into the unmanaged switch.
    my @hosts = qw/10.1.2.3 10.1.2.4 10.1.2.5 10.1.2.6/;
    my $up = 0;

    # pingCheck: assumed helper; relies on a Linux-style ping
    # (-c count, -W timeout in seconds) being on the PATH.
    sub pingCheck {
        my ($addr) = @_;
        return system("ping -c 1 -W 2 $addr >/dev/null 2>&1") == 0;
    }

    foreach my $addr (@hosts) {
        $up++ if pingCheck($addr);
    }

    if ($up > 0) {
        print "OK: Switch must be up\n";
        exit 0;
    } else {
        print "CRITICAL: Switch may be down\n";
        exit 2;
    }

(If you're good at programming you could probably get a list of children out of the config -- complex configs take a long time to read in using perl though)

Another way of monitoring would be to have it powered off an SNMP enabled MDU which can tell the current being drawn. You could then measure that.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Work: 020 822 58109
Room 1244 Television Centre, Wood Lane, London, W12 7RJ

-----Original Message-----
From: Mike [mailto:[EMAIL PROTECTED]]
Sent: 20 October 2008 09:48
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Unmanaged switch

Folks,

What is the best way to include an unmanaged (read: IPless) switch in Nagios? Obviously I can't monitor it directly, but I'd like Nagios to be aware of it in the network hierarchy so that it is aware that it is a possible point of failure.

Can I just define it as a host with no IP address and make it a parent/client of the devices that attach to it or is there a better way to approach this?

Mike.
Re: [Nagios-users] mediawiki - include the nagios status report
-----Original Message-----
From: Heigl, Bianca [mailto:[EMAIL PROTECTED]]
Sent: 20 October 2008 11:56
To: Paul Weaver
Subject: AW: [Nagios-users] mediawiki - include the nagios status report

> Thanks a lot, I will try this. Sorry, I didn't mean actualize, I meant that it should update itself every 90 minutes, also at the mediawiki page.

Using "Actualise" raises a warning flag that a manager has got involved :)

A snapshot every 90 minutes would be harder -- you'd need a cron job running to download the view -- wget should help. Nagios can give different views depending on the user; if you wanted that you'd have to think again about your downloading.

As long as you don't mind the Nagios authentication, an iframe would seem the best solution. You'll need to install something like http://www.mediawiki.org/wiki/Extension:Website_in_iFrame onto your mediawiki server though.
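If the snapshot route is wanted after all, the download step could look roughly like the sketch below, written in Python instead of wget. The URL and the output path are placeholders that do not come from this thread, and the sketch does not deal with Nagios authentication at all, so it only works as-is against a status page that can be fetched without logging in.

```python
import os
import tempfile
import urllib.request

# Placeholder values for illustration; neither appears in the thread.
STATUS_URL = "http://nagios.example.com/cgi-bin/status.cgi?servicegroup=all&style=summary"
SNAPSHOT_PATH = "/var/www/wiki/nagios-summary.html"


def fetch_status(url, timeout=30):
    """Download the status page and return its body as bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def save_snapshot(body, path):
    """Write body to path via a temporary file, then rename it into place.

    The rename means a reader never sees a half-written page.
    """
    directory = os.path.dirname(path) or "."
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "wb") as handle:
        handle.write(body)
    os.chmod(tmp_path, 0o644)  # mkstemp creates the file readable by its owner only
    os.replace(tmp_path, path)


# Typical use from a scheduled job:
#     save_snapshot(fetch_status(STATUS_URL), SNAPSHOT_PATH)
```

The wiki page would then embed or link the saved file rather than the live CGI.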
Re: [Nagios-users] How do I check if a website is up?
However, if you use a squid proxy, running the service on the cache would be a better choice. If not, running on your external gateway might be a good idea. After all, if your internet connection goes down, it doesn't mean www.google.com isn't available -- it means you can't see it.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Work: 020 822 58109
Room 1244 Television Centre, Wood Lane, London, W12 7RJ

-----Original Message-----
From: Assaf Flatto [mailto:[EMAIL PROTECTED]]
Sent: 20 October 2008 16:03
To: Nagios User list
Subject: Re: [Nagios-users] How do I check if a website is up?

No, you don't have to define a host. You define a service check with the fixed parameters (as I gave in the previous mail) and assign this check to an existing host. That host will execute the check and report the status returned from the command (personally I'd run it from the Nagios server itself).

On Monday 20 October 2008 15:58:25 Josh Wells wrote:
> Ok so if the website is external to my network I need to create a host definition? What would that look like for a URL? I've only made host definitions for internal hosts using the IP address. Because the website is a hosted solution I would rather define it by URL than IP address. Is this possible?
>
> -----Original Message-----
> From: Assaf Flatto [mailto:[EMAIL PROTECTED]]
> Sent: Monday, October 20, 2008 10:46 AM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] How do I check if a website is up?
>
> Not necessarily. If you have that website in your network, you can just define the check_http check to that host. Also, you can add in one of the parameters the specific URL you wish to query.
>
>     -u, --url=PATH
>        URL to GET or POST (default: /)
>
> Or you can define a check on the Nagios server itself to query the web site like this:
>
>     # 'check_website' command definition
>     define command{
>             command_name    check_website
>             command_line    $USER1$/check_http -H www.website.com
>             }
>
> If not, then you need to build a new host definition and service check for that host.
>
> On Monday 20 October 2008 15:18:53 Josh Wells wrote:
>> Do I have to define a remote host such as www.website.com in order to apply the check_http service against that host?

--
Assaf Flatto
SSP Ops Team
Linux System Administrator

IMPORTANT. This email and the information in it may be confidential, legally privileged and/or protected by law. It is intended solely for the use of the person to whom it is addressed. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Please also delete all copies of this email and any attachments from your system. We cannot guarantee the security or confidentiality of email communications. We do not accept any liability for losses or damages that you may suffer as a result of your receipt of this email including but not limited to computer service or system failure, access delays or interruption, data non-delivery or mis-delivery, computer viruses or other harmful components. Copyright in this email and any attachments belong to Select Service Partner UK Limited. Should you communicate with anyone at Select Service Partner UK Limited by email, you consent to us monitoring and reading any such correspondence. Nothing in this email shall be taken or read as suggesting, proposing or relating to any agreement concerted practice or other practice that could infringe UK or EC competition legislation. Select Service Partner UK Limited is a company registered in England and Wales (company number 05687183) whose registered office is at 1 The Heights, Brooklands, Weybridge. Surrey. KT13 0NY
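To make the "status returned from the command" part of this thread concrete, the sketch below shows the general shape of a website check: fetch the URL, then turn the outcome into a one-line message and one of the standard plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL). It is not a reimplementation of check_http; the mapping from HTTP status to state is an assumption chosen for the example, and the function names are made up.

```python
import urllib.error
import urllib.request

OK, WARNING, CRITICAL = 0, 1, 2  # standard Nagios plugin exit codes


def classify(status_code):
    """Map an HTTP status code to a (exit_code, message) pair.

    The thresholds are an assumption for this sketch.
    """
    if 200 <= status_code < 400:
        return OK, "HTTP OK: status %d" % status_code
    if 400 <= status_code < 500:
        return WARNING, "HTTP WARNING: status %d" % status_code
    return CRITICAL, "HTTP CRITICAL: status %d" % status_code


def check_website(url, timeout=10):
    """Fetch url and report its state; network failures count as CRITICAL."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return classify(response.status)
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses are raised as exceptions by urlopen.
        return classify(error.code)
    except (urllib.error.URLError, OSError) as error:
        return CRITICAL, "HTTP CRITICAL: %s" % error
```

A plugin built on this would print the message and exit with the code. Note that a CRITICAL from a network failure says as much about where the check runs as about the website, which is the point made above about running it on the proxy or gateway.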
Re: [Nagios-users] Nagios 3 structure
I would suggest putting your Nagios configuration into Subversion. Initially you can see who makes changes, and roll back any you don't like. You could then move to adding hooks to check the config beforehand.

Perl scripts are all you need for batch updates (like adding n hosts to a new service group, which I started doing a couple of days ago).

Our (Nagios 2) layout is a little like this:

    ./etc/nagios.cfg
        inherits commands.cfg (all commands)
        inherits time.cfg (all time periods)
        inherits template.cfg (default template -- hourly notification, 24/7 checks, 4 minute checks, 30 second retries)
        inherits directories
            contacts
            system-infrastructure (routers, dns, etc)
            system-system1
            system-system2
            system-system3

Each of those has its own templates, which override things like contact details, and subdirectories based on the system (so system-foo contains components midtier, database, media, etc).

Each directory eventually has a single file (usually hostname.cfg) for each host, containing the host definition and services. E.g.

    ./system-live/template_live.cfg
    ./system-live/quantel/template_live_quantel.cfg
    ./system-live/quantel/bidir1.cfg
    ./system-live/quantel/clipcache06.cfg
    ./system-live/quantel/clipcache01.cfg
    ./system-live/midtier/template_live_midtier.cfg
    ./system-live/midtier/newsjtcapp02.cfg
    ./system-live/midtier/newsjtcapp23.cfg
    ./system-live/midtier/newsjtcapp11.cfg
    ./system-live/other/template_live_other.cfg
    ./system-live/other/newsjtcmpp26.cfg
    ./system-live/other/newsjtcvtr01.cfg
    ./system-live/other/newsjtcmpp16.cfg
    ./system-live/other/newsjtccodec04.cfg

--
Paul Weaver
Systems Development Engineer

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Ronaldo A. Bueno Filho
Sent: 11 September 2008 05:11
To: Taylor Dondich
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios 3 structure

Good! I will take a look. Is this like NagiosQL?

In the future, I will install centreon 2.
I think it will not be a problem, but just to confirm: is there any problem if I use centreon 2 if in the past I have used Lilac?

Thanks

--
Ronaldo A. Bueno Filho

----- Original Message -----
From: Taylor Dondich [EMAIL PROTECTED]
To: Ronaldo A. Bueno Filho [EMAIL PROTECTED]
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios 3 structure
Date: 10/09/08 12:33

You could take a look at Lilac, a configuration tool I wrote. This way you do not have to deal with the layout of the file structure, but instead work inside an easy to use GUI. Lilac supports Nagios 3 and full dependencies, escalations, etc. www.lilacplatform.com is the project page.

Otherwise, you can structure the files in any way you want. Don't feel like you need to maintain the structure. You can ALWAYS move them around at a later time, because the cfg_dir directive in the Nagios main configuration file is recursive in nature, so changing it later on will not disrupt it.

Taylor

On Wed, Sep 10, 2008 at 1:18 AM, Ronaldo A. Bueno Filho <[EMAIL PROTECTED]> wrote:
>
> Hi people!
>
> The reason I am sending this e-mail is that I am not sure how I can organize
> the Nagios 3 structure.
> I just made a tree layout (shown below) showing how I thought about the
> organization (folders and files).
> The environment consists of several customers, my own network and some other
> services that I can use someday, like to monitor web pages, etc.
> I must organize the files in order to facilitate the management, because I will
> update, change, delete and insert new services, customers, configurations
> and stuff.
>
> I also have some questions regarding host dependency, service dependency,
> host escalation and service escalation. These questions are after the tree
> layout.
>
> objects (main folder)
> |
> |
> |--hosts (folder)
> |  |--host_groups (folder)
> |  |  |--hostgroup1.cfg
> |  |  |--hostgroup2.cfg
> |  |  |--hostgroupn.cfg
> |  |
> |  |--own_corp (folder)
> |  |  |--corp_host1.cfg
> |  |  |--corp_host2.cfg
> |  |  |--corp_hostn.cfg
> |  |
> |  |--customers (folder)
> |     |--customer1 (folder)
> |     |  |--cus1_host1.cfg
> |     |  |--cus1_host2.cfg
> |     |  |--cus1_hostn.cfg
> |     |
> |     |--customer2 (folder)
> |     |  |--cus2_host1.cfg
> |     |  |--cus2_host2.cfg
> |     |  |--cus2_hostn.cfg
> |     |
> |     |--customern (folder)
> |        |--cusn_host1.cfg
> |        |--cusn_host2.cfg
> |        |--cusn_hostn.cfg
> |
> |--services (folder)
> |  |--services_groups (folder)
> |  |  |--servicegroup1
> |  |  |--servicegroup2
> |  |  |--servicegroupn
> |  |
> |  |--services (folder)
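Going back to the batch updates mentioned earlier in this thread (adding a number of hosts to a new service group), the kind of script meant there can be very small. The sketch below is in Python rather than Perl; the function name, group name and service description are invented for the example. It relies on the members directive of a servicegroup being a single comma-separated list of host,service pairs.

```python
def render_servicegroup(name, alias, host_names, service_description):
    """Return a servicegroup definition covering one service on many hosts.

    The members directive is one comma-separated list of host,service pairs.
    """
    pairs = []
    for host in host_names:
        pairs.append("%s,%s" % (host, service_description))
    lines = [
        "define servicegroup{",
        "        servicegroup_name       %s" % name,
        "        alias                   %s" % alias,
        "        members                 %s" % ",".join(pairs),
        "        }",
    ]
    return "\n".join(lines) + "\n"


# Example with made-up group and service names:
#     text = render_servicegroup("midtier-ping", "Midtier ping checks",
#                                ["newsjtcapp02", "newsjtcapp11"], "PING")
#     open("servicegroup_midtier_ping.cfg", "w").write(text)
```

Writing the result to its own file under a directory picked up by cfg_dir keeps the generated part separate from hand-edited host files, which also makes it easy to review in version control.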
Re: [Nagios-users] using nagios in my own website
The host/service status is stored in /usr/local/nagios/var/status.dat (or wherever your Nagios install is). This is a plain text file with the status of every host and service in it.

You can read it yourself however you want, but some languages may have a module for reading the file -- Perl has a Nagios::StatusLog module in CPAN, or often via your distribution (with Ubuntu: apt-get install libnagios-object-perl).

You could also, I believe, look at NDO, which sends Nagios data out to a MySQL database.

--
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Work: 020 8225 8109
Mobile: 07736 481006
Room 1244, BBC Television Centre, Wood Lane, London, W12 7RJ

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of localyee
Sent: 04 September 2008 19:13
To: nagios-users
Subject: [Nagios-users] using nagios in my own website

Hi,

I'm using Nagios to monitor services. I just want to get the data from Nagios and don't want to use the Nagios web interface. How can I display the data in my own website, and how can I get the data through a Nagios interface? Can anyone give some suggestions? Your kind help would be greatly appreciated.

2008-09-05
localyee
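For reading status.dat yourself, a minimal parser sketch in Python follows. It assumes the usual layout of the file: blocks that open with a line such as "hoststatus {" or "servicestatus {" (the block names differ between Nagios versions), contain key=value lines, and close with "}". The function name is made up for this example.

```python
def parse_status(text):
    """Parse status.dat text into a list of dicts, one per block.

    Each dict holds the block's key=value pairs plus a "_type" entry with the
    block name (for example "hoststatus" or "servicestatus").
    """
    blocks = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if current is None:
            if line.endswith("{"):
                current = {"_type": line[:-1].strip()}
        elif line == "}":
            blocks.append(current)
            current = None
        elif "=" in line:
            # Split on the first "=" only; plugin output may contain more.
            key, value = line.split("=", 1)
            current[key] = value
    return blocks


# Example: list every service that is not OK (current_state 0 means OK).
#     with open("/usr/local/nagios/var/status.dat") as handle:
#         for block in parse_status(handle.read()):
#             if block["_type"].startswith("service") and block.get("current_state") != "0":
#                 print(block["host_name"], block["service_description"])
```

All values are kept as strings, so numeric fields such as current_state need converting before any arithmetic.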
Re: [Nagios-users] multiple nagios instances on one machine
But if people want different additions to Nagios, or want tweaks to the CGIs (we have a link into our documentation system, for example), etc., you may want to keep them completely separate. Running Nagios in a VM would seem the obvious answer though.

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Marc Powell
Sent: 01 September 2008 16:26
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] multiple nagios instances on one machine

On Sep 1, 2008, at 8:12 AM, Steven Battaille wrote:
> Hello
>
> Is there anyone who can tell me if it is possible to run more than one instance of nagios on the same machine? This could be very handy to have different nagios installations for different customers.

Seems a lot more complicated than the Nagios way of handling that with CGI authorization. When logged in, users are only able to see the hosts and services they are a contact for. It works well here for many thousands of contact groups...

--
Marc
Re: [Nagios-users] single command, multiple disks to monitor
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Chandra Bahadur

> Hi
>
> I would like to monitor two partitions (/dev/sda1 and /dev/sda3) with the same command. How do I specify those? Is it by separating with commas or by just a space as below?
>
>     command[check_disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -p /dev/sda3 /dev/sda1
>
> or
>
>     command[check_disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -p /dev/sda3,/dev/sda1

Repeat the -p option once per partition:

    check_disk -w 10% -c 5% -p /dev/sda3 -p /dev/sda1 -p /mnt/foo

This warns if any of the listed partitions has less than 10% free (and goes critical below 5%).

Run check_disk -h for other things you can do.
Re: [Nagios-users] ADSL modem checking
I don't have any PCs with default routes out of two network cards to test, but perhaps using

    ping -I eth0 www.yahoo.co.uk

and

    ping -I eth1 www.yahoo.co.uk

would work?

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Work: 020 822 58109
Room 1244 Television Centre, Wood Lane, London, W12 7RJ

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of J. Bakshi
Sent: 27 August 2008 12:13
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] ADSL modem checking

Hi all,

Nagios is running here on my local server with ssh, mysql, smtp and http monitoring. Now I would like to use *nagios* to monitor my 2 ADSL modems. The environment is as below:

1. My Linux server has 3 LAN cards: eth0 (connected to a hub), eth1 and eth2.

2. There is an ADSL modem 220BX connected to eth1 (8MB connection used as primary). This modem provides the Internet connection from provider1 (say). Another ADSL modem, a TAD100, is connected to eth2 (512 KB connection used as backup). This modem provides the Internet connection from provider2 (say).

3. A simple check_ping to each modem IP can monitor the accessibility of both modems. BUT how can I check that the internet connection is really provided by the modems? A simple ping to yahoo.com can't solve the problem, as only one modem is used as the default route at a particular time. How can I check that the internet is really present on the other modem, the one not used as default route?

In other words, say eth1 is used as the default route and the traffic is going through ADSL modem 220BX. Then how can I check that the internet connection is alive at eth2?

Many thanks in advance.

--
Joydeep Bakshi, Linux System Admin
Kolkatainfoservices Pvt Ltd, 23A Royd Street, Kolkata 700016, India
Work Phone 91 033 40014784
http://infoservices.in/
Re: [Nagios-users] (Nagios-Ping)
From: Paulus, Jake [mailto:[EMAIL PROTECTED]]
Sent: 27 August 2008 13:50
To: Paul Weaver; Nagios Users Mailinglist
Subject: RE: [Nagios-users] (Nagios-Ping)

> I have this issue as well. In my case I believe it exists because servers are improperly multi-homed; this is a server configuration problem, not a network one. I have brought this issue to the server administrator's attention and they are unaware of any impact (probably because TCP retries are hiding this) it causes to users and so have chosen to take no action.
>
> -Jake

I'd be tempted to believe this is the case in two of the worst offenders -- both outside provided machines -- but one of the machines only has a single connection to a switch, so the compass swings back to some OSPF/HSRP issue.

As to people ignoring issues that TCP hides, we have that problem. A friend from a fairly large company was suffering from a 3% packet loss on one switch last week, which caused havoc with a UDP based system they had. The network guys refused to believe there was a problem because Outlook was still working (even 80% packet loss doesn't affect Exchange -- it's slow and crashy enough anyway).
Re: [Nagios-users] expire a passive check result.
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Rui Miguel Silva Seabra

Hello, When using passive checks, you *should* do the following:

    define service {
        ...
        check_freshness         1
        freshness_threshold     660
        ...
    }

Helpful, though sparse, hints? :D
Rui

I've been playing with passive checks recently. We have some trap-style alerts. Most of these traps we can back up with a polling check, e.g.

1) A disk fails: we get a notification instantly, but we can then SNMP-check the machine to confirm the disk has failed, and note when it's fixed.
2) A server reboots: we get a trap, but an uptime check would confirm the situation and allow the service-status page to show an error on the server until it has been running for 15 minutes (we have servers that get into reboot loops, and it's nice to know how long the server is out for).

In those instances, I believe the best solution is to have a service set up for normal polling every x minutes, but accept external service state commands to force it to critical/warning (or just force a check).

Some errors, though, we want to show on the service problems page for x minutes (as they can lead to issues, and it's handy to have a red blob to point you in a possible direction), but we have no way of knowing when the fault is fixed. Some of our servers don't accept any kind of polling for disk/fan/etc states, but do send traps.

I know that the alert history can show these problems, and we can set the service to be volatile (?), but that's a different page, which involves support people taking time out of their youtubing to look for errors.

For now, I have these logged to a database, and have a service that looks at the results in the table for the last n minutes, which isn't ideal.

What I'd like is a semi-volatile option -- a passive service that, when triggered, remains warning/critical for m minutes before returning to OK.

I was thinking of firing an event handler to do "sleep 600; set_service_to_ok", with some form of locking. Would that be the right solution?
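The "semi-volatile" behaviour asked for above can be sketched as a small hold timer. This is only an illustration in Python, not Nagios code: the function name, state constants and the 600-second window are assumptions taken from the "sleep 600" idea. It may also be worth noting that the quoted check_freshness hint points at a similar effect: once no passive result has arrived within freshness_threshold seconds, Nagios runs the service's check_command, so a check_command that always returns OK would reset the service without an event handler.

```python
import time

# Nagios plugin return codes.
OK, WARNING, CRITICAL = 0, 1, 2


def held_state(trigger_state, trigger_time, hold_seconds, now=None):
    """Return the state to display for a trap-driven service.

    The trap's state is held for hold_seconds after trigger_time,
    after which the service is considered OK again.
    (Hypothetical helper, for illustration only.)
    """
    if now is None:
        now = time.time()
    if now - trigger_time < hold_seconds:
        return trigger_state
    return OK


if __name__ == "__main__":
    # A CRITICAL trap arrived 5 minutes ago; with a 10 minute hold it still shows.
    print(held_state(CRITICAL, time.time() - 300, 600))
```

Computing the state from the trigger time avoids a sleeping process per trap, so no locking is needed if a second trap arrives during the hold window: the newer trigger time simply replaces the older one.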
Re: [Nagios-users] check_disk_smb question
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Assaf Flatto
Sent: 20 August 2008 13:13
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] check_disk_smb question

Did you define the command to go with it? Something like this?

    define command{
        command_name    check_disk_samba
        command_line    $USER1$/check_disk_smb -H $HOSTADDRESS$ -s Company -u foo -p bar -w $ARG1$ -c $ARG2$
    }

See if you have it in the commands or checkcommands file.

The way Nagios works:

1) You define a host
2) You define a command
3) You define a service, which belongs to a host, and has a command

In this case you'd want something like this. It will allow you to monitor any share on any host, with a username/password of foo/bar.

Commands.cfg:

    define command{
        command_name    check_disk_samba
        command_line    $USER1$/check_disk_smb -H $HOSTADDRESS$ -s $ARG1$ -u foo -p bar -w $ARG2$ -c $ARG3$
    }

Host.cfg:

    define service {
        use                     generic-service
        host_name               harrold
        service_description     SMB
        check_command           check_disk_samba!Company!80!90
    }

Note that the name in check_command must match the command_name exactly (check_disk_samba here).

(As an aside, the version of check_disk_smb I have (or rather the version of smbclient) seems to fail once the file share goes over about 2TB, so I have to use SNMP in those cases. However, that's fairly irrelevant.)
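To make the host/command/service relationship above concrete, here is a rough Python model of how a check_command such as check_disk_samba!Company!80!90 is turned into a command line. It is a simplified sketch, not how Nagios is implemented: the function name, the libexec path and the 192.0.2.10 address are made up for illustration.

```python
def expand_check_command(check_command, commands, macros):
    """Expand 'name!arg1!arg2...' using the named command_line template.

    commands maps command_name -> command_line.
    macros maps macro names such as '$HOSTADDRESS$' to their values.
    (Simplified model for illustration only.)
    """
    name, *args = check_command.split("!")
    line = commands[name]  # KeyError here mirrors an undefined command name
    values = dict(macros)
    for position, arg in enumerate(args, start=1):
        values["$ARG%d$" % position] = arg
    for macro, value in values.items():
        line = line.replace(macro, value)
    return line


if __name__ == "__main__":
    commands = {
        "check_disk_samba": "$USER1$/check_disk_smb -H $HOSTADDRESS$ "
                            "-s $ARG1$ -u foo -p bar -w $ARG2$ -c $ARG3$",
    }
    # Hypothetical macro values.
    macros = {"$USER1$": "/usr/local/nagios/libexec", "$HOSTADDRESS$": "192.0.2.10"}
    print(expand_check_command("check_disk_samba!Company!80!90", commands, macros))
```

The lookup by name is the reason a service that refers to a command name not present in the commands file cannot be checked.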
Re: [Nagios-users] check_snmp_storage ..... ERROR : Unknown storage /var
1) Are you sure that /var is a separate partition? Run

    df

and look at the "Mounted on" column.

2) Run

    snmpwalk -v2c -c public 127.0.0.1 hrStorageDescr

and see what partitions SNMP lists.

--
Power corrupts. Power failure corrupts absolutely
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Agnello George
Sent: 19 August 2008 14:19
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] check_snmp_storage . ERROR : Unknown storage /var

Hi all,

I have installed the check_snmp_storage plugin but I cannot get the output for /var space. I can, however, get the output for the / partition. Following is the command:

    ./check_snmp_storage -H 127.0.0.1 -C public -m / -w 80% -c 90%
    OK : /: 32%used(327MB/1024MB) : 80 % | /=326MB;819;921;0;1024

but when I try this:

    ./check_snmp_storage -H 127.0.0.1 -C public -m /var -w 80% -c 90%
    ERROR : Unknown storage /var

Does anyone have a solution for this?

Thanks a million!

--
Regards
Agnello D'souza
www.linux-vashi.blogspot.com
www.bible-study-india.blogspot.com
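The second step above (listing what SNMP reports and checking whether /var is among the entries) can be sketched in Python. The sample text below is invented for illustration and only imitates the usual net-snmp "hrStorageDescr.N = STRING: ..." output shape; the function name is hypothetical.

```python
import re

# Invented sample in the usual net-snmp output shape; not real data from the thread.
SAMPLE_WALK = """\
HOST-RESOURCES-MIB::hrStorageDescr.1 = STRING: Physical memory
HOST-RESOURCES-MIB::hrStorageDescr.31 = STRING: /
HOST-RESOURCES-MIB::hrStorageDescr.35 = STRING: /boot
"""


def storage_descriptions(snmpwalk_output):
    """Map hrStorage index -> description from snmpwalk text output."""
    pattern = re.compile(r"hrStorageDescr\.(\d+) = STRING: (.*)")
    found = {}
    for line in snmpwalk_output.splitlines():
        match = pattern.search(line)
        if match:
            found[int(match.group(1))] = match.group(2).strip()
    return found


if __name__ == "__main__":
    names = storage_descriptions(SAMPLE_WALK)
    # If /var is not a separate filesystem, it will not appear as its own entry.
    print("/var" in names.values(), sorted(names.values()))
```

If /var is missing from the list, the "Unknown storage /var" error is what one would expect, since the plugin can only match names that the agent actually exposes.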
Re: [Nagios-users] Monitoring Solaris via SNMP?
From: Serafin, Chris
Sent: Monday, August 18, 2008 4:01 PM
To: Nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Monitoring Solaris via SNMP?

I'm trying to monitor CPU, memory and disk space via SNMP on a bunch of Solaris machines and running into errors (nothing returned from check_snmp_cpu.pl). Is anyone monitoring Solaris this way, and if so how/what plugin?

From: Doug Veldhuisen

Are you using the Solaris snmp stack or the net-snmp version on your Sun box? There is a difference, a big difference, in the data obtained. I think what you're looking for comes from the Net-SNMP stack. Not totally sure; I have a couple of Sun boxes I plan on putting up under Nagios myself.

I monitor about a dozen Solaris boxes using a variety of custom plugins (as well as things like check_tcp, check_ssh, ping etc), but SNMP-wise I use:

1) a check_hd one [www.peters-webcorner.de/nagios], which I then modified to add support for multiple OSes, reconfigure the output, make the help clearer, and add support for SNMPv3
2) a check_process_by_snmp, which (from the style) I think I wrote from scratch before I started storing my libexec directory in Subversion; it checks that a process is running (optionally with a parameter)
3) a check_mem_by_snmp, another internal one, which checks the free memory on a box

On the machine I use net-snmp 5.0.9 in /usr/sfw/sbin/snmpd. I think (I'm not a Solaris admin) it's attached to snmpdx -- the Sun Solstice Enterprise Master Agent supplied by SUNWmibii.

You can run

    snmpwalk -v2c -cpublic hostname .1

to see what fun information is available.

--
Power corrupts. Power failure corrupts absolutely
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News
Re: [Nagios-users] nagvis ideas?
It seems to me that there are 2 basic ways to view our conglomeration of systems. One is geographic, and the 2nd is from a network topology point of view. Frankly I think a NagVis presentation of both would be useful. The network topology version is fairly straightforward (I think, but I would like to hear others' points of view on this). The geographic version, however, I am frankly struggling with. If I show the systems on the large area map, they are fairly spread out. I am thinking of doing that, and making dynamic links for each cluster. These links would be color coded based upon the status of all of the computers in that cluster, and would provide a link to a more detailed view. I'd love to hear what other people's thoughts on this are.

Our old monitoring solution was a product called SolarWinds. The main view was a map which had little icons for various machines (it didn't monitor a lot more; great for monitoring all the interfaces on the network, but useless for system monitoring).

I felt the map was pointless. If something did break, you would get a little red dot in the corner.

When I implemented Nagios I decided to ignore NagVis for a bit. The only map I can see being of use would be something like

http://www.nagvis.org/sites/default/files/screenshots/c_by_dave_rearden_2.png

if you had a monitor hanging in the apps room. But even then, we rarely have hardware problems, and if we do, the machine will have a flashing red LED. And even if it didn't, they're all labelled.

If you are truly physically separated (not just different rooms in the same building, on the same power feed),

http://www.nagvis.org/sites/default/files/screenshots/nagvis_map_2.png

could show you areas where there are invading tanks knocking out your infrastructure, but a network topology might be better. (Of course if you use Nagios to monitor your country's defence systems, a physical map might be a good way of doing it.)

1 of the 3 Nagios installations I know about here has a default screen of service problems. Our installation has a custom-written screen which highlights problems affecting users. The other isn't a real Nagios installation; it was provided by an external company -- it barely monitors hosts, and only does about 20 anyway, so the statusmap is fine for that.

I'd be interested to know how many people find maps helpful for anything other than impressing management.
Re: [Nagios-users] Config documentation
You want documentation for the plugins -- Nagios doesn't do any checking by itself, so it depends where you get them from.

http://nagios.sourceforge.net/docs/3_0/plugins.html is a starting point.

For your specific requirement, you should probably look at the check_tcp plugin.

--
Watch out for that tree
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Jim Miller
Sent: 18 June 2008 15:57
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Config documentation

Can anyone tell me where I might find documentation to tell how to monitor different aspects of a server? For example, I want to monitor my servers to ensure they are always responding on port 104. How do I do that? Is there somewhere I can look that will explain all the options available to me?

Thanks
Jim
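For the port 104 question, the kind of test a TCP plugin performs can be sketched in a few lines of Python. This is not the check_tcp plugin itself, only a minimal illustration of the idea under the usual plugin convention (exit code 0 for OK, 2 for CRITICAL); the function name and messages are made up.

```python
import socket
import sys

# Nagios plugin return codes.
OK, CRITICAL = 0, 2


def check_tcp_port(host, port, timeout=5.0):
    """Try to open a TCP connection and return (status, message)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return OK, "TCP OK - port %d is accepting connections" % port
    except OSError as exc:  # refused, unreachable, timed out, ...
        return CRITICAL, "TCP CRITICAL - port %d: %s" % (port, exc)


if __name__ == "__main__":
    # Usage: script.py HOST [PORT]; port 104 is the one from the question.
    host = sys.argv[1] if len(sys.argv) > 1 else "127.0.0.1"
    port = int(sys.argv[2]) if len(sys.argv) > 2 else 104
    status, message = check_tcp_port(host, port)
    print(message)
    sys.exit(status)
```

In practice the stock plugin is the better choice, since it also handles response-time thresholds and send/expect strings.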
Re: [Nagios-users] Hosts w/o services
You can use check_ping -- I assume the boxes are pingable, and you want to check they are pingable, even if they do nothing else that can be monitored.

If you don't want to check they work via ping, why are you even modelling them?

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster
Sent: 16 April 2008 19:01
To: Nagios Users Mailinglist
Subject: [Nagios-users] Hosts w/o services

Running Nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the Nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks.

---
Israel Brewster
Computer Support Technician
Frontier Flying Service Inc.
---
Re: [Nagios-users] SNMP disk checking in Windows NTFS mounted system
The NTFS mount point file system in Windows is a file system mounted in a subdirectory, but still a mounted file system. But I got your point. Do you know any other way to get the free space percentage in this kind of scenario?

A couple of options:

1) Create a second mount point for the drive as a letter -- I have a server that has C:\mysql\data as a separate drive. I've additionally mounted this as M:, and can use SNMP to look at the M: drive. I've had issues with SNMP on Windows 2000 before; it sometimes fails to report the correct value.

2) You can also use Samba: pass in a host, share, username and password, and the plugin uses smbclient to do a DIR. Seems to work for me. I knocked up a plugin for internal use which does this; basically it does:

    open(PH, "smbclient //$host/$share -U$username \"$password\" -c \"dir\" 2>/dev/null |");
    while (<PH>) { $lastLine = $_; }

It then checks for /NT_STATUS_BAD_NETWORK_NAME/, /NT_STATUS_ACCESS_DENIED/ and no output, then checks for the "nnn blocks of size xx. yyy blocks available" line. If it finds it, it takes the percentage used as 100*(blockused/(blockused+blockfree)), which gives me x%.

It then returns this value, and nagiosgrapher parses it to plot a graph.

(Don't ask why we have MySQL on a Windows server!)
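The parsing step described for the smbclient approach can be sketched in Python. This is an illustration only, not the internal Perl plugin: the function name is hypothetical, and it assumes the summary line has the form "N blocks of size S. M blocks available", so that used blocks are total minus available, which is the same ratio as the 100*(blockused/(blockused+blockfree)) formula in the message.

```python
import re


def percent_used(last_line):
    """Return percent used from smbclient's final 'blocks' summary line.

    Returns None when the line is not a summary line (for example an
    NT_STATUS_* error, or no output at all).
    """
    match = re.search(
        r"(\d+) blocks of size (\d+)\. (\d+) blocks available", last_line or "")
    if match is None:
        return None
    total, _block_size, available = (int(group) for group in match.groups())
    if total == 0:
        return None
    return 100.0 * (total - available) / total


if __name__ == "__main__":
    # Invented sample line, for illustration only.
    print(percent_used("\t\t1000 blocks of size 1024. 250 blocks available"))
    print(percent_used("tree connect failed: NT_STATUS_BAD_NETWORK_NAME"))
```

A real plugin would map the None case to an UNKNOWN or CRITICAL result rather than silently ignoring it.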
Re: [Nagios-users] help : Nagvis
If your normal Nagios installation is at http://foo.com/nagios/, the htmlbase is /nagios/nagvis.

If it's at http://foo.com/mynagios, htmlbase is /mynagios/nagvis.

If you've set up Apache to make http://foo.com/ the place to go to for the Nagios web client, your htmlbase is /nagvis.

It's the absolute URL path of NagVis, which is a subdirectory off your normal Nagios web client.

--
Power corrupts. Power failure corrupts absolutely
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
Sent: 09 April 2008 13:12
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] help : Nagvis
Importance: High

Hi All

I am installing NagVis. While editing nagvis.ini.php, I have come across the following settings. For the base value I have given the path /usr/local/nagios/share/nagvis-1.2.2, but I am unable to proceed for htmlbase and htmlcgi as I am unable to locate /nagios/nagvis and /nagios/cgi-bin.

I would highly appreciate someone's help on these paths.

In this section are all needed path variables defined. This example shows all values of the paths section:

    [paths]
    base=/usr/local/nagios/share/nagvis/
    htmlbase=/nagios/nagvis
    htmlcgi=/nagios/cgi-bin

    Value      Default                            Description
    base       /usr/local/nagios/share/nagvis/    Absolute physical NagVis path
    htmlbase   /nagios/nagvis                     Absolute html NagVis path
    htmlcgi    /nagios/cgi-bin                    Absolute html NagVis cgi path
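The rule in the reply (htmlbase is the URL path of the Nagios web client with /nagvis appended) can be written as a tiny Python helper. The function name is hypothetical, and it assumes NagVis really is served from a "nagvis" subdirectory of the Nagios web root, as the reply describes.

```python
from urllib.parse import urlparse


def nagvis_htmlbase(nagios_url):
    """Derive the NagVis htmlbase from the URL of the Nagios web client."""
    path = urlparse(nagios_url).path.rstrip("/")
    return path + "/nagvis"


if __name__ == "__main__":
    for url in ("http://foo.com/nagios/", "http://foo.com/mynagios", "http://foo.com/"):
        print(url, "->", nagvis_htmlbase(url))
```

The point of the exercise is that htmlbase and htmlcgi are URL paths as seen by the browser, not directories on disk, which is why they cannot be "located" in the filesystem.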
[Nagios-users] check_hd (free diskspace via snmp) reporting wrong values
I use a slightly modified check_hd plugin (changed to allow checks of /var as well as c:) to check disk space on both Windows and Unix. It works fine for the most part, except for a Windows machine with 2 partitions.

The E: partition report is fine:

    OK: hd e:\.* 1% full, 27.4GB free. [EMAIL PROTECTED], [EMAIL PROTECTED]

The C: partition report wasn't:

    OK: hd c:\.* 83% full, 0.7GB free. [EMAIL PROTECTED], [EMAIL PROTECTED]

Investigating revealed the plugin was working correctly on the values it was given:

    DBG: fullsize 8388447 * 512 = 4294884864
    DBG: usedsize 6984708 * 512 = 3576170496

However the C: drive wasn't using 3.5G, it was using about 210*6984708

So I doubt it's the allocation unit that's the problem.

It now reports the correct value, which may be linked to a restart of the SNMP service:

    OK: hd c:\.* 65% full, 1.4GB free. [EMAIL PROTECTED], [EMAIL PROTECTED]

Has anyone seen this kind of behaviour before?
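The debug figures above can be reproduced with a short calculation, which supports the point that the plugin's arithmetic was fine and the odd value came from what SNMP returned. The Python below is a sketch of that arithmetic only; the function name is made up, and the inputs are the unit counts and 512-byte allocation unit shown in the debug lines.

```python
def storage_usage(size_units, used_units, allocation_unit):
    """Convert SNMP storage unit counts to bytes, percent full and free GB."""
    total_bytes = size_units * allocation_unit
    used_bytes = used_units * allocation_unit
    percent_full = 100.0 * used_bytes / total_bytes
    free_gb = (total_bytes - used_bytes) / 1024 ** 3
    return total_bytes, used_bytes, percent_full, free_gb


if __name__ == "__main__":
    total, used, percent, free_gb = storage_usage(8388447, 6984708, 512)
    # Matches the DBG lines: 4294884864 and 3576170496 bytes.
    print(total, used)
    # Matches the report: about 83% full, about 0.7GB free.
    print("%.0f%% full, %.1fGB free" % (percent, free_gb))
```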
Re: [Nagios-users] Nagios and cluster setup...few questions
Lacking the time and knowledge to understand monitoring clusters, I concocted a home-brew web page that has the concept of a virtual service, and virtual service groups. It's all configured in an XML file.

I have a virtual service called clipcache 01, one called clipcache 02, and so on. Each service has two checks pulled from the Nagios status file, using hostname and service description. If both are OK, then the virtual service is OK, otherwise it's critical.

I then have a group called Clipcaches, which looks at the number of virtual services, and is critical if none are running, warning if 1 is running, or OK if 3 or more are running. A group called live system monitors the Clipcaches group, amongst others, and exports its status back up. It also looks for scheduled downtime and acknowledgements.

Another group that's a member of live system is midtier, which consists of checks of the virtual service Main midtier (which monitors a certain process on one of 3 machines; it must be on one, and one only, to be OK), Search instance (which is OK if 2 instances are found on one of 4 machines, warning if 1, and critical if 0), and a few other checks.

The program then displays this as a tree on a web page, expanding branches with problems. It gives a quick, comforting overview of the whole system, while Nagios' Service Problems page gives a list of things to fix (which might not be of immediate importance to the overall health of the system, but need fixing anyway).

It does it using Perl's Nagios::StatusLog module. No idea how well it scales, and I'm sure there's a better way of doing it. It's definitely a work in progress, and has made me think a lot more about defining system health.

--
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Tarak Patel
Sent: 09 October 2007 14:32
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Nagios and cluster setup...few questions

Hi all,

Here is a quick background of my current setup for monitoring:

I have an in-house tool monitoring clusters. The tool simply uses ssh to launch Perl scripts on remote machines and grabs all of the output to store it at a central location in a logfile. This output is parsed for any pre-defined tags (WARNING/CRITICAL/ERROR). If any of these tags are noticed, the message is logged using syslog.

The scripts residing on remote hosts are a collection of Perl functions. Each one is executed one after another. Some of these functions utilize a status file from the previous run to verify whether the state of items changed from last time. Some of these functions can be given a special argument to set the current state as the default state for the next iteration of checks.

Clusters are monitored from the head nodes since not all nodes are accessible from the central location. Head node checks contain a special function that simply uses DSH to launch checks on all nodes.

After looking at Nagios and its check_cluster plugins I realized I would really like to monitor each of the nodes individually, since I want to be able to disable a particular check on a particular node. Also I want to be able to use status files for some of the checks. As of now I have yet to find any plugin that utilizes a status file to monitor hosts. All plugins simply use current output from commands to verify the status.

I will be using active checks on the clusters, therefore I will configure NRPE on all nodes. My plan of attack was to simply use the head node as a gateway, with all nodes and services defined on the head node (under NRPE). From the central location I can simply execute a check_nrpe type script to verify backend nodes. I still haven't figured out how I can use status files from each iteration of checks to validate status.

I'd appreciate some input as to what the best options are for monitoring clusters where backend nodes are hidden from the central monitoring server. Also some help with the use of state files.

Thanks all,
TP.
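The virtual service and group rules in the reply above boil down to counting how many members are OK and comparing against thresholds. The Python below is a sketch of that idea only, not the home-brew Perl tool; the function names and threshold parameters are hypothetical, and the example uses the "Search instance" rule from the message (OK at 2, warning at 1, critical at 0).

```python
# Nagios state codes.
OK, WARNING, CRITICAL = 0, 1, 2


def virtual_service_state(member_states):
    """OK only if every underlying check is OK, otherwise CRITICAL."""
    return OK if all(state == OK for state in member_states) else CRITICAL


def group_state(member_states, warn_below, crit_below):
    """Grade a group by the number of members that are OK.

    Fewer than crit_below running -> CRITICAL,
    fewer than warn_below running -> WARNING, otherwise OK.
    """
    running = sum(1 for state in member_states if state == OK)
    if running < crit_below:
        return CRITICAL
    if running < warn_below:
        return WARNING
    return OK


if __name__ == "__main__":
    # One search instance found on four machines -> WARNING under the rule above.
    print(group_state([OK, CRITICAL, CRITICAL, CRITICAL], warn_below=2, crit_below=1))
```

Groups can be nested by feeding the result of one group_state call into the member list of a parent group, which is roughly how the "live system" tree is described.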
Re: [Nagios-users] FW: Nagios: Error while writing to pipe
I'm no expert in Python or passive results, however you appear to be opening the file for write access. Have you tried opening it for append ('a' rather than 'w', perhaps)?

--
Even if you're on the right track, you'll get run over if you just sit there.
Paul Weaver
Systems Development Engineer
News Production Facilities, BBC News

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
Sent: 25 September 2007 15:39
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] FW: Nagios: Error while writing to pipe

Hi All,

I am new to Nagios. I am trying to receive SNMP traps from a Windows client machine. I am using a Python script to see how Nagios can create an event/alert by checking the passive service check result submitted by this script.

I have attached the Python script which is used to write the passive service check result to the Nagios external command file for creating an alert/event. But when I run this script:

    [EMAIL PROTECTED] eventhandlers]# pwd
    /usr/local/nagios/libexec/eventhandlers
    [EMAIL PROTECTED] eventhandlers]# python snmptraphandling.py 10.207.40.203 CRITICAL 12 2
    [EMAIL PROTECTED] eventhandlers]#

I get the following error:

    [EMAIL PROTECTED] rw]# tail -f nagios.cmd
    [1190731734] PROCESS_SERVICE_CHECK_RESULT;10.207.40.203;snmp_trap_handling_critical;2 ; 12
    tail: nagios.cmd: file truncated
    tail: nagios.cmd: cannot seek to offset 0: Illegal seek
    [EMAIL PROTECTED] rw]#

nagios.cmd is a pipe:

    [EMAIL PROTECTED] rw]# ls -l
    total 0
    prw-rw---- 1 nagios nagcmd 0 Sep 25 20:18 nagios.cmd
    [EMAIL PROTECTED] rw]# pwd
    /usr/local/nagios/var/rw

Why am I getting the "Illegal seek" error? I have tried pointing this script at a normal read/write file and it works fine, but while writing to a pipe it shows the error.

If the problem is due to the script then it could be here:

    output = open('/usr/local/nagios/var/rw/nagios.cmd', 'w')
    print return_code
    results = "[" + mytime + "] " + "PROCESS_SERVICE_CHECK_RESULT;" \
        + host + ";" + "snmp_trap_handling_" + service_suffix + ";" \
        + return_code + ";" + mondata_res + "\n"
    output.write(results)

It could be a file (pipe) permission problem, I have no idea. I have followed the link below to give permission to nagios.cmd:

http://nagios.sourceforge.net/docs/2_0/commandfile.html

Thanks & Regards,
Pankaj
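For comparison with the fragment quoted above, here is a small Python 3 sketch of building and submitting a passive result. It is not the poster's script: the function names are hypothetical, and the command file path is simply the one from the thread. Two hedged observations: the "cannot seek ... Illegal seek" text in the transcript is printed by tail, which cannot seek on a FIFO, and the command line did arrive intact, so the write itself appears to have worked; and reading the pipe with tail means tail, not Nagios, consumes what was written.

```python
import time


def format_passive_result(host, service, return_code, output, timestamp=None):
    """Build one PROCESS_SERVICE_CHECK_RESULT external command line."""
    if timestamp is None:
        timestamp = int(time.time())
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s\n" % (
        timestamp, host, service, return_code, output)


def submit(command_file, line):
    """Write one complete command line to the external command file."""
    # Opening a FIFO for writing blocks until a reader (Nagios) has it open.
    with open(command_file, "w") as pipe:
        pipe.write(line)


if __name__ == "__main__":
    line = format_passive_result(
        "10.207.40.203", "snmp_trap_handling_critical", 2, "12")
    print(line, end="")
    # Uncomment on a host where Nagios is running (path taken from the thread):
    # submit("/usr/local/nagios/var/rw/nagios.cmd", line)
```

Keeping the fields free of stray spaces around the semicolons is worthwhile, since the service description has to match the one in the Nagios configuration.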
[Nagios-users] Best way to monitor application clusters
I've recently started using Nagios in our development environment, and have knocked up a few plugins for some of our programs (e.g. monitoring a log on a remote server to make sure it's growing, but not growing too fast or too slow, or jumbo pings between two remote machines), which is very impressive.

One thing I would like to monitor is a group of hosts/services, and flag a warning if x% are not available, and a critical if y% are offline.

A common example would be checking DNS services. If you have 4 DNS servers, you don't want to be woken up at 3AM if one falls offline, but if 3 are offline you would, and if 4 are offline you want an APB. You still want to see on a web page that the servers are offline, though, and possibly get a notification in work hours.

I'm aware of host/service groups being one way of doing it, however I'm unsure if notifications can be set based on the % of hosts/services available in a group.

Another way would be a virtual host, with a custom check_host_alive which checks all hosts in a collection, and returns an OK/critical/warning based on the number of failures, and likewise with virtual services. The original hosts could then be monitored separately, or even not at all.

For example, a service I would like to check is whether 3 MySQL databases are in sync with each other. I currently have a web page that compares the log positions. It seems to me that logically the service should run on the MySQL boxes, however I only want it running on

Another example would be that I have a piece of Java software (call it A) that must run on at least one of 4 machines, and preferably on 2 of them. I don't care which machine it's on, but if it's not running I want to be notified in red lights. I could have a simple virtual service A, which would go critical on 0, warn on 1 and be OK on 2 or more. This would be attached to virtual host A, which would go critical on 0, warn on 1 and be OK on 2 or more of the servers that the service runs on.

I'd also like a simple login to the web page which would only display the clusters of services/hosts, rather than the total view, which would allow our support engineers to easily see real problems, and allow management to sleep happily with lots of green lights.

I must admit I'm leaning towards the virtual host/service thing, but I was wondering if there's a standard/better way of monitoring these kinds of things?

Thanks
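The "warning if x% are not available, critical if y% are offline" rule from this message can be sketched as a small Python function. This is an illustration of the thresholding only, with hypothetical names and threshold values; the check_cluster plugin mentioned elsewhere in this archive is the usual place to look for a ready-made equivalent.

```python
# Nagios state codes.
OK, WARNING, CRITICAL = 0, 1, 2


def cluster_state(total, unavailable, warn_percent, crit_percent):
    """Grade a cluster by the percentage of members that are unavailable."""
    if total <= 0:
        raise ValueError("cluster must contain at least one member")
    percent_down = 100.0 * unavailable / total
    if percent_down >= crit_percent:
        return CRITICAL
    if percent_down >= warn_percent:
        return WARNING
    return OK


if __name__ == "__main__":
    # The 4 DNS server example with assumed thresholds:
    # one down is only a warning, three or more down is critical.
    for down in range(5):
        print(down, "down ->", cluster_state(4, down, warn_percent=25, crit_percent=75))
```

Attaching different notification settings to the WARNING and CRITICAL states of such a virtual service would give the "visible on the web page, but only wake me at 3AM when most are down" behaviour described above.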