Re: [Nagios-users] nagios/opsview with qpage
Darren, I don't know if this helps..but I use qpage for nagios and other *nix alerts. I call qpage directly from /usr/local/bin..here is my info command.cfg # 'host-notify-by-sms' command definition define command{ command_namehost-notify-by-sms command_line/usr/local/bin/qpage -s localhost -p name1,name2,name3 -f $NOTIFICATIONTYPE$ - Host $HOSTALIAS$ is $HOSTSTATE $ } # 'service-notify-by-sms' command definition define command{ command_nameservice-notify-by-sms command_line/usr/local/bin/qpage -s localhost -p name1,name2,name3 -f $NOTIFICATIONTYPE$ - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ } contacts.cfg define contact{ contact_nameITpager alias Operations On-Call service_notification_period 24x7 host_notification_period24x7 service_notification_optionsw,u,c,r,f host_notification_options d,u,r,f service_notification_commands service-notify-by-sms host_notification_commands host-notify-by-sms email administra...@whatever.huh } qpage's running process: qpage -C /etc/qpage.cf -q 15 On Tue, 2010-05-25 at 13:16 -0400, Darren Hill wrote: Hi, I've seen a few examples of qpage working with nagios (I'm using opsview as a frontend to nagios). Qpage is working ok manually, but when I try and have nagios send the page nothing comes out. I'm using ubuntu 8.04 LTS server running opsview 3.7 Here's my misccommands.cfg, contacts.cfg define command{ command_name notify-by-pagerscript command_line /usr/local/nagios/libexec/notifications/qpage -s localhost -p $PAGER$ -f $HOSTNAME$ $SERVICEDESC$ `SERVICEOUTPUT$' $HOSTNAME$ } define contact { contact_nameadmin/01default alias Administrator use global-contact service_notification_period 24x7 host_notification_period24x7 notification_level 1 email hid...@whatever pager INETPager host_notification_commands notify-by-email,notify-by-pagerscript host_notification_options u,d,r,f service_notification_commands notify-by-email,notify-by-pagerscript service_notification_optionsw,c,r,u,f } Any ideas? qpage is running with /usr/local/bin/qpage -q 5 in the background waiting. Thanks! -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Telesync Switches is possible?
ReynierPM, We have alot of 12-24port Allied Telesyn switch's, but all of them are un-managed switches. If the switch you have is 'managed' (you can log into an ip address, or get snmp information from the switch) then nagios can monitor it with snmp. But in my experience, I have been unable to monitor un-managed switch's. I guess a workaround would be to monitor the port of a managed switch that the allied telesyn is plugged into for status (up/down). Jayson On Sat, 2009-12-12 at 09:43 -0500, ReynierPM wrote: Hi every: As the subject said: it's possible to monitoring a Allied Telesync switches? I have almost 3 of them in my infraestructure and need to be monitored but don't know how to do this. Any help? The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring external internet connectivity
Shadhin, I had our nagios ping google.com for awhile, but if something happens to the DNS server internally then it gave a false reading saying the internet was down when it was a DNS issue. Eventually I figured I could ping my ISP's IP address instead of hostname, and that seemed to work well. Im sure a place like google has more than 1 ip address associated with it, so I would pick an external IP address of a known site/company and plug it in. I also have SMS enabled/setup on my nagios machine so that when the internal network is down, an SMS message is generated and sent to the techie phones. (can't get email if the network is down, eh?) Hope this helps, Jayson Broughton From: shadih rahman [mailto:shadhi...@gmail.com] Sent: Tuesday, June 30, 2009 1:44 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] monitoring external internet connectivity All, What is the best way to check if our network has connection to commodity internet or not. What I am trying to achieve is to ensure internal network has connection to Internet. One thing is to ping google check. Is this a good idea or can someone suggest a better way to do this. Please advise on this. Thanks -- Cordially, Shadhin Rahman The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you.-- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring external internet connectivity
I gotta 2nd Marc on that. We did get permission from our ISP to ping their network, as we are usually the first to know when one of our 28+ networks go down in the country :-) I too would get permission. Because if they don't know who you are, some attentive little admin at x site might see your possibly mis-configured nagios server as a DOS attack against their network and block you. Thus leaving you with a false response that your network is down :-) Or even if it's not mis-configured, having your server pinging their server at regular intervals might seem abit off to someone that's monitoring logs. Jayson -Original Message- From: Marc Powell [mailto:m...@ena.com] Sent: Tuesday, June 30, 2009 2:05 PM To: Nagios Users Mail-list Subject: Re: [Nagios-users] monitoring external internet connectivity On Jun 30, 2009, at 2:43 PM, shadih rahman wrote: All, What is the best way to check if our network has connection to commodity internet or not. What I am trying to achieve is to ensure internal network has connection to Internet. One thing is to ping google check. Is this a good idea or can someone suggest a better way to do this. It is never a good idea to steal the resources of some third party for your own purposes, no matter how infinite you think those resources to be. If you and hundreds, thousands or millions of other people all think it's just fine to ping Google all the time, Google takes the brunt of that resource hit. I would recommend that you poll your router's snmp tables to get interface status (ifStatus, ifOperStatus). Alternately, place a test device or purchase a small colo server somewhere outside your network that you can use as the target for your tests; or work with someone with similar needs to arrange permission to ping devices on each other's network. -- Marc -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] socket timeout after 20 sec
Try opening a telnet session to that port (to test the firewall theory) telnet ip_windows_server 1248 you should get: Trying ip address Connected to ip address. Escape Character is '^\]'. Then Ctl-C to exit out of it (if your in linux that's what it looks like) 50 Seconds is pretty long. I have a server that is monitored by nagios, 2000+ miles away and 14 traceroute hops. It was taking longer than 20 seconds, so I set it to 60 seconds, and it takes ~30seconds..so I just doubled in case of network latency. ~Jayson -Original Message- From: Andreas Ericsson [mailto:a...@op5.se] Sent: Thursday, May 14, 2009 1:33 PM To: Anirudh Srinivasan Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] socket timeout after 20 sec Anirudh Srinivasan wrote: Hey Folks, I know socket timeout after 20 sec has been discussed so many time in all the forums and post, but still this is not so clear to me. Nagios server : RHEL 5 NC_NET used on Windows server. In startup.cfg the port is 12489 and the ip_passive is ip address of the nagios server. Even through the Linux cli i get the same error ./check_nt -H ip_windows_server -v FREEDISKSPACE -l C -w 80 -c 90 -p 1248 -t 50 CRITICAL - socket timeout after 50 sec I really have been breaking my head on this. Please suggest some idea or your experience There is a firewall blocking the traffic somewhere along the way, or you have managed to build a cyclic token-ring network (if you don't know what those are, consider yourself lucky and look over your firewalls). -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Register now for Nordic Meet on Nagios, June 3-4 in Stockholm http://nordicmeetonnagios.op5.org/ Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Checking Bacula Server
Eduardo, Bacula has a wiki that show's you how to setup active and passive checks via nagios. Go to the below link and check out 'General Howtos' http://wiki.bacula.org/doku.php?id=howtos ~Jayson From: Eduardo Barreto [mailto:lec...@gmail.com] Sent: Monday, April 27, 2009 2:15 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Checking Bacula Server Hi all, Does anybody know how to check a Bacula Server? I've installed check-bacula but it's not working fine. Anybody to give me a clue? Thanks in advance Eduardo The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you.-- Register Now Save for Velocity, the Web Performance Operations Conference from O'Reilly Media. Velocity features a full day of expert-led, hands-on workshops and two days of sessions from industry leaders in dedicated Performance Operations tracks. Use code vel09scf and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] downtime large number of hosts?
Shai, The example scripts are actual scripts that you can use..for example: http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command _id=36 #!/bin/sh # This is a sample shell script showing how you can submit the DISABLE_HOST_SVC_NOTIFICATIONS command # to Nagios. Adjust variables to fit your environment as necessary. now=`date +%s` commandfile='/usr/local/nagios/var/rw/nagios.cmd' /bin/printf [%lu] DISABLE_HOST_SVC_NOTIFICATIONS;host1\n $now $commandfile That is the 'example' script. Just create a file and name it whatever you want..for me I use the hostname in the file name. DISABLE_HOST_SVC_NOTIFICATIONS_HOSTNAME, then chmod the file for execution: chmod u+x DISABLE_HOST_SVC_NOTIFICATIONS_HOSTNAME Edit the file and replace host1 with the hostname that you want to affect, the hostname has to be spelled the same way that it is in your .cfg file. Now you can either run the file manually when you want a downtime, or set it to run in a cronjob. For you since you want a large list of hosts that you want to disable notifications, then try the disable hostgroup script. ~Jayson From: Shai [mailto:sha...@gmail.com] Sent: Tuesday, January 13, 2009 8:53 AM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] downtime large number of hosts? Hi, I've seen the help on the nagios external command, but I'm not exactly sure how to make the script. Can someone show me an example? Shai On Fri, Jan 9, 2009 at 5:12 PM, Jayson Broughton jbrough...@truecos.com wrote: We have some linux servers here that peg-out on CPU/Mem when they are doing a full backup, so I have used the wonderful 'Nagios External Command List' website to custom tailor my alert notifications for hosts/hostgroups every day during a certain time. Specifically the DISABLE_HOST_SVC_NOTIFICATIONS and DISABLE_HOST_SVC_CHECKS. Then when it's time to re-monitor, a crontab entry runs: ENABLE_HOST_SVC_NOTIFICATIONS ENABLE_HOST_SVC_CHECKS. Check out those 4 options on: http://www.nagios.org/developerinfo/externalcommands/commandlist.php The examples give you an excellent base script to work off of. ~Jayson -Original Message- From: Marc Powell [mailto:m...@ena.com] Sent: Thursday, January 08, 2009 8:05 AM To: Nagios Users Subject: Re: [Nagios-users] downtime large number of hosts? On Jan 8, 2009, at 8:51 AM, Mahmoud Hanafi wrote: Is there a easy way to schedule down time for large number of hosts/ services at the same time? The downtime web interface doesn't accept wildcard characters. If they are all in the same hostgroup you can click on the hostgroup name (in ()'s) and schedule downtime for all of them at once, otherwise, no not that I am aware of outside of some simple scripting to send the appropriate external command for each host through the external command file. -- Marc -- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True
Re: [Nagios-users] Could not complete ssl handshake
Harry, There are afew thing you can check that cause this problem. 1. Check versions of the nrpe on both the server and the client 2. Did you install check_nrpe from source or rpm? You might want to check if one is compiled with ssl support and the other isn't compiled for SSL support (I had that problem, server was compiled for SSL and listening for SSL connections, client wasn't) 3. I know with RHEL (what we use here) you have to set up the SELinux permissions to allow the port through, along with editing the /etc/xinetd.d/nrpe file and send a hup to xinetd so it can talk to the server/client. Here's a copy of my xinetd file in /etc/xinetd.d/ a. # default: on b. # description: NRPE (Nagios Remote Plugin Executor) c. service nrpe d. { e. flags = REUSE f.socket_type = stream g. port= 5666 h. wait= no i. user= nagios j.group = nagios k. server = /usr/local/nagios/bin/nrpe l. server_args = -c /usr/local/nagios/etc/nrpe.cfg --inetd m.log_on_failure += USERID n. disable = no o. only_from = NAGIOS.SERVER.IP.ADDRESS p. } 4. Try running a higher debug on the server and manually run check_nrpe from the server to the client, check logs. Hope this helps! ~Jayson Broughton From: Hart, Harry M. CTR USJFCOM JTCI [mailto:harry.hart@jfcom.mil] Sent: Thursday, January 08, 2009 4:37 AM To: Nagios Users Mailinglist Subject: [Nagios-users] Could not complete ssl handshake I know I've seen this error on this forum before but can not remember what resolved the problem. I installed NRPE on a Linux system to talk to another Linux machine that is the Nagios server. I do the check_nrpe and get Could not complete SSL handshake. It works fine when I do it from the server to the remote system. Thanks for any help on this one. Harry Harry M. Hart Systems integrator SAIC USJFCOM JIOC DCGS-A (757) 203-7422 DSN 668-7422 harry.h...@intel.jwfc.jfcom.smil.mil harry.h...@jwfc.ic.gov The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you.-- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] downtime large number of hosts?
We have some linux servers here that peg-out on CPU/Mem when they are doing a full backup, so I have used the wonderful 'Nagios External Command List' website to custom tailor my alert notifications for hosts/hostgroups every day during a certain time. Specifically the DISABLE_HOST_SVC_NOTIFICATIONS and DISABLE_HOST_SVC_CHECKS. Then when it's time to re-monitor, a crontab entry runs: ENABLE_HOST_SVC_NOTIFICATIONS ENABLE_HOST_SVC_CHECKS. Check out those 4 options on: http://www.nagios.org/developerinfo/externalcommands/commandlist.php The examples give you an excellent base script to work off of. ~Jayson -Original Message- From: Marc Powell [mailto:m...@ena.com] Sent: Thursday, January 08, 2009 8:05 AM To: Nagios Users Subject: Re: [Nagios-users] downtime large number of hosts? On Jan 8, 2009, at 8:51 AM, Mahmoud Hanafi wrote: Is there a easy way to schedule down time for large number of hosts/ services at the same time? The downtime web interface doesn't accept wildcard characters. If they are all in the same hostgroup you can click on the hostgroup name (in ()'s) and schedule downtime for all of them at once, otherwise, no not that I am aware of outside of some simple scripting to send the appropriate external command for each host through the external command file. -- Marc -- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] best way to analyse traffic on a network-device
We use Cacti to analyze traffic information on a network device. Actually, we use it to monitor and analyze all network traffic on ports on our switches. I know in the past that Cacti and Nagios have gone hand in hand when it comes to monitoring with either tool. Here's a very short bit on cacti/nagios from the nagioswiki page. I don't know what other people do, but I have cacti running on the same server as nagios. http://nagioswiki.com/wiki/index.php/Cacti and cacti's website: http://cacti.net If you want to integrate cacti with nagios, there seems to be a-few plugins on nagiosexchange.org for cacti-nagios integration. This one looks promising: C2n - Cacti to Nagios c2n.cgi is a filter/wrapper for the frontend of Cacti to include it in Nagios. It includes the Cacti interface in the right frame of the Nagios web interface. It filters out any possiblity to switch from one host to another using cacti menus. Therefore the authentication system of Nagios will take place and the authentication system of Cacti is unused. http://www.nagiosexchange.org/cgi-bin/page.cgi?g=Detailed%2F2743.html;d=1 ~Jayson Jayson Broughton Linux Systems Administrator True Oil Companies jbrough...@truecos.com -Original Message- From: Ralf Prengel [mailto:ralf.pren...@comline.de] Sent: Tuesday, December 16, 2008 6:13 AM To: nagios-user Mailinglist Subject: [Nagios-users] best way to analyse traffic on a network-device Hallo, what is the best way to analyse traffic on a network-device. I need graphs with average-values in Kb like mrtg is building. Using snmp and/ or new plugins is possible. -- i.A. Ralf Prengel Customer Care Manager Comline AG Hauert 8 D-44227 Dortmund/Germany Fon +49231 97575- 904 Fax +49231 97575- 905 EMail ralf.pren...@comline.de -- www.comline.de Vorstand Stephan Schilling, Erwin Leonhardi Aufsichtsrat Dr. Franz Schoser (Vorsitzender) HR Dortmund B 14570 USt.-ID-Nr. DE 124727422 -- SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada. The future of the web can't happen without you. Join us at MIX09 to help pave the way to the Next Web now. Learn more and register at http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. -- SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada. The future of the web can't happen without you. Join us at MIX09 to help pave the way to the Next Web now. Learn more and register at http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitor cpu usage...
How about the -C flag? ./check_proc -w 10 -c 20 -C ApplicationName -metric=CPU ~Jayson From: Craig A [mailto:tabmo...@gmail.com] Sent: Monday, December 15, 2008 2:01 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] monitor cpu usage... Hi all, I've looked at the documentation, at the mailling lists, and googled around, and don't explicitly see it so thought i'd try here On our system, I've set up nagios and it's monitoring a few things (http, etc) just fine. However, one of our applications is giving us grief and sometimes starts consuming 80% of a cpu (it's a quad-core machine). So i'd like to set up a monitor to see how often this happens and to get notified. Looking around, it seems like 'check_procs' with the CPU metric should do what i want. However, it doesn't seem to behave quite as I'd expect. If I look at 'top' on this linux system, for our 'problem' application, i see this (and it will be constant like this for 1 hour): PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 20022 ourapp 18 0 1428m 1.0g 13m S 104 25.7 442:18.39 java So when I run this command, I'd expect to see 1 process flagged: [r...@ours libexec]# ./check_procs -w 10 -c 20 --metric=CPU CPU OK: 173 processes I've tried adding the '-u' flag but made no difference. Is there something I'm missing if I want to capture and be notified when a process goes above a certain CPU usuage (ie. 80%)? Any help or information you could provide would be appreciated. Thanks The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you.-- SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada. The future of the web can't happen without you. Join us at MIX09 to help pave the way to the Next Web now. Learn more and register at http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring HP Hardware - NRPE/NSCA on Windows
Hey Curtis, I can vouch for the check-hpasm utility that Gerhard recommends. We run a shop of 20+ HP servers (all sorts of HP servers) and I use the check-hpasm utility for our redhat servers. This utility has saved my neck a few times when hardware has failed in remote sites. What I have done was install the plugin under /usr/nagios/libexec/ and called it from the nagios server via nrpe. Although the only problem I do have with the plugin is that at times it will give false readings on the first check, but cleared on the 2nd check. Logs from hpasmcli show nothing, so I would assume it has something to do with the plugin being called by nrpe and just hiccupping. I solved this by setting my notification thresholds for the service to notify via email after the 3rd check. Here's what a memory module error looks like: CRITICAL - dimm module 5 @ cartridge 2 needs attention (dimm is degraded) And in a perfect world: OK - System: `proliant dl585 g1', S/N: 'x',ROM:A01 02/14/2007', hardware working fine Jayson From: Gerhard Lausser [mailto:[EMAIL PROTECTED] Sent: Wednesday, December 03, 2008 5:57 AM To: 'Curtis LaMasters'; nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Monitoring HP Hardware - NRPE/NSCA on Windows Hi Curtis, with http://www.consol.com/opensource/nagios/check-hpasm you have two possibilities local (linux): the plugin runs on the HP and parses the output of the hpasmcli command remote (linux,windows): the plugin runs on the Nagios server and uses SNMP In both cases you need to install the hpasm RPM or the Windows Insight Management Agents. Gerhard _ Von: Curtis LaMasters [mailto:[EMAIL PROTECTED] Gesendet: Dienstag, 2. Dezember 2008 23:17 An: nagios-users@lists.sourceforge.net Betreff: [Nagios-users] Monitoring HP Hardware - NRPE/NSCA on Windows What is the best method for doing this. I would prefer to have some sort of script that parses information on the remote host on HP hardware. I have found a few that use SNMP but I'm not sure how that works with NRPE/NSCA. For Dell hardware I use check_dell.exe from http://www.itefix.no http://www.itefix.no/ . Anything similar would be great. Curtis LaMasters http://www.curtis-lamasters.com http://www.builtnetworks.com The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you.- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE way too fragile ?
Guillaume, We actually have the same problem here where Nagios is setup. I use the NRPE daemon on both windows and linux servers. Two of our servers do backups early in the morning and during that time we get NRPE Timeout messages from the two servers. I have set the timeout on the server and the clients to timeout after 30 seconds, thinking that would fix it. But alas, we still get timeout messages. I haven't had much time to see what I can do to fix this, so for now our solution is to not email out warning messages from those two servers (I have the thresholds set enough where the critical messages still gives enough window to take care of the problem before going too critical) I have come to the conclusion that the servers are running the backup and eating up so much processing that the nrpe times out trying to connect and send information. If you find a solution or even an idea to try, feel free to let me know and I'll give it a shot! Jayson Broughton Linux Systems Administrator True Computer Operations Dept. True Oil LLC -Original Message- From: Guillaume Rousse [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 08, 2008 4:44 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NRPE way too fragile ? Hello list. I'm using nrpe quite heavily for testing lots of local service on all my machines. It work usually well, but seems a bit unreliable: too much often, nrpe itself fails to accept incoming connections, and test fails: CHECK_NRPE: Socket timeout after 10 seconds. stracing nrpe process shows it is probably waiting itself on another connection: [EMAIL PROTECTED] ~]# strace -p 22444 Process 22444 attached - interrupt to quit select(6, [5], NULL, [5], {0, 17}) = 0 (Timeout) accept(5, 0, NULL) = -1 EAGAIN (Resource temporarily unavailable) It usually recovers itself alone, but that's enough to cause much unwanted notifications, even if all monitored services have nrpe itself as dependency. I'm using ssl encryption, as usually advised, but I'm planning shifting to plain-text connection (everything occurs on a distinc VLAN, without user access). Does everyone else has similar experience ? -- Guillaume Rousse Moyens Informatiques - INRIA Futurs Tel: 01 69 35 69 62 - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null