[Nagios-users] Service check goes HARD too quick if multiple service are in problem state
Hi,

I have had this problem previously and posted here, but got nowhere with it, so I'll have another bash.

Basically my Nagios machine is checking too frequently and firing out alerts too quickly. It's ignoring the retry_interval value, the max_check_attempts value, and the notification_interval value in the escalations.

I have:

* Check interval of 5 minutes in OK state
* Retry interval of 3 minutes when in a problem state
* Notification interval of 3 minutes

I believe the problem is that multiple service checks in a problem state at the same time are causing this. I've just seen it on one of my hosts. It appears to be accumulating the service checks (even though they are different checks) into a final HARD state.

Prior to 17:18 all was fine on this host! Then:

* At 17:18 a SQL check went to WARNING state, SOFT 1.
* It was checked again at 17:21, which is the 3 minute interval I have told it to use when in a problem state. It was still WARNING, so on to SOFT 2.
* At 17:22 a different service check on that host (memory usage) went CRITICAL for the first time, and Nagios put it at HARD 3, even though this check for memory should have been SOFT 1.

An alert then got sent straight out for the memory check, even though it was actually only check 1/3 on that particular service.

Here is the copy and paste from the history of the host:

[01-15-2013 17:18:24] SERVICE ALERT: SERVER;SQL LOCK TIMEOUTS;WARNING;SOFT;1;WARNING - 2.3067 lock timeouts / sec for _Total, 2.0667 lock timeouts / sec for Key, 0. lock timeouts / sec for RID, 0.2400 lock timeouts / sec for Page, 0. lock timeouts / sec for Object, 0. lock timeouts / sec for Metadata, 0. lock timeouts / sec for HoBT, 0. lock timeouts / sec for File, 0. lock timeouts / sec for Extent, 0. lock timeouts / sec for Database, 0. lock timeouts / sec for Application, 0. lock timeouts / sec for AllocUnit
[01-15-2013 17:21:24] SERVICE ALERT: SERVER;SQL LOCK TIMEOUTS;WARNING;SOFT;2;WARNING - 1.3056 lock timeouts / sec for _Total, 1.1833 lock timeouts / sec for Key, 0. lock timeouts / sec for RID, 0.1222 lock timeouts / sec for Page, 0. lock timeouts / sec for Object, 0. lock timeouts / sec for Metadata, 0. lock timeouts / sec for HoBT, 0. lock timeouts / sec for File, 0. lock timeouts / sec for Extent, 0. lock timeouts / sec for Database, 0. lock timeouts / sec for Application, 0. lock timeouts / sec for AllocUnit
[01-15-2013 17:22:04] SERVICE ALERT: SERVER;MEMORY USAGE;CRITICAL;HARD;3;CRITICAL: physical memory: Total: 10G - Used: 9.81G (98%) - Free: 192M (2%) critical

Does anybody please have any idea why my server is checking too frequently and alerting too frequently, and why it's totting up different service checks?

This machine has done nothing but misbehave since it was loaded a couple of months ago. I'm using the same config files on it as I did on the previous box I had. The only difference is that the previous box was running 3.3.1, and I had none of these problems on that install.

This is a Nagios 3.4.1 install on an Ubuntu 12.04 desktop 32-bit OS.

Thanks in advance

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
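The history entries quoted in the message above follow the `SERVICE ALERT: host;service;state;state type;attempt;output` layout, so the per-service attempt counters can be pulled apart mechanically. The following is a minimal Python sketch, not part of the original post, that parses such lines and flags any service whose first logged alert is already HARD with an attempt number above 1. The function names are hypothetical and the plugin output in the sample lines is shortened.

```python
import re

# Layout taken from the history lines quoted in the post:
# [timestamp] SERVICE ALERT: host;service;state;state_type;attempt;plugin output
ALERT_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\] SERVICE ALERT: "
    r"(?P<host>[^;]*);(?P<service>[^;]*);(?P<state>[^;]*);"
    r"(?P<state_type>[^;]*);(?P<attempt>\d+);(?P<output>.*)$"
)

def parse_alert(line):
    """Return the fields of one SERVICE ALERT line as a dict, or None."""
    match = ALERT_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["attempt"] = int(record["attempt"])
    return record

def first_alert_already_hard(lines):
    """List (host, service) pairs whose first logged alert is HARD with attempt > 1."""
    seen = set()
    flagged = []
    for line in lines:
        record = parse_alert(line)
        if record is None:
            continue
        key = (record["host"], record["service"])
        if key in seen:
            continue
        seen.add(key)
        if record["state_type"] == "HARD" and record["attempt"] > 1:
            flagged.append(key)
    return flagged

# Sample lines from the post; the plugin output text is shortened here.
SAMPLE = [
    "[01-15-2013 17:18:24] SERVICE ALERT: SERVER;SQL LOCK TIMEOUTS;WARNING;SOFT;1;WARNING - 2.3067 lock timeouts / sec for _Total",
    "[01-15-2013 17:21:24] SERVICE ALERT: SERVER;SQL LOCK TIMEOUTS;WARNING;SOFT;2;WARNING - 1.3056 lock timeouts / sec for _Total",
    "[01-15-2013 17:22:04] SERVICE ALERT: SERVER;MEMORY USAGE;CRITICAL;HARD;3;CRITICAL: physical memory: Total: 10G",
]

print(first_alert_already_hard(SAMPLE))
```

Run against a longer slice of the log, this kind of check would show whether the jump straight to HARD happens only for particular services or for every service on the host.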
[Nagios-users] check_nt - MEMORY USAGE - incorrect results
Hi all,

I'm using the supplied check_nt plugin to check memory usage on Windows servers. Some report correctly, others report a complete load of old tosh! I have tried 3 different versions of Windows and the version seems to make no odds. It doesn't matter whether it's 32 or 64 bit either.

Some examples:

* My primary domain controller, Windows Server 2008 R2 (x64), 8GB RAM installed. Output from the check appears correct:
  Memory usage: total:8205.64 Mb - used: 2902.96 Mb (35%) - free: 5302.67 Mb (65%)
* Another 2008 R2 (x64) server box (has SQL installed on it), 12GB RAM installed. Output thinks it's got 24GB:
  Memory usage: total:24573.16 Mb - used: 1796.71 Mb (7%) - free: 22776.45 Mb (93%)
* A Server 2003 Standard (x86) box (an internal test web server), 512MB RAM installed. Output thinks it's got over 1GB:
  Memory usage: total:1257.50 Mb - used: 333.30 Mb (27%) - free: 924.20 Mb (73%)
* A Server 2012 (x64) box (with Hyper-V installed), 28GB RAM installed. Output thinks it's got 32GB:
  Memory usage: total:32500.80 Mb - used: 16709.37 Mb (51%) - free: 15791.43 Mb (49%)

Does anybody have any ideas as to why check_nt is returning incorrect info? I know it's incorrect but Nagios doesn't, so where exactly is it reading these values from?

Thanks in advance for anybody's input.

Regards
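One explanation often given for figures like these, which this thread does not confirm, is that the memory total reported through check_nt reflects physical RAM plus page file rather than installed RAM alone. Under that assumption, the difference between the reported total and the installed RAM would roughly be the page file size on each box. The small Python sketch below only does that subtraction on the numbers from the post; the function name is hypothetical.

```python
MB_PER_GB = 1024

# (description, installed RAM in GB, "total" in MB as printed by the check), from the post
REPORTS = [
    ("2008 R2 domain controller", 8, 8205.64),
    ("2008 R2 SQL box", 12, 24573.16),
    ("2003 Standard web server", 0.5, 1257.50),
    ("2012 Hyper-V box", 28, 32500.80),
]

def extra_beyond_ram_mb(installed_gb, reported_total_mb):
    """Reported total minus installed RAM, in MB (assumed here to be page file)."""
    return round(reported_total_mb - installed_gb * MB_PER_GB, 2)

for name, installed_gb, total_mb in REPORTS:
    print(f"{name}: {extra_beyond_ram_mb(installed_gb, total_mb):.2f} MB beyond installed RAM")
```

If the differences line up with the page file sizes configured on each server, that would support the assumption; if they do not, the cause lies elsewhere.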
Re: [Nagios-users] Anybody use check_mssql_health plugin?
Hello Robert,

This is my command:

# 'check_mssql_health CPU Usage' command definition
define command{
    command_name    check_sql_cpu
    command_line    $USER1$/check_mssql_health -server $HOSTADDRESS$ -username $ARG1$ -password $ARG2$ -mode $ARG3$
    }

This is my service check for said server:

define service{
    use                     service-sql-server,srv-pnp
    host_name               ABCDEF
    service_description     CPU USAGE
    check_command           check_sql_cpu!user!pass!cpu-busy!
    }

Cheers

From: Werner, Robert [mailto:rwer...@pomwonderful.com]
Sent: 06 December 2012 22:05
To: Nagios Users List
Subject: Re: [Nagios-users] Anybody use check_mssql_health plugin?

I didn't know you could check the CPU status with that plugin. What is the command definition that you are using?

Robert G. Werner
Oracle Apps Systems Administrator
rwer...@pomwonderful.com
559.521.5089

From: Andrew Thompson [and...@fulgent.co.uk]
Sent: Thursday, December 06, 2012 4:48 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Anybody use check_mssql_health plugin?
[Nagios-users] Anybody use check_mssql_health plugin?
If so, have you ever come across this issue before?

Everything works fine apart from 1 server and 1 check. One particular Windows 2008 R2 server reports its CPU usage as a crazy percentage:

[12-05-2012 14:47:35] SERVICE ALERT: XX;CPU USAGE;CRITICAL;HARD;3;CRITICAL - CPU busy 194180.74%
[12-05-2012 14:44:35] SERVICE ALERT: XX;CPU USAGE;CRITICAL;SOFT;2;CRITICAL - CPU busy 116508.44%
[12-05-2012 14:41:35] SERVICE ALERT: XX;CPU USAGE;CRITICAL;SOFT;1;CRITICAL - CPU busy 233016.89%

When I check the server, the CPU isn't even using 10% most of the time. It does this from the terminal as well, with both the nagios user and the root user.

Does anybody have any ideas as to what can cause this please?

Many Thanks
[Nagios-users] Nagios is ignoring the retry_interval setting
Hi,

My Nagios box has decided to stop listening to the retry_interval entry in my templates.

My server template reads:

define host{
    name                            host-server
    check_period                    server_24x7
    check_interval                  1
    retry_interval                  3
    max_check_attempts              3
    notification_period             server_24x7
    notification_interval           3
    notification_options            d,r
    notifications_enabled           1
    contact_groups                  servers email, servers sms
    event_handler_enabled           1
    process_perf_data               1
    retain_status_information       1
    retain_nonstatus_information    1
    passive_checks_enabled          0
    obsess_over_host                0
    check_freshness                 0
    flap_detection_enabled          0
    failure_prediction_enabled      0
    }

Now this is what happens:

* Server goes down at 1pm.
* I look at the next scheduled check and it clearly states 1.03pm.
* But at 1.01pm it checks again and then spits out an email and text message saying the server is down, completely ignoring the retry_interval setting!

I'd expect from the above:

* 1pm: server goes down.
* 1.03pm: check 2 is done.
* 1.06pm: check 3 is done and the state is determined to be hard.
* 1.06pm: the notification is sent out.

Why is this? Is something in my config wrong?

Ubuntu 12.04 desktop and Nagios 3.4.1

Thanks
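The timeline the poster expects follows directly from retry_interval and max_check_attempts, and can be written out as a small calculation. The Python sketch below is an illustration only: it assumes one interval unit is 60 seconds (the usual interval_length default), uses an arbitrary date, and models only regularly scheduled retries, not any on-demand checks that Nagios may run outside the schedule. The function name is hypothetical.

```python
from datetime import datetime, timedelta

def expected_attempts(first_failure, retry_interval, max_check_attempts,
                      interval_length=60):
    """Return (time, attempt, state type) for each scheduled retry.

    interval_length is the number of seconds per interval unit; 60 is
    assumed here, matching the minutes used in the post.
    """
    step = timedelta(seconds=retry_interval * interval_length)
    timeline = []
    for attempt in range(1, max_check_attempts + 1):
        when = first_failure + step * (attempt - 1)
        state_type = "HARD" if attempt == max_check_attempts else "SOFT"
        timeline.append((when.strftime("%H:%M"), attempt, state_type))
    return timeline

# Values from the host template in the post; the date itself is arbitrary.
for entry in expected_attempts(datetime(2012, 11, 29, 13, 0),
                               retry_interval=3, max_check_attempts=3):
    print(entry)
```

Comparing this expected schedule with the timestamps in the Nagios log shows exactly which check arrived early, which is useful detail to include when reporting the problem.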
Re: [Nagios-users] Nagios is ignoring the retry_interval setting
Hi Georgio,

The whole test cfg I am using to try to troubleshoot this can be found at:
http://dl.dropbox.com/u/895609/test.cfg

This is a direct copy of my main server's config, but with the rest of the servers and some templates for other server checks taken out.

Kind Regards
Andrew

From: Andrew Thompson
Sent: 29 November 2012 16:11
To: nagios-users@lists.sourceforge.net
Subject: Nagios is ignoring the retry_interval setting
[Nagios-users] SSL Enable web interface help
Hi all,

I am publishing my Nagios web console so that it is accessible over the internet, and now I need to make it secure.

I have loaded a test VM with Nagios 3.4.1 on Ubuntu 12.04 desktop to test on, rather than play with SSL on the live box.

I have followed this guide:
http://blog.stefandanielschwarz.de/2010/02/howto-securing-nagios.html

But I can't get it to work. When I browse to https://localhost/nagios, Firefox returns:

Secure Connection Failed
An error occurred during a connection to localhost
SSL received a record that exceeded the maximum permissible length
(Error code: ssl_error_rx_record_too_long)

For the cert setup I entered localhost for the Common Name. On the live box, though, I will enter the FQDN that I will hit over the internet, abc.def.com for example.

Can anybody give me any pointers on how to sort this out on the test box please?

Thank you
Re: [Nagios-users] SSL Enable web interface help
Hello, thanks for the reply.

Nagios is definitely running. I don't have any firewall enabled on the local machine.

Thanks

From: romy rajoelison [mailto:cafriner...@yahoo.fr]
Sent: 21 November 2012 12:23
To: Nagios Users List
Subject: Re: [Nagios-users] SSL Enable web interface help

Hi,

Have you tried typing:

    service nagios start
    chkconfig nagios on

Also make sure that the firewall doesn't block Nagios's port.

From: Andrew Thompson and...@fulgent.co.uk
To: nagios-users@lists.sourceforge.net
Sent: Wednesday 21 November 2012 15:01
Subject: [Nagios-users] SSL Enable web interface help
[Nagios-users] Performance Data from custom bash script
Hello,

I have a custom-written bash script for checking my websites and trawling around them.

Now I need to learn and understand how to get performance data into Nagios from this script so I can graph it with PNP4Nagios.

Can anybody give me any pointers on how to get performance output from this script? I am by no means a programmer, so I feel this might be a steep learning curve.

Thank you
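For context on the question above: a Nagios plugin passes performance data by appending a `|` to its single output line, followed by space-separated metrics in the form `label=value[UOM];warn;crit;min;max`. The Python sketch below builds such a line. The metric names, thresholds, and status text are hypothetical, and the same line could equally be printed from a bash script with `echo` or `printf`.

```python
def perfdata(label, value, uom="", warn="", crit="", minimum="", maximum=""):
    """Format one metric as label=value[UOM];warn;crit;min;max.

    Trailing empty fields are dropped; labels containing spaces are quoted.
    """
    if " " in label:
        label = f"'{label}'"
    fields = [f"{label}={value}{uom}", str(warn), str(crit), str(minimum), str(maximum)]
    return ";".join(fields).rstrip(";")

def plugin_line(status_text, metrics):
    """Join the human-readable status text and the metrics with ' | '."""
    return f"{status_text} | {' '.join(metrics)}"

# Hypothetical values for a website check: response time and page size.
line = plugin_line(
    "WEBSITE OK - login succeeded in 0.42s",
    [perfdata("time", 0.42, "s", warn=2, crit=5, minimum=0),
     perfdata("size", 15320, "B")],
)
print(line)
```

Everything after the `|` is what a grapher such as PNP4Nagios reads; the text before it is what appears in the web interface and in notifications.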
[Nagios-users] Check Website Help
Hi all,

I have websites that I monitor. Some require just a "yes, it's up" or "no, it's down", found by hitting the site and searching for text on it. Others require the site to be logged into and trawled to test the SQL backend, so I need to provide a username and password.

I also want performance data, as I have just started using PNP4Nagios.

I am using a custom-written bash script at the moment, and this works fine, but it doesn't output any performance data.

I have looked at WebInject, but this uses its own graphing tool (gnuplot), and to reduce confusion I don't really want to run 2 graphing tools.

So 2 questions:

1. Is there a plugin available that does all I need above? Or
2. Can anybody point me in the right direction on how to add performance data to my already working bash script? (I'm not a programmer, however my colleague is!)

Thanks in advance.
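The check described above (look for expected text, time the request, report performance data) comes down to a small decision: pick an exit code, then print one line with the timing after a `|`. The Python sketch below shows that decision on its own, without any network access; the function name, thresholds, and sample page text are hypothetical, and a bash script could apply the same logic.

```python
OK, WARNING, CRITICAL = 0, 1, 2  # standard Nagios plugin exit codes

def evaluate_page(body, elapsed_s, expected_text, warn_s, crit_s):
    """Return (exit code, output line) for one fetched page.

    body is the page content already fetched by the caller and
    elapsed_s is how long the fetch took, in seconds.
    """
    if expected_text not in body:
        code, text = CRITICAL, f"CRITICAL - '{expected_text}' not found"
    elif elapsed_s >= crit_s:
        code, text = CRITICAL, f"CRITICAL - page took {elapsed_s:.2f}s"
    elif elapsed_s >= warn_s:
        code, text = WARNING, f"WARNING - page took {elapsed_s:.2f}s"
    else:
        code, text = OK, f"OK - text found in {elapsed_s:.2f}s"
    # Performance data: label=value[UOM];warn;crit;min
    perf = f"time={elapsed_s:.2f}s;{warn_s};{crit_s};0"
    return code, f"{text} | {perf}"

code, line = evaluate_page("<h1>Welcome back</h1>", 0.42, "Welcome back", 2, 5)
print(code, line)
```

A real plugin would print the line and then exit with the returned code, since Nagios takes the service state from the exit status rather than from the text.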
[Nagios-users] Check_Mk - no hosts
Hi,

I'm trialling the check_mk plugin, as I want 2 Nagios boxes in 1 web interface.

I have it set up on a test box (I used OMD to install everything).

Anyhow, I have added my Windows 7 laptop to the main.mk file, installed the agent, and checked that the service is running. Then I run check_mk -I and then check_mk -U, but my host doesn't appear.

I can telnet from the Nagios box to my laptop on port 6556 and can see all the info in the terminal, but I can't get it into Nagios/Check_MK.

Would somebody care to point out the most obvious simple step I have omitted?

Thanks
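For reference when reading the message above: main.mk is read as Python, so hosts are listed with ordinary Python list syntax in `all_hosts`. The fragment below is a minimal sketch with a hypothetical host name; it is not the poster's file. The post also does not say whether Nagios was reloaded after `check_mk -U`; as far as I know `check_mk -O` performs the update and then reloads the core, so a missing reload is one thing worth ruling out.

```python
# main.mk is parsed as Python, so a plain list of host names is enough
# for a first test. "win7-laptop" is a hypothetical name used for illustration.
all_hosts = [
    "win7-laptop",
]
```

After editing the file, the sequence from the post (`check_mk -I`, then an update) applies as before.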
[Nagios-users] 2 Nagios boxes in 1 web interface
Hello,

At the moment I have 1 box set up, monitoring 2 sites (1 local and 1 remote). If I go to www.blablabla.co.uk, it routes through to my office and the Nagios box, and the web interface is displayed.

I want to put a box in at the remote site so that each box only checks its own site (I need to reduce load and traffic over the VPN).

Is there a plugin that allows both localhost/nagios web interfaces to be combined into 1 that is accessible via my www.blablabla.co.uk address?

Thanks
[Nagios-users] 2 Nagios boxes running together in different locations
Hi all,

We run 1 Nagios box in our office at Location A. This monitors all servers, websites, etc. in Location A and in a remote office at Location B, hooked up with a static VPN.

We have a bit of a temperamental firewall at the moment that keeps going down, resulting in everything appearing down to Nagios in Location A and it alerting like a lunatic for all hosts/services (88/156).

So the question is: can I put a second Nagios box in Location B as some sort of slave, and have it cross-reference the Nagios box in Location A, and only alert if both boxes agree that said host(s)/service(s) is/are in a down state?

If so, are there any guides out on the web that will assist with this setup?

Thanks

Kind Regards
Andrew
Re: [Nagios-users] Root_partition check not reading correctly
Check the command lines and the way you are spelling disk/disc, as you have 2 different spellings:

    command_line    $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
    command_line    $USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$

Check the actual name of the file in the libexec folder.

Also, what is $ARGS$? Shouldn't it be $ARGnumber$, as you correctly have in the top example?

-----Original Message-----
From: Wolf Halton [mailto:wolf.hal...@gmail.com]
Sent: 19 March 2012 16:50
To: Nagios Users List
Subject: Re: [Nagios-users] Root_partition check not reading correctly

More stuff.

I am in commands.cfg and added check_disk as a command to check disks on the remote server, as well as check_local_disk, which I understand to be about checking the Nagios server's disk.

[code]
define command{
    command_name    check_local_disk
    command_line    $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
    }

define command{
    command_name    check_disk
    command_line    $USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$
    #command_line   $USER1$/check_disc
    }

The front-end error is now:
(Return code of 127 is out of bounds - plugin may be missing)

On Mon, Mar 19, 2012 at 11:27 AM, Wolf Halton wolf.hal...@gmail.com wrote:

I found it:

    check_command check_disk! -w 20% -c 10% --path=/

It was missing the !

Thanks for helping me sort it out.

Wolf

On Mon, Mar 19, 2012 at 10:37 AM, Wolf Halton wolf.hal...@gmail.com wrote:

I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files, to clue me in.

I am running nagios 3.3.1 and nagios-plugins 1.4.15. The application is check_local_disk, but I think that must have been a check from the previous nagios. The one in libexec is check_disk, so I tried:

[code]
define service{
    use                     local-service
    host_name               LTS-MASTERKEY-000
    service_description     Root Partition
    check_command           check_disk -w 20% -c 10% --path=/
    notifications_enabled   1
    max_check_attempts      3
    check_interval          5
    retry_interval          3
    check_period            24x7
    notification_interval   15
    notification_period     24x7
    notification_options    w,c,r
    contact_groups          admins
    }
[/code]

and nagios will not start. It says there is a configuration error.

On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote:

Please show the service definition. How do you launch the check? By ssh, by nrpe? It seems you're using the same IP address or DNS name as the hostname value. Can you verify this?

On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote:

All my machines show a similar output, regardless of how much is available on their root partitions:

    Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%):

Up to and including ones that are 100% full. No alarms, ever. Is a client app needed on the monitored clients that has not been mentioned?

-Wolf

This Apt Has Super Cow Powers - http://sourcefreedom.com
Advancing Libraries Together - http://LYRASIS.org
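Two points from the thread above, the missing `!` and the `$ARGS$` placeholder, both concern how a check_command string is split into a command name and numbered arguments. The Python sketch below is a deliberately simplified model of that substitution, not Nagios's actual macro code: it handles only `$USER1$` and `$ARGn$`, leaves any other `$...$` token untouched, and assumes `/usr/local/nagios/libexec` as the plugin directory. The `check_local_disk!20%!10%!/` string is an illustrative value.

```python
import re

def split_check_command(check_command):
    """Split 'name!arg1!arg2' into (name, [arg1, arg2])."""
    name, *args = check_command.split("!")
    return name, args

def expand(command_line, args, user1="/usr/local/nagios/libexec"):
    """Substitute $USER1$ and $ARGn$ only; other $...$ tokens are left as-is."""
    def replace(match):
        token = match.group(1)
        if token == "USER1":
            return user1
        arg = re.fullmatch(r"ARG(\d+)", token)
        if arg:
            index = int(arg.group(1)) - 1
            return args[index] if 0 <= index < len(args) else ""
        return match.group(0)  # e.g. $ARGS$ is not a numbered argument macro
    return re.sub(r"\$([A-Z0-9_]+)\$", replace, command_line)

name, args = split_check_command("check_local_disk!20%!10%!/")
print(name, args)
print(expand("$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$", args))
print(expand("$USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$", args))

# Without a "!" the whole string is treated as the command name.
print(split_check_command("check_disk -w 20% -c 10% --path=/"))
```

In this model the second command line keeps its literal `$ARGS$` text and points at `check_disc`, and a `check_command` without `!` yields a command name that matches no defined command, which lines up with the configuration error and the return code 127 reported in the thread.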
[Nagios-users] Ackowledge this service link is missing
Hi all,

Nagios 3.3.1 on Ubuntu 11.04 Desktop.

I have just come to acknowledge a critical service and, much to my surprise, I don't have the option to. Is this a known bug?

Any help appreciated.

Kind Regards
Andrew

T: 01386 834000
F: 01386 834100
and...@fulgent.co.uk
www.fulgent.co.uk
Fulgent Technologies Limited, Haddonsacre, Station Road, Offenham, Evesham, WR11 8JJ.

This communication contains information which is confidential and may also be privileged or protected by copyright. It is for the exclusive use of the addressee. If you are not the addressee please note that any distribution, reproduction, copying, publication or use of this communication or the information is prohibited. If you have received this communication in error, please telephone us immediately and also delete the communication from your computer.
Re: [Nagios-users] Ackowledge this service link is missing
Mark, you are absolutely spot on. I thought I was going mad because I knew I had used it before!

Cheers

-----Original Message-----
From: Frost, Mark {BIS} [mailto:mark.fro...@pepsico.com]
Sent: 14 March 2012 13:02
To: Nagios Users List
Subject: Re: [Nagios-users] Ackowledge this service link is missing

Andrew,

I don't know that this is it, but you can only acknowledge something that is in a hard critical state. That is, it has to have hit the threshold of failures before it is considered to be in a hard state.

The service details should tell you if it is, but I've had times where I've thought the same thing, only to realize that the service hadn't gotten all the way to a hard state just yet.

Mark

From: Andrew Thompson [and...@fulgent.co.uk]
Sent: Wednesday, March 14, 2012 8:44 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Ackowledge this service link is missing
[Nagios-users] Does nagios refuse the \ character?
Does Nagios allow \ characters in check commands?

If I run this from the terminal:

    check_mssql_health --servername IP ADDRESS --username domain\\domain user --password QWERTY --mode connected-users

it connects to my SQL database and reports back the connected users.

I have put the following command into my servers.cfg file:

# 'check_mssql_health Connected users' command definition
define command{
    command_name    check_sql_connections
    command_line    $USER1$/check_mssql_health --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$ --mode $ARG3$
    }

And my service definition reads:

define service{
    use                     service-sql-server
    host_name               SERVER
    service_description     SQL CONNECTION TIME
    check_command           check_sql_connections!domain\\domain user!QWERTY!connected-users!
    }

But when the check runs, it outputs a critical error saying it can't connect to the database because of a login error, and that it tried to use domaindomainuser. It has stripped out the \\

Is this normal behaviour for Nagios? Do I need quotes or something?

Thanks

Kind Regards
Andrew
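Part of what the message above describes can be reproduced outside Nagios, because an unquoted backslash is also consumed by the shell that eventually runs the plugin. The Python sketch below uses the standard `shlex` module, which follows POSIX shell quoting rules, to show how many backslashes survive one round of shell word splitting. It does not model whatever Nagios itself does to the string before the shell sees it, and `plugin` and `domain\user` are placeholder values.

```python
import shlex

def shell_words(command_line):
    """Split a command line the way a POSIX shell would split it into words."""
    return shlex.split(command_line, posix=True)

# Two backslashes, unquoted: the shell keeps one.
print(shell_words(r"plugin --username domain\\user"))

# One backslash, unquoted: the shell removes it entirely.
print(shell_words(r"plugin --username domain\user"))

# One backslash inside single quotes: passed through unchanged.
print(shell_words(r"plugin --username 'domain\user'"))
```

Seeing `domaindomainuser` from a config value written with `\\` would be consistent with one level of unescaping happening before the shell and a second in the shell itself, although the thread does not confirm that; quoting the argument or adding further backslashes are the usual things to try.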
[Nagios-users] FW: Does nagios refuse the \ character?
Ignore this, I sorted it by putting some in the check command:

    define service{
        use                     service-sql-server
        host_name               SERVER
        service_description     SQL CONNECTION TIME
        check_command           check_sql_connections!domain\\domain user!QWERTY!connected-users!
        }
[Nagios-users] check_mssql_health plugin by consol labs connection issues
Hi all

Ubuntu 11.04 desktop 32 bit
SQL 2005 server
check_mssql_health-1.5.8.2

Just testing this plugin on one of my SQL servers. I'm using the domain admin to test with, so I know it has full Windows auth access. I have installed the DBD::Sybase package.

If I run from a terminal:

    libexec# ./check_mssql_health --hostname 192.168.1.35 --username domain\\domain admin --password ABCDE --mode connected-users

it throws back:

    CRITICAL - cannot connect to 192.168.1.35. DBI connect(';host-192.168.1.35;port=1433','domain\domain user',...) failed: (no error string) at ./check_mssql_health line 2175

Nothing is logged in the SQL server event log for security logins.

So if I try:

    tsql -H 192.168.1.35 -p 1433 -U domain\\domain user -P ABDCE

it returns:

    Locale is en_GB.UTF-8
    Locale charset is UTF-8
    1>

If I then type in:

    1> SELECT COUNT (*) FROM master..sysprocesses WHERE spid > 51
    2> go

it replies with:

    11
    (1 row affected)
    1>

My SQL server event log also shows a successful security connection for the domain account from my Nagios box IP. So the connection is fine when testing with tsql, but the check_mssql_health script says it cannot connect.

Part of the script around line 2175 reads:

    if ($self->{handle} = DBI->connect(
        $self->{dsn},
        $self->{username},
        $self->{password},
        { RaiseError => 1, AutoCommit => 0, PrintError => 1 })) {
      $retval = $self;

I have changed the entry in /etc/freetds/freetds.conf to read:

    [global]
    # TDS protocol version
    # tds version = 8.0

Can anybody help me further with this?

cheers
[Nagios-users] Host Availability Report Strange Output
Can anybody tell me what is meant by an Undetermined state and the reason being insufficient data?

I added a host last week on Friday 24th and it's had no issues at all, but my weekly email report gave this output for it:

    HOST Availability Report
    Last Updated: Mon Feb 27 00:01:18 GMT 2012
    Nagios(r) Core(tm) 3.3.1 - Logged in as nagiosadmin

    HOST 'A WEBSITE'
    19-02-2012 00:01:00 to 26-02-2012 23:59:00
    Duration: 7d 23h 58m 0s

    First assumed host state:    Unspecified
    First assumed service state: Unspecified
    Report period:               Current time range
    Backtracked archives:        4

    [ Availability report completed in 0 min 0 sec ]

    HOST State Breakdowns:

    State         Type / Reason        Time             % Total Time   % Known Time
    UP            Unscheduled          4d 23h 59m 0s    62.502%        100.000%
                  Scheduled            0d 0h 0m 0s      0.000%         0.000%
                  Total                4d 23h 59m 0s    62.502%        100.000%
    DOWN          Unscheduled          0d 0h 0m 0s      0.000%         0.000%
                  Scheduled            0d 0h 0m 0s      0.000%         0.000%
                  Total                0d 0h 0m 0s      0.000%         0.000%
    UNREACHABLE   Unscheduled          0d 0h 0m 0s      0.000%         0.000%
                  Scheduled            0d 0h 0m 0s      0.000%         0.000%
                  Total                0d 0h 0m 0s      0.000%         0.000%
    Undetermined  Nagios Not Running   0d 0h 0m 0s      0.000%
                  Insufficient Data    2d 23h 59m 0s    37.498%
                  Total                2d 23h 59m 0s    37.498%
    All           Total                7d 23h 58m 0s    100.000%       100.000%

    State Breakdowns For HOST Services:

    Service      % Time OK            % Time Warning     % Time Unknown     % Time Critical    % Time Undetermined
    SITE CHECK   62.502% (100.000%)   0.000% (0.000%)    0.000% (0.000%)    0.000% (0.000%)    37.498%
    Average      62.502% (100.000%)   0.000% (0.000%)    0.000% (0.000%)    0.000% (0.000%)    37.498%

Now I understand that the Unscheduled uptime is right, as it's 5 days into the weekly report before the host was added. But I don't understand the Undetermined stats at the bottom.

Does anybody know about this?

Thanks
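The percentages in the report can be reproduced from the durations alone. The sketch below is just that arithmetic, using the figures quoted above; the function names are made up for illustration. It shows that the UP time and the Insufficient Data time add up to the whole report window, and that "% Known Time" is UP time divided by the time for which any state data exists.

```python
def to_minutes(days, hours, minutes):
    """Convert a 'Xd Yh Zm' duration from the report into minutes."""
    return days * 24 * 60 + hours * 60 + minutes


def percent(part, whole):
    """Percentage rounded to three decimals, as the report prints it."""
    return round(100.0 * part / whole, 3)


total = to_minutes(7, 23, 58)          # report duration
up = to_minutes(4, 23, 59)             # UP, unscheduled
undetermined = to_minutes(2, 23, 59)   # Undetermined, insufficient data
known = total - undetermined           # time with any recorded state

print(percent(up, total))              # % Total Time for UP
print(percent(undetermined, total))    # % Total Time for Undetermined
print(percent(up, known))              # % Known Time for UP
```

With these inputs the three values come out as 62.502, 37.498 and 100.0, matching the report: the Undetermined share is simply the part of the window for which the log archives hold no state for the host.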
[Nagios-users] Exclude Notifications during certain hours
Hi all,

I have all my servers (approx. 70) monitored by PING using the 24x7 time period, with notifications set to 24x7. I'm happy with this, as I want to know when my servers go down at all times; a lot are critical and under strict SLAs.

However, I need to exclude notifications for 1 physical host and the 6 virtual hosts that sit on it during the hours of 00:30-06:30 every night. These machines are not under any SLA and often reboot during the night for various reasons, and I'm done with waking up at all hours for these non-critical servers!

I've had a read of the Nagios docs but didn't really understand them.

I have edited my servers.cfg to show the scenario I want to achieve; it can be found at the link below.

http://dl.dropbox.com/u/895609/servers.cfg

The layout etc. is identical to my live configs; I have just edited the number of hosts/names/contacts etc. (the 3 hosts named NOT CRITICAL stand in for the 7 hosts described above).

What is the quickest and most effective way for me to keep checking ALL my servers 24x7 and notifying 24x7, BUT exclude these 7 particular servers from checks/notifications between 00:30-06:30 every night?

Thanks in advance for your help.
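One way to think about the request above: a Nagios timeperiod lists the times that are allowed, so a nightly quiet window is written as its complement. The sketch below builds such a timeperiod definition as text. It is only an illustration: the timeperiod name not-critical-hours and its alias are made up, and the 7 hosts would still need their notification_period (and check_period, if checks should pause too) pointed at it.

```python
DAYS = ["sunday", "monday", "tuesday", "wednesday",
        "thursday", "friday", "saturday"]


def allowed_ranges(quiet_start, quiet_end):
    """Return the time ranges left in a day once a quiet window
    (HH:MM strings, start before end, same day) is cut out."""
    ranges = []
    if quiet_start != "00:00":
        ranges.append("00:00-" + quiet_start)
    if quiet_end != "24:00":
        ranges.append(quiet_end + "-24:00")
    return ",".join(ranges)


def timeperiod_definition(name, alias, quiet_start, quiet_end):
    """Render a timeperiod object that excludes the quiet window every day."""
    ranges = allowed_ranges(quiet_start, quiet_end)
    lines = ["define timeperiod{",
             "    timeperiod_name " + name,
             "    alias           " + alias]
    for day in DAYS:
        lines.append("    %-15s %s" % (day, ranges))
    lines.append("    }")
    return "\n".join(lines)


# Hypothetical name and alias; the window is the one from the question.
print(timeperiod_definition("not-critical-hours",
                            "24x7 except 00:30-06:30",
                            "00:30", "06:30"))
```

All other hosts keep the existing 24x7 timeperiod, so nothing changes for them.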
[Nagios-users] Warning: Duplicate definition found for service 'XXXX' on host 'YYYY'
Nagios Users,

I'm trying to restructure my Nagios to make the web interface easier to read.

My test servers.cfg file is here: http://dl.dropbox.com/u/895609/office_servers.cfg

When I test my config I get:

    Warning: Duplicate definition found for service '' on host ''

And when I look at my Nagios web interface, every host is getting the same checks. This can be seen at http://i27.photobucket.com/albums/c197/thomo123/Nagios/nagios.jpg

SERVER1 needs just PING, UPTIME and C:\ DISKSPACE
SERVER2 is correct in the services listed
SERVER3 needs just PING, UPTIME and C:\ DISKSPACE

I have looked at the Nagios cheats page, http://nagios.sourceforge.net/docs/3_0/objecttricks.html, but I can't use any of these as the -w and -c values for the x:\ DISKSPACE percentages vary from server to server.

How can I sort this out please? I have tried everything my Nagios knowledge allows and I'm stuck.

Thank you in advance
Re: [Nagios-users] Warning: Duplicate definition found for service 'XXXX' on host 'YYYY'
What happens, Werner, is I worship the ground you walk on!!

Removing the hostgroups line from the service definitions makes Nagios behave exactly the way I want it to!

Thank you so much!

-----Original Message-----
From: Werner Flamme [mailto:werner.fla...@ufz.de]
Sent: 09 February 2012 15:15
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Warning: Duplicate definition found for service '' on host ''

Andrew Thompson [09.02.2012 12:17]:
> Nagios Users, Im trying to restructure my Nagios to make the web interface easier to read.
> My test servers.cfg file is thus: http://dl.dropbox.com/u/895609/office_servers.cfg
> When I test my config I get: Warning: Duplicate definition found for service '' on host ''

Andrew,

what happens when you remove the hostgroups line from each service definition? AFAIK this causes the services to be executed on any host in that hostgroup, while you intend to check one server per service (as I understood).

HTH
Werner
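Werner's explanation can be pictured with a small model. The sketch below is not Nagios code; it is a simplified illustration that assumes a service applies to every host it names plus every member of any hostgroup it names. The hostgroup name office-servers and the dictionary layout are invented for the example.

```python
def expand_services(services, hostgroups):
    """Expand service definitions into (host, description) pairs.

    Returns all pairs and the pairs produced by more than one definition.
    """
    seen = {}
    for svc in services:
        hosts = set(svc.get("host_name", []))
        for group in svc.get("hostgroup_name", []):
            hosts.update(hostgroups[group])
        for host in hosts:
            key = (host, svc["service_description"])
            seen[key] = seen.get(key, 0) + 1
    duplicates = sorted(key for key, count in seen.items() if count > 1)
    return sorted(seen), duplicates


hostgroups = {"office-servers": ["SERVER1", "SERVER2", "SERVER3"]}

# One definition per server, but each also names the whole hostgroup.
services = [
    {"host_name": ["SERVER1"], "hostgroup_name": ["office-servers"],
     "service_description": "PING"},
    {"host_name": ["SERVER2"], "hostgroup_name": ["office-servers"],
     "service_description": "PING"},
    {"host_name": ["SERVER2"], "hostgroup_name": ["office-servers"],
     "service_description": "D DISKSPACE"},
]

pairs, duplicates = expand_services(services, hostgroups)
print(pairs)
print(duplicates)
```

In this model the per-server PING definitions collide on every host, and the disk check meant only for SERVER2 lands on all three, which mirrors both the duplicate warning and the "every host gets the same checks" symptom. Dropping the hostgroup entry from each definition leaves one service per named host.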
[Nagios-users] NDOUtils Error writing to data sink! Some output may get lost. xxx queued items to flush
Hi all,

Ubuntu 11.04 32bit Desktop edition
NDOUtils 1.4b9

I'm trying to install NDOUtils so I can get Nagios writing its log data to MySQL, but I'm not having much luck.

I have followed the install guide to the letter: http://nagios.sourceforge.net/docs/ndoutils/NDOUtils.pdf

When all is said and done, that produces this entry in nagios.log:

    [1328283956] ndomod: NDOMOD 1.4b9 (10-27-2009) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
    [1328283956] ndomod: Could not open data sink! I'll keep trying, but some output may get lost...
    [1328283956] Event broker module '/usr/local/nagios/bin/ndomod-3x.o' initialized successfully.
    [1328283956] Finished daemonizing... (New PID=5771)

So I did a bit of googling and then ran:

    chmod 755 /usr/local/nagios/bin/ndo2db-3x
    chmod 755 /usr/local/nagios/bin/ndomod-3x.o

Restarted the ndomod and nagios services/processes and got somewhere:

    [1328284715] ndomod: NDOMOD 1.4b9 (10-27-2009) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
    [1328284715] ndomod: Successfully connected to data sink. 0 queued items to flush.
    [1328284715] Event broker module '/usr/local/nagios/bin/ndomod-3x.o' initialized successfully.
    [1328284715] ndomod: Error writing to data sink! Some output may get lost...
    [1328284715] Please check remote ndo2db log, database connection or SSL Parameters
    [1328284715] Finished daemonizing... (New PID=13760)

So I did a bit more googling and then ran:

    chmod 775 /usr/local/nagios/etc/ndo2db.cfg
    chmod 775 /usr/local/nagios/etc/ndomod.cfg
    chown nagios:nagios /usr/local/nagios/etc/ndo*

Restarted the ndomod and nagios services/processes and got a bit further; the "check log/db/parameters" error is gone:

    [1328285023] ndomod: NDOMOD 1.4b9 (10-27-2009) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
    [1328285023] ndomod: Successfully connected to data sink. 2793 queued items to flush.
    [1328285023] ndomod: Error writing to data sink! Some output may get lost. 2554 queued items to flush
    [1328285023] Event broker module '/usr/local/nagios/bin/ndomod-3x.o' initialized successfully.
    [1328285023] Finished daemonizing... (New PID=17107)

Then I read that I should change the permissions on the ndo.sock file, so:

    chmod 775 /usr/local/nagios/var/ndo.sock

But still the same output:

    [1328285748] ndomod: NDOMOD 1.4b9 (10-27-2009) Copyright (c) 2009 Nagios Core Development Team and Community Contributors
    [1328285748] ndomod: Successfully connected to data sink. 957 queued items to flush.
    [1328285748] ndomod: Error writing to data sink! Some output may get lost. 2374 queued items to flush
    [1328285748] Event broker module '/usr/local/nagios/bin/ndomod-3x.o' initialized successfully.
    [1328285748] Finished daemonizing... (New PID=19520)

My 2 config files can be found here:

    ndomod.cfg - http://dl.dropbox.com/u/895609/ndomod.cfg
    ndo2db.cfg - http://dl.dropbox.com/u/895609/ndo2db.cfg

Can anybody please help me? This has to be permission related, I'm sure of it, but I cannot solve it.

Thank you in advance
[Nagios-users] Write nagios.log data to a SQL Server database hosted on a remote Windows server
Is this possible?

I know I can get Nagios to write its nagios.log data to MySQL on the Nagios server, but is it possible to have it write its data to a remote Windows SQL Server?

I'm wondering if I can take advantage of Microsoft SQL Reporting Services to turn my data into reports ready to email. Or is there a Linux equivalent of SQL Reporting Services? Or can SQL Reporting Services access MySQL databases hosted on Linux machines?

Anybody have any experience with this?

Cheers
[Nagios-users] FW: I cant get notifications for service alerts
So this morning I have added a check_ping command to the host and left the check_nt on the service.

Disabled host alerts and left service alerts activated for the disk check service group.

Nailed a drive space and got my service alerts in both email and SMS format.

Thanks for the explanation, Patrick, and for pointing out where I was going wrong.

Kind Regards
Andrew

From: Andrew Thompson
Sent: 25 January 2012 21:32
To: nagios-users@lists.sourceforge.net
Subject: Fwd: I cant get notifications for service alerts

Hi Patrick, so what check do you suggest I add in for the host?

For example, my servers cfg uses check-server-alive for the host and check_ping for the service. But the hosts are OK in that scenario, as I only want to know when the host is up or down; I don't want to know when the service is critical.

But in the disk space scenario I only want to know when the service is critical because the disk usage has hit the threshold; I don't need a host up or down notification.

Hope this makes sense?

Regards

Sent from My Samsung Galaxy

-------- Original message --------
Subject: I cant get notifications for service alerts
From: Andrew Thompson and...@fulgent.co.uk
To: nagios-users@lists.sourceforge.net
CC:

Hi all,

I have just set up Nagios to monitor the disk space on my servers. Nagios works fine monitoring and reporting disk space, but I can't get the notifications to work properly.

I also ping all my servers, and the PING checks send me host alerts to say UP or DOWN, but I have disabled service alerts as I don't want them for the PING check.

So I thought I would do the reverse: set up the disk check commands, disable host alerts and just have service alerts for when a server goes over the critical disk space limit I have set.

Problem is, no service alerts get sent out no matter how long the host is in critical state! If I enable host and service alerts for the disk check command, I get host alerts to say that the host is DOWN (which it isn't, it's CRITICAL).

My test disk_check config file is located at the link below; I thought it best to provide the whole lot rather than snippets.

http://dl.dropbox.com/u/895609/disk_space_test.cfg

Can anybody explain why I don't get any service alerts when the host goes CRITICAL, and how I can get them without having host alerts enabled, as I don't want those for my disk check commands?

Thank you in advance
[Nagios-users] Fwd: I cant get notifications for service alerts
Hi Patrick, so what check do you suggest I add in for the host?

For example, my servers cfg uses check-server-alive for the host and check_ping for the service. But the hosts are OK in that scenario, as I only want to know when the host is up or down; I don't want to know when the service is critical.

But in the disk space scenario I only want to know when the service is critical because the disk usage has hit the threshold; I don't need a host up or down notification.

Hope this makes sense?

Regards

Sent from My Samsung Galaxy

-------- Original message --------
Subject: I cant get notifications for service alerts
From: Andrew Thompson and...@fulgent.co.uk
To: nagios-users@lists.sourceforge.net
CC:

Hi all,

I have just set up Nagios to monitor the disk space on my servers. Nagios works fine monitoring and reporting disk space, but I can't get the notifications to work properly.

I also ping all my servers, and the PING checks send me host alerts to say UP or DOWN, but I have disabled service alerts as I don't want them for the PING check.

So I thought I would do the reverse: set up the disk check commands, disable host alerts and just have service alerts for when a server goes over the critical disk space limit I have set.

Problem is, no service alerts get sent out no matter how long the host is in critical state! If I enable host and service alerts for the disk check command, I get host alerts to say that the host is DOWN (which it isn't, it's CRITICAL).

My test disk_check config file is located at the link below; I thought it best to provide the whole lot rather than snippets.

http://dl.dropbox.com/u/895609/disk_space_test.cfg

Can anybody explain why I don't get any service alerts when the host goes CRITICAL, and how I can get them without having host alerts enabled, as I don't want those for my disk check commands?

Thank you in advance
[Nagios-users] Notification Anomaly
Hi all,

I have a strange notification problem that I can't seem to get to the bottom of.

In a nutshell, I am using escalations to limit the number of notifications for any given host to 5.

Host down notification 1 comes in the form of email and text.
Notifications 2-5, if still down, come via email.
If it recovers between 2-5 it will email and text.

The email works fine in all scenarios, but I have a text message issue: it doesn't send out a text message if notification 2 is a recovery! Nagios doesn't even seem to run the notify-by-SMS command! Any other notification number is fine, just not notification number 2 being a recovery text!

I know the above is a bit garbled, so my notification escalations below may make it a bit clearer.

    # HOST ESCALATIONS
    #
    # Use escalations to limit the amount of email and text message alerts that
    # are recieved for hosts in a problem state

    # OFFICE SERVERS

    define hostescalation{
        host_name               HOST
        first_notification      1
        last_notification       4
        notification_interval   15
        contact_groups          servers email
        escalation_options      d,r
        }

    define hostescalation{
        host_name               HOST
        first_notification      5
        last_notification       0
        notification_interval   0
        contact_groups          servers email
        escalation_options      d,r
        }

    define hostescalation{
        host_name               HOST
        first_notification      1
        last_notification       1
        notification_interval   15
        contact_groups          servers sms
        escalation_options      d
        }

    define hostescalation{
        host_name               HOST
        first_notification      2
        last_notification       4
        notification_interval   15
        contact_groups          servers sms
        escalation_options      r
        }

    define hostescalation{
        host_name               HOST
        first_notification      5
        last_notification       0
        notification_interval   0
        contact_groups          servers sms
        escalation_options      r
        }

    # SERVICE ESCALATIONS
    #
    # Use escalations to limit the amount of email and text message alerts that
    # are recieved for services in a problem state

    # OFFICE SERVERS

    define serviceescalation{
        host_name               HOST
        service_description     PING
        first_notification      1
        last_notification       4
        notification_interval   15
[Nagios-users] Scheduled downtime for mass hosts?
Hi all,

Tonight I will be performing maintenance on over 50 of my servers and will be taking firewall and routing links out. This will affect 86 hosts.

I'm going to put them into scheduled downtime in Nagios. I have my hosts divided into hostgroups.

Is there a quick way to schedule downtime for all 86 hosts, rather than having to click on each host in the web GUI and do them individually?

Thanks
[Nagios-users] Scheduled downtime for mass hosts?
Ah Benny, I didn't know you could schedule maintenance for a whole host group.

I have my 86 hosts arranged into 4 hostgroups, so I will just do this. 4 clicks and job done; I thought I was in for the long haul clicking all 86 hosts 1 by 1.

Thanks, you have saved me a load of time!

From: Andrew Thompson
Sent: 12 January 2012 12:14
To: nagios-users@lists.sourceforge.net
Subject: Scheduled downtime for mass hosts?

Hi all,

Tonight I will be performing maintenance on over 50 of my servers and will be taking firewall and routing links out. This will affect 86 hosts.

I'm going to put them into scheduled downtime in Nagios. I have my hosts divided into hostgroups.

Is there a quick way to schedule downtime for all 86 hosts, rather than having to click on each host in the web GUI and do them individually?

Thanks
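The same per-hostgroup downtime can also be scripted rather than clicked. The sketch below only builds the text of a SCHEDULE_HOSTGROUP_HOST_DOWNTIME external command, one per hostgroup; it assumes external commands are enabled, and the hostgroup names, author and command-file path shown are placeholders that depend on the local install.

```python
import time


def hostgroup_downtime_command(hostgroup, start, end, author, comment, now=None):
    """Build one external-command line scheduling fixed downtime for every
    host in a hostgroup. start/end are Unix timestamps."""
    now = int(time.time()) if now is None else now
    fixed = 1        # fixed downtime: runs exactly from start to end
    trigger_id = 0   # not triggered by another downtime entry
    duration = end - start
    return "[%d] SCHEDULE_HOSTGROUP_HOST_DOWNTIME;%s;%d;%d;%d;%d;%d;%s;%s\n" % (
        now, hostgroup, start, end, fixed, trigger_id, duration, author, comment)


def submit(lines, command_file):
    """Append command lines to the Nagios external command file (a named pipe)."""
    with open(command_file, "w") as pipe:
        pipe.writelines(lines)


# Hypothetical hostgroup names; a four-hour window starting now.
start = int(time.time())
end = start + 4 * 3600
commands = [hostgroup_downtime_command(group, start, end,
                                       "nagiosadmin", "Network maintenance")
            for group in ("group1", "group2", "group3", "group4")]
print("".join(commands))

# On a source install the pipe is often /usr/local/nagios/var/rw/nagios.cmd
# (an assumption; check command_file in nagios.cfg before using it):
# submit(commands, "/usr/local/nagios/var/rw/nagios.cmd")
```

This schedules downtime for the hosts only; services on those hosts have a separate external command.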
[Nagios-users] Gammu SMS Daemon writes Nagios text alerts to MYSQL but they dont get sent out to mobile!
Hi all,

I know this is a Nagios support list, but my question is more about a Gammu, MySQL and Nagios combination.

Software:

    Gammu 1.28
    MySQL Server
    Ubuntu Desktop 11.04 32bit
    Nokia 6021 via USB cable
    Nagios 3.3.1

I have got everything working but 1 thing.

Nagios uses:

    /usr/bin/printf ***NAGIOS ALERT*** \n\n $NOTIFICATIONTYPE$\n\n$HOSTALIAS$ is $HOSTSTATE$ | /usr/bin/gammu-smsd-inject TEXT $CONTACTPAGER$

Nagios writes the text alerts to the Outbox table of the database as it should. But that's where they stay; the daemon doesn't forward them out of the DB to the recipient's phone!

If I change the command line a little and tell Nagios to use this:

    /usr/bin/printf ***NAGIOS ALERT*** \n\n $NOTIFICATIONTYPE$\n\n$HOSTALIAS$ is $HOSTSTATE$ | /usr/bin/gammu --sendsms TEXT $CONTACTPAGER$

the phone sends any Nagios text alerts straight to the correct mobile phones, so I know sending through the phone works fine.

These are my relevant Gammu setups.

I have a smsd.conf file in the /etc/init directory to try and keep the daemon alive:

    # smsd
    #
    # This service maintains the SMS Daemon from the point the system is started until it is shut down again
    description SMSD DAEMON
    start on runlevel [2345]
    stop on runlevel [016]
    respawn
    exec /usr/local/bin/smsd.sh

Script in /usr/local/bin/smsd.sh:

    #! /bin/sh
    export LANG=en_US
    /usr/bin/gammu-smsd MYSQL /etc/gammu-smsdrc

My gammu-smsdrc content:

    [gammu]
    Port = /dev/ttyUSB0
    Connection = fbus
    Model = 6021

    [smsd]
    Service = MYSQL
    Commtimeout = 10
    Sendtimeout = 30
    Recievefrequency = 30
    Deliveryreport = log
    User = root
    Password = xx
    Pc = 127.0.0.1
    Database = Nagios_Text

If I run /sbin/initctl start smsd it replies with:

    smsd start/running, process

Then if straight away I run /sbin/initctl status smsd it replies with:

    smsd stop/waiting

Is this correct, or is this the reason the messages never leave the DB?

Can somebody offer me any assistance on getting those messages out of the DB? I'm so close, yet so far!! I need to present my boss with a fully working Nagios setup to go live by Wed 4 Jan and I can nearly touch it.

Please help.

Thank you so much
[Nagios-users] Anybody help setting up Anag on an Android phone?
Trying to get the aNag app set up on my Samsung Galaxy.

However, when I add my Nagios instance it always throws back the same error:

    HttpResponseException: Not Found (time and date)

I can access my Nagios web interface over the internet by going to http://nagios.company.co.uk/nagios just fine. I pop that into aNag and it refuses it with the above error.

I have tried:

    http://nagios.company.co.uk/nagios
    http://nagios.company.co.uk/nagios/cgi-bin (which the example says to use)
    http://LANip/nagios
    http://LANip/nagios/cgi-bin

(LANip: the phone is connected to the same LAN as the Nagios box.)

None of them work.

If I try to go to http://nagios.company.co.uk/nagios/cgi-bin in a web browser directly I get:

    Forbidden - You don't have permission to access /nagios/cgi-bin/ on this server.

I can use the Firefox add-on found here just fine, so my server must be accepting connections and set up right?

https://addons.mozilla.org/en-US/firefox/addon/nagios-checker/

Has anybody got this successfully set up on their phone and can offer me any assistance please?

Thank you
Re: [Nagios-users] Anybody help setting up Anag on an Android phone?
Thanks for the suggestion Seon,

It's all sorted now - my phone automatically changes "nagios" in the URL to a capital N. Changed it to lower case and my phone now checks in with Nagios.

Simple things huh.. :)

Kind Regards
Andrew
[Nagios-users] Is there a plugin available that will log into a website and navigate through it?
Hi all,

Is there a plugin out there in the wild that will log in to a website with a provided user/pass, navigate its way around the site clicking a few links, and then log back out again? I've got a site that I'd like to trawl to check its availability.

www.site.com takes you to the login screen, which asks for user/pass and has a login button.
www.site.com/mainmenu.aspx takes you to the main menu, where you can click links to navigate around the site, including a log out link.

I'd like the plugin/script to input the user/pass and hit the login button, then click 2 links and return to the main menu, where it will hit the log out link. In the order of:

www.site.com
www.site.com/mainmenu.aspx
www.site.com/link1.aspx
www.site.com/mainmenu.aspx
www.site.com/link2.aspx
www.site.com/mainmenu.aspx
www.site.com/logout.aspx

Is anything available to do this?

Cheers
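If no ready-made plugin fits, the walk described above can be scripted. Below is a rough Python sketch of a custom check, not an existing plugin. It assumes the login is an ordinary HTML form POST, that hidden form fields have to be echoed back (common for .aspx pages), and that a page counts as "up" when it answers HTTP 200. The form field names txtUser, txtPassword and btnLogin are pure guesses and would have to be read from the real login page source.

```python
import urllib.parse
import urllib.request
from html.parser import HTMLParser
from http.cookiejar import CookieJar

OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3  # standard Nagios plugin exit codes

# Page order taken from the post, relative to the site root.
PAGES = ["mainmenu.aspx", "link1.aspx", "mainmenu.aspx",
         "link2.aspx", "mainmenu.aspx", "logout.aspx"]


class HiddenFields(HTMLParser):
    """Collect the name/value pairs of <input type="hidden"> elements."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "input" and (a.get("type") or "").lower() == "hidden" and a.get("name"):
            self.fields[a["name"]] = a.get("value") or ""


def make_fetch():
    """Return fetch(url, data=None) -> (status, body) that keeps session cookies."""
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(CookieJar()))

    def fetch(url, data=None):
        body = urllib.parse.urlencode(data).encode() if data else None
        with opener.open(url, body, timeout=20) as resp:
            return resp.status, resp.read().decode("utf-8", "replace")

    return fetch


def walk(fetch, base, user, password, pages):
    """Log in at base, request each page in order, return (exit_code, message)."""
    try:
        _, html = fetch(base)
        parser = HiddenFields()
        parser.feed(html)
        form = dict(parser.fields)
        # Hypothetical field names: replace with the ones the real form uses.
        form.update({"txtUser": user, "txtPassword": password, "btnLogin": "Login"})
        status, _ = fetch(base, form)
        if status != 200:
            return CRITICAL, "CRITICAL - login returned HTTP %d" % status
        for page in pages:
            status, _ = fetch(urllib.parse.urljoin(base, page))
            if status != 200:
                return CRITICAL, "CRITICAL - %s returned HTTP %d" % (page, status)
    except OSError as exc:  # urllib's URLError and HTTPError derive from OSError
        return CRITICAL, "CRITICAL - request failed: %s" % exc
    return OK, "OK - login and %d pages succeeded" % len(pages)


def main(argv):
    """argv: script name, base URL, user, password. Returns the plugin exit code."""
    if len(argv) != 4:
        print("UNKNOWN - usage: check_site_walk.py BASE_URL USER PASSWORD")
        return UNKNOWN
    code, message = walk(make_fetch(), argv[1], argv[2], argv[3], PAGES)
    print(message)
    return code
```

To run it as a plugin, the script would end with `import sys` and `sys.exit(main(sys.argv))`. A failed login often still returns HTTP 200, so a sturdier check would also look for some text that only appears on the main menu. The fetch function is passed in so the walk logic can be tried against a fake site first.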
[Nagios-users] Nagios Core and SMS alerts (or lack of) through Nagios
Hi all,

I'm at my wits' end. I've spent 2 days now trying to get Nagios to SMS me through gnokii, but a lunar eclipse is a more frequent event!

I have Ubuntu 11.04 installed, using postfix to email me alerts (works flawlessly). Now I also want a text message as a failover in case my Exchange server or internet line goes down.

I have installed and configured gnokii to what I think is a functioning level. I have a Nokia 6301 mobile hooked up to Ubuntu via USB cable. I can send a text message out through the phone from the CLI with no issue at all, both when logged in as my admin user and when logged in as the nagios user, so I know I have the correct permissions on the ttyACM0 port that is assigned to the 6301.

Will Nagios send an SMS - will it f*£$!!

Here is the relevant part of my command.cfg:

    # 'notify-host-by-sms' command definition
    define command{
        command_name    notify-host-by-sms
        command_line    /usr/bin/printf %.120s * Nagios * $NOTIFICATIONTYPE$: Host $HOSTALIAS$ is $HOSTSTATE$ | /usr/bin/gnokii --sendsms $CONTACTPAGER$
        }

    # 'notify-service-by-sms' command definition
    define command{
        command_name    notify-service-by-sms
        command_line    /usr/bin/printf %.120 * Nagios * $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ | /usr/bin/gnokii --sendsms $CONTACTPAGER$
        }

Here is my contact.cfg:

    define contact{
        contact_name                    MYNAME
        use                             generic-contact
        alias                           MYNAME
        host_notification_options       d,u,r,f
        service_notification_options    w,u,c,r,f
        email                           MYEMAIL
        pager                           +44X
        host_notification_period        24x7
        service_notification_period     24x7
        host_notification_commands      notify-host-by-email,notify-host-by-sms
        service_notification_commands   notify-service-by-email,notify-service-by-sms

Here is my templates.cfg:

    define contact{
        name                            generic-contact
        service_notification_period     24x7
        host_notification_period        24x7
        service_notification_options    w,u,c,r,f,s
        host_notification_options       d,u,r,f,s
        service_notification_commands   notify-service-by-email, notify-service-by-sms
        host_notification_commands      notify-host-by-email, notify-host-by-sms
        register                        0
        }

What can be going on? Why will Nagios not SMS me? It's got to be a Nagios config issue and not a gnokii error, as I can send texts through the CLI fine.

Please help me, I'm going insane.

Thanks

Kind Regards
Andrew
T: 01386 834000 F: 01386 834100
and...@fulgent.co.uk
www.fulgent.co.uk
Fulgent Technologies Limited, Haddonsacre, Station Road, Offenham, Evesham, WR11 8JJ.
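Two details in the command_line entries as pasted look worth checking, though the archive may simply have eaten some quoting. The message text is not quoted, so the shell would expand each bare `*` to file names and printf would apply the format to every word separately. The service command's format is `%.120`, with no `s` conversion after it. One way to sidestep shell quoting altogether is to hand the macros to a small wrapper script. Below is a minimal Python sketch of such a wrapper; the script name, argument order and function names are assumptions, while the gnokii path and the 120-character limit are taken from the post.

```python
import subprocess

GNOKII = "/usr/bin/gnokii"  # path as used in the posted command definitions
MAX_LEN = 120               # same limit as the %.120s format in the post


def build_message(kind, alias, state, service=None):
    """Compose the one-line alert text and cut it to MAX_LEN characters."""
    subject = "%s/%s" % (alias, service) if service else "Host %s" % alias
    text = "* Nagios * %s: %s is %s" % (kind, subject, state)
    return text[:MAX_LEN]


def send_sms(number, text, runner=subprocess.run):
    """Feed text to `gnokii --sendsms number` on stdin; return its exit status."""
    result = runner([GNOKII, "--sendsms", number],
                    input=text.encode(), timeout=60)
    return result.returncode


def main(argv):
    """argv: script, pager number, notification type, alias, state, [service]."""
    if len(argv) not in (5, 6):
        print("usage: notify_sms.py NUMBER TYPE ALIAS STATE [SERVICE]")
        return 2
    number, kind, alias, state = argv[1:5]
    service = argv[5] if len(argv) == 6 else None
    return send_sms(number, build_message(kind, alias, state, service))
```

With `import sys` and `sys.exit(main(sys.argv))` added at the end, the host command_line could then read something like `/usr/local/bin/notify_sms.py "$CONTACTPAGER$" "$NOTIFICATIONTYPE$" "$HOSTALIAS$" "$HOSTSTATE$"` (the install path is hypothetical). Passing the command as a list means no shell is involved, so the asterisks and spaces in the message reach gnokii unchanged. The contact definition as pasted also has no closing brace, which would be worth ruling out in the real file.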