Re: [Nagios-users-br] Problems sending e-mail.
Dear Heitor,

I configured exim and the problem is the same. I noticed the following. When Nagios sends the message automatically, the exim mail log looks like this:

  2010-01-18 16:41:51 1NWwXv-0008V8-4b = nag...@xxx U=nagios P=local S=564
  2010-01-18 16:41:51 1NWwXv-0008V8-4b = $...@x R=smarthost T=remote_smtp_smarthost H=*
  2010-01-18 16:41:51 1NWwXv-0008V8-4b Completed

When I force the notification from Nagios itself (in which case I do receive the e-mail), it looks like this:

  2010-01-18 16:43:37 1NWwZd-5p-Ee = nag...@xxx U=nagios P=local S=586
  2010-01-18 16:43:37 1NWwZd-5p-Ee = minhaco...@x R=hub_user_smarthost T=remote_smtp_smarthost H=***
  2010-01-18 16:43:37 1NWwZd-5p-Ee Completed

The only differences I found are highlighted.

2010/1/15 Heitor Felipe Lessa heitor.le...@hotmail.com wrote:

> Hello my friend,
>
> The files are correct. How are the hosts or services defined?
>
> Exim is quite simple. To help you, here is the configuration file; you only need to edit it. It lives in the exim4 directory: /etc/exim4/update-exim4.conf.conf. As configured here, it points to a relay that sends the mail without authentication, which is the recommended setup.
>
>   dc_eximconfig_configtype='satellite'
>   dc_other_hostnames='E-mail domain, for example teste.com.br'
>   dc_local_interfaces='IP of the Nagios server'
>   dc_readhost='Same value as dc_other_hostnames'
>   dc_relay_domains=''
>   dc_minimaldns='false'
>   dc_relay_nets=''
>   dc_smarthost='The smarthost is the server that will do the relaying, for example 200.174.51.133'
>   CFILEMODE='644'
>   dc_use_split_config='false'
>   dc_hide_mailname='true'
>   dc_mailname_in_oh='true'
>   dc_localdelivery='mail_spool'
>
> As you can see, it is quite simple. I installed it on Debian, but the conf is the same on other distros. Post any questions you have. Always remember to watch the log to see how things are going, with # tail -f
>
> Best regards,
> Heitor Lessa
>
> From: paulo...@gmail.com
> Date: Fri, 15 Jan 2010 10:54:22 -0200
> To: nagios-users-br@lists.sourceforge.net
> Subject: Re: [Nagios-users-br] Problems sending e-mail.
>> Dear Heitor,
>>
>> Below are the settings from all of the template files. As a test, though, I also configured a test host without using any template and specified every setting directly. I will look for information on setting up exim. Thanks for the help.
>>
>> File contacttemplates.cfg:
>>
>>   define contact {
>>       name                            generic-contact
>>       contactgroups                   admins
>>       host_notification_period        24x7
>>       service_notification_period     24x7
>>       host_notification_options       d,u,r,f,s
>>       service_notification_options    w,u,c,r,f,s
>>       host_notification_commands      notify-host-by-email
>>       service_notification_commands   notify-service-by-email
>>       register                        0
>>   }
>>
>> File hosttemplate.cfg:
>>
>>   define host {
>>       name                            generic-host
>>       max_check_attempts              3
>>       check_interval                  5
>>       retry_interval                  1
>>       check_period                    24x7
>>       event_handler                   notify-host-by-email
>>       event_handler_enabled           1
>>       flap_detection_enabled          1
>>       process_perf_data               1
>>       retain_status_information       1
>>       retain_nonstatus_information    1
>>       contact_groups                  admins
>>       notification_interval           0
>>       notification_period             24x7
>>       first_notification_delay        0
>>       notification_options            d,u,r,f,s
>>       notifications_enabled           1
>>       stalking_options                o,d,u
>>       failure_prediction_enabled      1
>>       register                        0
>>   }
>>
>> File servicetemplate.cfg:
>>
>>   define service {
>>       name                            generic-service
>>       is_volatile                     0
>>       max_check_attempts              3
>>       check_interval                  10
>>       retry_interval                  2
>>       active_checks_enabled           1
>>       passive_checks_enabled          1
>>       check_period                    24x7
>>       obsess_over_service             1
>>       check_freshness                 0
>>       event_handler                   notify-service-by-email
>>       event_handler_enabled           1
>>       flap_detection_enabled          1
>>       process_perf_data               1
>>       retain_status_information       1
>>       retain_nonstatus_information    1
>>       notification_interval           0
Re: [Nagios-users-br] Problems sending e-mail.
Dear all,

I believe I have identified the source of the problem. The $CONTACTEMAIL$ macro (http://nagios.sourceforge.net/docs/2_0/macros.html#contactemail) is not being expanded to the e-mail addresses defined in contacts.cfg. Has anyone run into this problem and knows how to solve it?
[Nagios-users] first_notification_delay problem
Hi all,

Can someone please explain to me how first_notification_delay works? I set up one host with one service for testing. I set first_notification_delay to 5 for the host and to 10 for the service (http), so I expected to get a notification 10 minutes after stopping the http service. But I always get the first notification after 5 minutes, no matter what I set in the config file. Then I changed the host delay to 3, because I thought that was the problem, but after this change and a Nagios restart I again got the notification after 5 minutes. That's very strange to me. I don't know what to check or how to make it work. Is there any option I should check?

Thank you
Martin

--
Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Multiple NRPE Processes
On Jan 19, 2010, at 1:37 AM, Juki wrote:

> nag...@pms # netstat -a | grep nrpe
> *.nrpe *.* 0 0 49152 0 LISTEN
> pms.nrpe h1de4.n2 56147 5888 0 50540 0 TIME_WAIT
> pms.nrpe h1de4.n2 56148 5888 0 50540 0 TIME_WAIT
> pms.nrpe h1de4.n2 56149 5888 0 50540 0 TIME_WAIT
>
> I have tried to *kill* them (using kill command) but that doesn't seem to work because they just don't go away - weird right? :) I would like to kill all the pms.nrpe processes..

There's nothing to kill and these entries are appropriate... Welcome to the world of TCP and a greater understanding of how it all _really_ works.

http://www.developerweb.net/forum/showthread.php?t=2941

How long are they sticking around? What OS are you using?

--
Marc
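The entries in that listing are connection states tracked by the kernel rather than processes, which is why `kill` has nothing to act on. As a minimal sketch (not part of the original thread), the following Python tallies the state column of `netstat` output for one port name. The function name `count_states` and the assumption that the state is the last whitespace-separated field are illustrative choices based only on the sample pasted above.

```python
from collections import Counter

# Sample taken from the post above; the column layout is assumed from it.
SAMPLE = """\
*.nrpe *.* 0 0 49152 0 LISTEN
pms.nrpe h1de4.n2 56147 5888 0 50540 0 TIME_WAIT
pms.nrpe h1de4.n2 56148 5888 0 50540 0 TIME_WAIT
pms.nrpe h1de4.n2 56149 5888 0 50540 0 TIME_WAIT
"""


def count_states(netstat_output: str, port_name: str = "nrpe") -> Counter:
    """Count TCP states for lines whose local address mentions port_name.

    Assumes the local address is the first field and the state the last one.
    """
    counts = Counter()
    for line in netstat_output.splitlines():
        fields = line.split()
        if len(fields) < 2 or port_name not in fields[0]:
            continue
        counts[fields[-1]] += 1
    return counts


if __name__ == "__main__":
    for state, n in sorted(count_states(SAMPLE).items()):
        print(f"{state}: {n}")
```

Running the tally a few times, a minute or so apart, would show whether the same TIME_WAIT entries persist or whether new ones simply keep replacing expired ones.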
[Nagios-users] Filter status.cgi view in 1.3
Hi,

We're running version 1.3 of Nagios. I'd like the status.cgi view to show only a couple of the 1000+ hosts we monitor. I know I can add the name of a host as a parameter, so it becomes something like status.cgi?host=MyHostName, but can I also use this technique to select several hosts? Any other ideas for getting a page with just a few of the hosts?

Richard
Re: [Nagios-users] Filter status.cgi view in 1.3
On Jan 19, 2010, at 8:25 AM, Richard Hallgren wrote:

> Hi, We're running version 1.3 of Nagios.

Much of the rest of the world is running 3.x. I strongly encourage you to upgrade sooner rather than later. The conversion from 1.x to 2.x+ requires configuration changes, and the tribal memory of what exactly needs to be done and how best to do it, as well as the chance of finding third-party conversion scripts, shrinks the longer you wait. Additionally, fewer and fewer people on this list have actual experience with 1.x... You're already somewhere in the neighborhood of 5 years out-of-date.

> I'd like to have the status.cgi view to only show a couple of the 1000+ host we have monitored. I know I can add the name of the host as a parameter so it becomes something like "status.cgi?host=MyHostName" but can I also use this technique to filter out a number of hosts?

No.

> Other ideas to get a page with just a few of all the hosts?

You can do this with authentication enabled. Create groups of the limited-view hosts, assign unique contacts to them, create auth usernames that match those contacts and log in with them. By default Nagios will only show hosts and services that an authenticated user is a contact for.

--
Marc
Re: [Nagios-users] Multiple NRPE Processes
Hi Marc,

2010/1/19 Marc Powell m...@ena.com wrote:

> There's nothing to kill and these entries are appropriate... Welcome to the world of TCP and a greater understanding of how it all _really_ works.
>
> http://www.developerweb.net/forum/showthread.php?t=2941

I will have a look at this.

> How long are they sticking around? What OS are you using?

I'm running Solaris 10. They are more or less sticking around indefinitely.
Re: [Nagios-users] Warnings/errors with check_openmanage plugin
Gianluca Varenni gianluca.vare...@gmail.com writes:

> Here they are (embedded below)

Thanks. This really puzzled me, because snmpwalk returned all the output needed, while my plugin didn't seem to fetch the needed values. Fortunately, I was able to reproduce this on a local 2650, which makes debugging much easier.

It turns out that when using Net::SNMP::get_entries() to fetch the values, there will sometimes be whitespace in the OID field of the data returned, while Net::SNMP::get_table() is OK. I really want to use get_entries(), which is significantly faster on servers with lots of physical disks. I have patched check_openmanage so that any excess whitespace is removed from the OID field before it is analyzed, so get_entries() can still be used. A beta version is available (check_openmanage-3.5.5-beta4) here:

http://folk.uio.no/trondham/tmp/

This beta also addresses the controller identification for physical drives, where the controller id was reported as '-1'. On older hardware such as the 2650 and 750, the controller to which a physical disk is attached is not available as a connection to the enclosure, because no enclosure exists. The plugin will now probe for this information via the channel connection as well, which provides this info. This only means that I probe one additional OID.

Please try this beta version and report your results.

Cheers,
--
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo
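The whitespace fix described above amounts to normalizing the OID keys of the result before looking anything up. The plugin itself is Perl and its actual patch is not shown in this thread, so the following is only a Python sketch of the idea; the function name `normalize_oids` and the sample OIDs and values are made up for illustration.

```python
def normalize_oids(result: dict) -> dict:
    """Return a copy of an SNMP result with all whitespace removed from OID keys."""
    return {"".join(oid.split()): value for oid, value in result.items()}


if __name__ == "__main__":
    # Hypothetical result in which some keys carry stray whitespace.
    raw = {
        "1.3.6.1.4.1.674.1.1 ": "disk 0",
        " 1.3.6.1.4.1.674.1.2": "disk 1",
        "1.3.6.1.4.1.674.1.3": "disk 2",
    }
    clean = normalize_oids(raw)
    print(clean["1.3.6.1.4.1.674.1.1"])
```

Normalizing once, right after the fetch, keeps the later lookups unchanged regardless of which fetch method produced the data.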
[Nagios-users] Notification question
I have a question about the notifications sent by Nagios. I found out that if you set the notification interval to 0, you only get a mail when the service enters one of the states you defined. I'll explain my question with an example.

We monitor the disk space of several servers. The warning and critical thresholds are set to 90 and 95. As soon as the service goes into warning because usage reaches 90%, we get an e-mail with a warning status (so far so good). If usage increases to 91%, we don't get another e-mail, because the state doesn't change (it's still warning).

We now want Nagios to send an e-mail as soon as the measured value changes (from 90 to 91). We don't want an e-mail as long as the value stays at 90%, since we have already been notified about that.

Does anybody know a solution for this?

Steven Battaille
[Nagios-users] Variables in .cfg files -- Howto?
How can I use variables in my .cfg files that are only used in that file, and not across my entire install?
Re: [Nagios-users] Multiple NRPE Processes
On Jan 19, 2010, at 9:22 AM, Juki wrote:

>> How long are they sticking around? What OS are you using?
>
> I'm running Solaris 10. They are more less sticking around indefinitely.

60 seconds seems to be the default/recommended setting on Solaris, but it can be as high as 10 minutes if the OS thinks it needs to be (http://docs.sun.com/app/docs/doc/817-0404/chapter4-51?a=view). I wouldn't recommend tuning this unless you know exactly what you are doing and why; strangeness may result. Perhaps you've tuned it already and this is the resulting strangeness...

You're going to have better luck understanding this by asking about it on a Solaris support list, as what you're asking about is a general TCP stack question, not specific to Nagios.

--
Marc
Re: [Nagios-users] Variables in .cfg files -- Howto?
Mr Gabriel wrote:

> How can I use variables in my .cfg files that are only used in that file, and not across my entire install?

You don't really. Nagios doesn't keep track of what lives in which file; it just loads everything up and parses the whole thing afterward. If you really need per-file variables, you'll probably have to do it before loading them into Nagios with some sort of external pre-parser, but whether that'll work for you or not is hard to say since it's not clear what you're trying to accomplish.
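To make the "external pre-parser" idea concrete, here is a minimal Python sketch. Everything about the syntax is an assumption invented for this example and is not a Nagios feature: variables are declared in comment lines of the form `# @var NAME = value` (which Nagios would ignore as comments) and referenced as `@@NAME@@`. The script expands one file's text using only the variables declared earlier in that same text, so the scope stays per-file.

```python
import re

# Hypothetical conventions, not Nagios syntax:
#   "# @var NAME = value" declares a file-local variable
#   "@@NAME@@" references it
VAR_LINE = re.compile(r"^#\s*@var\s+(\w+)\s*=\s*(.*?)\s*$")
PLACEHOLDER = re.compile(r"@@(\w+)@@")


def expand_file_variables(text: str) -> str:
    """Expand @@NAME@@ placeholders using declarations from the same text.

    Unknown placeholders are left untouched so mistakes stay visible.
    """
    variables = {}
    out = []
    for line in text.splitlines():
        decl = VAR_LINE.match(line)
        if decl:
            variables[decl.group(1)] = decl.group(2)
            out.append(line)  # keep the declaration; it is only a comment
            continue
        out.append(
            PLACEHOLDER.sub(lambda m: variables.get(m.group(1), m.group(0)), line)
        )
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    source = (
        "# @var SITE = lab\n"
        "define host {\n"
        "    host_name web-@@SITE@@\n"
        "}\n"
    )
    print(expand_file_variables(source), end="")
```

In practice such a script would be run over each source file to produce the .cfg files Nagios actually loads, followed by the usual configuration check before a restart.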
Re: [Nagios-users] Notification question
Steven Battaille wrote:

> I'm having a question about the notifications that are being sent by nagios. I found out that if you put the notification interval at 0 you only get a mail when the service goes into the type you defined. I'll explain my question with an example. We monitor the disk space of several servers. The threshold for the warning and critical are set to 90 and 95. As soon as the service goes into warning because the threshold is at 90% we get an email with a warning status (so far so good) If the threshold increases to 91% we don't get an email anymore because the state doesn't change. (it's still warning). We now want Nagios to send an email that as soon as the threshold value has changed we get an email. (from 90 to 91) We don't want to get an email as long as the threshold stays at 90%(we already got this by mail) Does somebody knows a solution for this?

You'll probably want to *not* set your notification interval to zero, and come up with a custom notification command to do what you want (i.e., check if that value has changed, and don't send anything if it hasn't).

Nagios generally doesn't care about plugin output; it just looks at the result code. No result code change means, as far as Nagios is concerned, nothing has changed. One warning's no different from any other as far as the notification logic goes.
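A custom notification command of the kind suggested above could be sketched roughly as follows in Python. This is an illustration under several assumptions, not a tested Nagios integration: the command is assumed to receive the plugin output (for example via the $SERVICEOUTPUT$ macro) plus some key identifying the host and service, the first percentage found in the output is taken as "the value", and the last value seen per key is kept in a small JSON file. The names `should_notify` and the state-file layout are hypothetical.

```python
import json
import re
import tempfile
from pathlib import Path

PERCENT = re.compile(r"(\d+(?:\.\d+)?)\s*%")


def should_notify(key: str, plugin_output: str, state_file: Path) -> bool:
    """Return True only if the percentage in plugin_output differs from the
    value recorded for this key the last time a notification was allowed."""
    match = PERCENT.search(plugin_output)
    if match is None:
        return True  # nothing to compare, so let the notification through
    value = float(match.group(1))
    seen = json.loads(state_file.read_text()) if state_file.exists() else {}
    if seen.get(key) == value:
        return False
    seen[key] = value
    state_file.write_text(json.dumps(seen))
    return True


if __name__ == "__main__":
    state = Path(tempfile.mkdtemp()) / "last_values.json"
    for output in ("DISK WARNING - 90% used",
                   "DISK WARNING - 90% used",
                   "DISK WARNING - 91% used"):
        print(output, "->", should_notify("srv1/disk", output, state))
```

The actual mail would only be sent when the function returns True. A real script would also need file locking if several notifications can run at once, and a way to clear the stored value when the service recovers.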
[Nagios-users] check_openmanage 3.5.4 status bug
Greetings,

The status hash that is used to translate the voltageProbeStatus appears to be incorrect. Instead of %snmp_status, the hash used should be %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. The corrected line reads:

  $status = $snmp_probestatus{$out->{voltageProbeStatus}};

I discovered this issue because one of my Dells has an almost dead battery, but the check returned:

  OOPS! Something is wrong with this server, but I don't know what. The global system health status is CRITICAL, but every component check is OK. This may be a bug in the Nagios plugin, please file a bug report.

--
Ken McKinlay, CISSP, GCIA, GAWN, GSNA, GPEN
Network Security
Curtiss Wright Controls Embedded Computing
Re: [Nagios-users] check_openmanage 3.5.4 status bug
McKinlay, Ken ken.mckin...@curtisswright.com writes:

> The status hash that is used to translate the voltageProbeStatus appears to be incorrect. Instead of %snmp_status, the hash used should be %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. The corrected line reads:
>
>   $status = $snmp_probestatus{$out->{voltageProbeStatus}};
>
> I discovered this issue since one of my Dells has an almost dead battery but the check returned
>
>   OOPS! Something is wrong with this server, but I don't know what. The global system health status is CRITICAL, but every component check is OK. This may be a bug in the Nagios plugin, please file a bug report.

Hi Ken,

Thanks for the bug report and the patch. I guess that this was never discovered before, since voltage errors occur rarely. Also good to know that the little failsafe "oops" works as intended :)

Your patch will be included in the next release, and a patched beta version is available here:

http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta6

Cheers,
--
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo
Re: [Nagios-users] Notification question
On Jan 19, 2010, at 10:15 AM, Steven Battaille wrote:

> If the threshold increases to 91% we don't get an email anymore because the state doesn't change. (it's still warning). We now want Nagios to send an email that as soon as the threshold value has changed we get an email. (from 90 to 91) We don't want to get an email as long as the threshold stays at 90%(we already got this by mail) Does somebody knows a solution for this?

Nagios doesn't work like this. It does not know or care that it changed from 90 to 91, so it can't do anything with it. This speaks to the core of how Nagios sees the world.

<education>

When Nagios runs a plugin to check something, it will receive up to 3 things back from the plugin:

- a numeric exit code that corresponds to the current state (OK, WARNING, CRITICAL, UNKNOWN)
- some text comprised of human readable output[|optional performance data]

Nagios only really cares about the state; everything else is just fluff. State changes are the only things that trigger notifications*, unless you have 'is_volatile' set. If the plugin returns WARNING for 90 and WARNING for 91, Nagios only sees WARNING for both, so the state hasn't changed and a new notification isn't necessary. If you set is_volatile, Nagios will send a notification every time the state is not OK (90, 91, 90, 90, 90 would all generate notifications).

The part that you're interested in doing something with is in the human readable output. Nagios does nothing with this. It can be any text in any format, depending on what the plugin thinks is useful to you and what you've asked it to check. Since there's no possible way to know and parse an effectively infinite variety of plugin output content and format, Nagios does none of it. It just makes it available in the GUI and as a macro for you to do whatever you want with it on your own. Typically it's just sent to the notification command for inclusion in the notification.

</education>

That being said, what you could do is add some logic to your notification command for this case. Your notification command can be any script at all. The default is just a simple mail command, but it could be something very complex. If you write your own, it would need to do things like keep track of the prior state and only send a notification if your criteria were met.

--
Marc

*outside of re-notifications due to notification_interval or escalations...
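The plugin contract described above (an exit code for the state, then text with optional performance data after a `|`) can be illustrated with a short Python sketch. It is not how Nagios is implemented; the function name `interpret` and the sample output strings are made up for the example. The exit-code mapping 0/1/2/3 to OK/WARNING/CRITICAL/UNKNOWN is the standard plugin convention.

```python
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def interpret(exit_code: int, stdout: str):
    """Split a plugin result into (state, text, perfdata).

    Only the first output line is considered; perfdata is whatever follows
    the first '|' on that line, or an empty string if there is none.
    """
    lines = stdout.splitlines()
    first_line = lines[0] if lines else ""
    text, _, perfdata = first_line.partition("|")
    return STATES.get(exit_code, "UNKNOWN"), text.strip(), perfdata.strip()


if __name__ == "__main__":
    at_90 = interpret(1, "DISK WARNING - 90% used|/=90%;90;95\n")
    at_91 = interpret(1, "DISK WARNING - 91% used|/=91%;90;95\n")
    # The text differs, but the state, which drives notifications, does not.
    print(at_90[0], at_91[0], at_90[1] != at_91[1])
```

Seen this way, the 90 and 91 readings differ only in the text and performance data, which is exactly the part the notification logic never inspects.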
Re: [Nagios-users] Warnings/errors with check_openmanage plugin
This version seems to work ok now. Thanks!

Have a nice day
GV

--
From: Trond Hasle Amundsen t.h.amund...@usit.uio.no
Sent: Tuesday, January 19, 2010 7:50 AM
To: Gianluca Varenni gianluca.vare...@gmail.com
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin

Gianluca Varenni gianluca.vare...@gmail.com writes:

Here they are (embedded below)

Thanks. This really puzzled me, because snmpwalk returned all the output needed, while my plugin didn't seem to fetch the needed values. Fortunately, I was able to reproduce this on a local 2650, which makes debugging much easier.

It turns out that when using Net::SNMP::get_entries() to fetch the values, there will sometimes be a whitespace in the OID field of the data returned, while Net::SNMP::get_table() is OK. I really want to use get_entries(), which is significantly faster on servers with lots of physical disks. I have patched check_openmanage so that any excess whitespace is removed from the OID field before it is analyzed, so get_entries() can still be used. A beta version is available (check_openmanage-3.5.5-beta4) here: http://folk.uio.no/trondham/tmp/

This beta also addresses the controller identification for physical drives, where the controller id was reported as '-1'. On older hardware such as the 2650 and 750, the controller to which a physical disk is attached is not available as a connection to the enclosure, because no enclosure exists. The plugin will now probe for this information via the channel connection as well, which provides this info. This only implies that I probe an additional OID for this information.

Please try this beta version and report your results.

Cheers,
--
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo
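The whitespace clean-up Trond describes can be illustrated with a small sketch. This is not the actual check_openmanage patch (the plugin is written in Perl and its code is not shown in this thread); the function name normalize_oids and the sample OID in the usage note are made up for the example.

```python
def normalize_oids(entries):
    """Return a copy of `entries` with all whitespace removed from each OID key.

    entries: dict mapping OID strings to values, as an SNMP library might return them.
    """
    # str.split() with no arguments splits on any run of whitespace,
    # so joining the pieces drops spaces, tabs and newlines anywhere in the key.
    return {"".join(oid.split()): value for oid, value in entries.items()}
```

For instance, a key such as "1.3.6.1.4.1.674 .10893.1" would become "1.3.6.1.4.1.674.10893.1", so later lookups by exact OID match regardless of where the stray whitespace appeared.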
Re: [Nagios-users] Variables in .cfg files -- Howto?
On Jan 19, 2010, at 10:39 AM, Mr Gabriel wrote: How can I use variables in my .cfg files that are only used in that file, and not across my entire install? Your question doesn't make any sense. Can you elaborate by example? -- Marc
Re: [Nagios-users] first_notification_delay problem
Hi,

In addition to my first post about the notification delay: I investigated further and found some more behaviour that seems strange to me, and I couldn't find anything about it in the docs. I have first_notification_delay for the host set to 0, and I tried changing this setting for the service. Here are the results; the first column is the value I set, and the second is the actual time until the notification was sent.

Value set    Actual delay
5            0 (immediately)
10           5
15           10
25           20
12           5
14           10
45           30

Can someone please explain to me why it behaves like this? I don't understand whether it depends on some other setting or on something else.

Thank you very much.
Martin

On Tuesday 19 January 2010 10:59:43 komodo wrote:

Hi all,

Can someone please explain to me how first_notification_delay works? I set up one host with one service for testing. I set first_notification_delay to 5 for the host and to 10 for the service (http), so I expected to get a notification 10 minutes after stopping the http service. But I always get the first notification after 5 minutes, no matter what I set in the config file. Then I changed the delay for the host to 3, because I thought that was the problem, but after this change and a nagios restart I got the notification after 5 minutes again.

That's very strange to me. I don't know what to check or how to make it work. Is there any option that I should check?

Thank you
Martin
[Nagios-users] check_cluster and service hardstate
Hi all,

In the doc dealing with service clusters and the check_cluster plugin, the example uses $SERVICESTATEID:host:Service$. But I'd like to use the hard state. As far as I know, there is no variable giving the service hard state. I had a look at the list: http://nagios.sourceforge.net/docs/2_0/macros.html

Does anybody know if there is a way to do this?

Thanks a lot.
Regards,
Gaël.
[Nagios-users] check_openmanage 3.5.5-beta6 snmp_detect_blade bug
Trond,

Another small bug for your next release. Using check_openmanage 3.5.5-beta6 on a server loaded with OMSA 5.1.0 (a different box this time), the snmp_detect_blade function returned:

    INTERNAL ERROR: Use of uninitialized value in string eq at ./check_openmanage-3.5.5-beta6 line 599.

Looking at the line and then doing my own SNMP query, that OID is missing in OMSA 5.1.0. However, changing line 599 to first make sure a result has been set avoids the uninitialized-value error in the if statement:

    if ( $result->{$DellBaseBoardType} && $result->{$DellBaseBoardType} eq '3') {

Ken

-----Original Message-----
From: Trond Hasle Amundsen [mailto:t.h.amund...@usit.uio.no]
Sent: Tuesday, January 19, 2010 12:19 PM
To: McKinlay, Ken
Cc: nagios-users@lists.sourceforge.net
Subject: Re: check_openmanage 3.5.4 status bug

McKinlay, Ken ken.mckin...@curtisswright.com writes:

The status hash that is used to translate the voltageProbeStatus appears to be incorrect. Instead of %snmp_status, the hash used should be %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. The corrected line reads:

    $status = $snmp_probestatus{$out->{voltageProbeStatus}};

I discovered this issue since one of my Dells has an almost dead battery but the check returned "OOPS! Something is wrong with this server, but I don't know what. The global system health status is CRITICAL, but every component check is OK. This may be a bug in the Nagios plugin, please file a bug report."

Hi Ken,

Thanks for the bug report and the patch. I guess that this was never discovered before, since voltage errors occur rarely. Also good to know that the little failsafe "oops" works as intended :)

Your patch will be included in the next release, and a patched beta version is available here: http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta6

Cheers,
--
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo
Re: [Nagios-users] first_notification_delay problem
On Jan 19, 2010, at 11:57 AM, komodo wrote:

Hi, in addition to my first post about the notification delay: I investigated further and found some more behaviour that seems strange to me, and I couldn't find anything about it in the docs. I have first_notification_delay for the host set to 0, and I tried changing this setting for the service. Here are the results

I have no experience with this feature and don't have time to dig into it right now, but based on the unreliable behavior you describe I'd suggest you make sure you don't have multiple copies of nagios running at the same time. If you've been stopping and starting a lot, that may be the case.

-- Marc
Re: [Nagios-users] check_cluster and service hardstate
On Jan 19, 2010, at 12:42 PM, Gael Cheron wrote:

Hi all, In the doc dealing with the service cluster with the check_cluster plugin, the example uses $SERVICESTATEID:host:Service$ But I'd like to use the hardstate. As far as I know, there is no variable giving the service hardstate. I had a look at the list http://nagios.sourceforge.net/docs/2_0/macros.html Do somebody know if there is a way to do this ?

Having such a macro wouldn't make much sense, IMHO. It seems you might not understand the difference between HARD and SOFT states, or I don't understand the problem you're trying to solve. What would its value be while the service was in a SOFT state, presumably when you'd be interested in using it? OK? WARNING? CRITICAL?

$LASTSERVICESTATEID$ might be close to what you're thinking about, but I believe that it updates after every check, regardless of HARD/SOFT status.

I think you want check_cluster to do something only if the things it's watching are in a HARD state. My suggestion would be to pass $SERVICESTATETYPE:host:Service$ for each cluster element to check_cluster and teach check_cluster to read that and do what you need based on your business logic.

Hope that helps...

-- Marc
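As a rough illustration of Marc's suggestion, the decision logic of a cluster check that also receives the state type of each member might look like the sketch below. It is not the real check_cluster plugin and does not reproduce its options; the function name cluster_state, the pair-based input, and the simple "count >= threshold" comparison are assumptions for the example. It relies only on the state ID being 0 for OK and on the state type macro expanding to HARD or SOFT.

```python
# Nagios plugin return codes.
OK, WARNING, CRITICAL = 0, 1, 2


def cluster_state(members, warn, crit):
    """Return a Nagios return code for a cluster of services.

    members: iterable of (state_id, state_type) pairs, one per cluster element,
             e.g. (2, "HARD"); how they are parsed from the command line is left out.
    warn, crit: number of HARD problems at which the result becomes
                WARNING or CRITICAL (hypothetical threshold semantics).
    """
    # A member counts as a problem only if it is non-OK *and* in a HARD state;
    # members still in a SOFT state are ignored.
    hard_problems = sum(
        1 for state_id, state_type in members
        if state_id != OK and state_type == "HARD"
    )
    if hard_problems >= crit:
        return CRITICAL
    if hard_problems >= warn:
        return WARNING
    return OK
```

With warn=1 and crit=2, a cluster of three members in the states (2, "SOFT"), (2, "HARD") and (0, "HARD") has a single HARD problem, so the sketch reports WARNING; the SOFT CRITICAL member is not counted until it reaches a HARD state.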
Re: [Nagios-users] first_notification_delay problem
I have no experience with this feature and don't have time to dig into it right now but based on your unreliable behavior I'd suggest you make sure you don't have multiple copies of nagios running at the same time. If you've been stopping and starting a lot that may be the case. -- Marc

Thanks for the answer, but that is not a problem. There is only one nagios process.

Martin