Re: [Nagios-users-br] Delay em notificação e check

2009-03-26 Thread Alexander Queiroz
Então a latência tá alta
111.332 s
fiz as seguintes alterações:
service_inter_check_delay_method=n
host_inter_check_delay_method=1
service_interleave_factor=s
max_concurrent_checks=0
max_service_check_spread=1
max_host_check_spread=2
service_reaper_frequency=2
interval_length=10
auto_reschedule_checks=1
auto_rescheduling_interval=10
auto_rescheduling_window=15
use_agressive_host_checking=1
enable_flap_detection=0

CPU e disco está muito tranquilo

- Original Message - 
From: Jose Oliveira jotag...@gmail.com
To: Unofficial Brazilian (Portuguese) Nagios Users List 
nagios-users-br@lists.sourceforge.net
Sent: Wednesday, March 25, 2009 8:39 AM
Subject: Re: [Nagios-users-br] Delay em notificação e check


Olá

Como o seu chack_interval está 3 e o interval_length está 60, o nagios tenta
executar cada verificação de tres em tres minutos...

Veja e nos informe como está a latencia do seu nagios, e o consumo de CPU e
i/o do servidor. Se a latencia estiver alta, é por causa de um destes
dois...

No proprio detalhe do serviço voce pode ver isto, se clicar no serviço e
verificar o campo latency. O ideal é  que 1 segundo...

[]s
Jotagera



2009/3/24 Alexander Queiroz alexander...@terra.com.br

 Boa noite,
 Pessoal gostaria de receber alguma dica para deixar o nagios checando
 instântaneo pois quando um host
 cai, demora cerca de 10minutos para atualizar que o host está down e isso
 vem me causando problemas
 pois tenho que praticamente monitorar a ferramenta de monitoramento.
 Segue abaixo meu arquivo de configuração
 nagios.cfg
 =
 cfg_file=/etc/nagios/hosts.cfg
 cfg_file=/etc/nagios/services.cfg
 cfg_file=/etc/nagios/misccommands.cfg
 cfg_file=/etc/nagios/checkcommands.cfg
 cfg_file=/etc/nagios/contactgroups.cfg
 cfg_file=/etc/nagios/contacts.cfg
 cfg_file=/etc/nagios/hostgroups.cfg
 cfg_file=/etc/nagios/servicegroups.cfg
 cfg_file=/etc/nagios/timeperiods.cfg
 cfg_file=/etc/nagios/escalations.cfg
 cfg_file=/etc/nagios/dependencies.cfg
 cfg_file=/etc/nagios/hostextinfo.cfg
 cfg_file=/etc/nagios/serviceextinfo.cfg
 cfg_file=/etc/nagios/meta_commands.cfg
 cfg_file=/etc/nagios/meta_contact.cfg
 cfg_file=/etc/nagios/meta_contactgroup.cfg
 cfg_file=/etc/nagios/meta_dependencies.cfg
 cfg_file=/etc/nagios/meta_escalations.cfg
 cfg_file=/etc/nagios/meta_host.cfg
 cfg_file=/etc/nagios/meta_hostgroup.cfg
 cfg_file=/etc/nagios/meta_services.cfg
 cfg_file=/etc/nagios/meta_timeperiod.cfg
 resource_file=/etc/nagios/resource.cfg
 log_file=/var/log/nagios/nagios.log
 temp_file=/var/log/nagios/nagios.tmp
 status_file=/var/log/nagios/status.dat
 aggregate_status_updates=1
 status_update_interval=15
 nagios_user=nagios
 nagios_group=nagios
 enable_notifications=1
 execute_service_checks=1
 accept_passive_service_checks=1
 enable_event_handlers=1
 log_rotation_method=d
 log_archive_path=/var/log/nagios/archives/
 check_external_commands=1
 command_check_interval=1s
 command_file=/var/log/nagios/rw/nagios.cmd
 downtime_file=/var/log/nagios/downtime.log
 comment_file=/var/log/nagios/comment.log
 lock_file=/var/log/nagios/nagios.pid
 retain_state_information=1
 state_retention_file=/var/log/nagios/status.sav
 retention_update_interval=180
 use_retained_program_state=1
 use_syslog=0
 log_notifications=1
 log_service_retries=1
 log_host_retries=1
 log_event_handlers=1
 log_initial_states=1
 log_external_commands=1
 sleep_time=0.1
 service_inter_check_delay_method=n
 host_inter_check_delay_method=1
 service_interleave_factor=s
 max_concurrent_checks=0
 max_service_check_spread=1
 max_host_check_spread=2
 service_reaper_frequency=2
 interval_length=60
 auto_reschedule_checks=1
 auto_rescheduling_interval=10
 auto_rescheduling_window=15
 use_agressive_host_checking=1
 enable_flap_detection=0
 low_service_flap_threshold=25.0
 high_service_flap_threshold=50.0
 low_host_flap_threshold=25.0
 high_host_flap_threshold=50.0
 soft_state_dependencies=0
 service_check_timeout=20
 host_check_timeout=15
 event_handler_timeout=30
 notification_timeout=60
 ocsp_timeout=1
 perfdata_timeout=5
 obsess_over_services=1
 obsess_over_hosts=1
 process_performance_data=1
 service_perfdata_command=process-service-perfdata
 service_perfdata_file=/var/log/nagios/service-perfdata
 check_for_orphaned_services=1
 check_service_freshness=1
 service_freshness_check_interval=60
 check_host_freshness=0
 host_freshness_check_interval=60
 date_format=us
 illegal_object_name_chars=~!$%^*|'?,()=
 illegal_macro_output_chars=`~$^|'
 admin_email=admin
 admin_pager=ad...@localhost
 broker_module=/usr/lib/nagios/ndomod.o config_file=/etc/nagios/ndomod.cfg
 event_broker_options=-1
 =

 Um host para monitoramento;
 define host{
  host_name   Firewall_01
  aliasFirewall
  address192.168.70.1
  hostgroups   DATACENTER, Firewalls
  check_command   check_host_alive
  max_check_attempts   1
  check_interval   3
  active_checks_enabled  1
  passive_checks_enabled  0
  

Re: [Nagios-users] use of check_udp

2009-03-26 Thread Andreas Ericsson
Hijacking a message thread to start a new one is a poor way of getting
helpful answers. Please don't do that again.

Philip Brown wrote:
 Hmm..
 
 another snag I hit upgrading from plugins version 1.3.1, to 1.4.13.
 
 previously, it was perfectly valid to use
 
 check_udp -H hostname -p 
 
 and it worked.
 
 but now, you have to specify BOTH a 'send' string, and an 'expect' string.
 
 The help/error message is not entirely clear:
 
 With UDP checks, a send/expect string must be specified.
 
 It would be nicer if it more accurately said,
 
 With UDP checks, both a send AND an expect string must be specified.
 
 
 I was wondering why it was not sufficient to call it with -s. I did not 
 realize it required -e as well.
 Also: i didnt really know what to 'expect... but it turns out that
 
 check_udp -H host -p 2342 -s 'xyz' -e ''
 
 works ok.
 
 Too bad that trick does not seem to work for checking if a kerb5 kdc port is 
 up and valid.
 
 I know there is a check_kdc perl script, but it requires a couple of 
 external modules. I prefer things that are self-contained, or compiled.
 

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NRPE vs. check_by_ssh

2009-03-26 Thread Andreas Ericsson
Kevin Keane wrote:
 Christopher McAtackney wrote:
 2009/3/25 Kevin Keane subscript...@kkeane.com:
   
 I think you are comparing apples and oranges here, because in most
 situations that I can think of, the decision is dictated by the network
 topology. If you are exclusively on a trusted private network,
 check_by_ssh really doesn't offer any benefits. Conversely, if your
 topology involves the Internet or some other untrusted network (WiFi),
 then you wouldn't want NRPE in the first place.

 The only exception to the above that I can think of is when it comes to
 deciding between using check_by_ssh over an untrusted network, vs. NRPE
 through some other kind of tunnel or VPN. But in that case, you'd incur
 encryption overhead either way, and the comparison is very different
 from the question you asked.

 All that said: I don't have any first-hand experience, but I suspect
 that the impact of establishing 2200 ssh connections in a five-minute
 span (assuming that you are using a five-minute check interval) is
 pretty substantial. The main impact actually lies in establishing and
 tearing down the connections, key negotiations etc.; the encryption
 during the data phase probably has only limited impact because most
 checks only transmit a few bytes back and forth.

 SSH does much better with longer-duration connections when the keys are
 already exchanged. This is even more true if you have a router-based
 VPN, because in that case the overhead is offloaded to a different machine.

 So if you have the option of sending the checks as NRPE through one or a
 few long-term VPNs: you are probably going to be better off. Of course,
 in the big picture, your mileage may vary.
 
 Firstly, thanks for the detailed explanation of the issues involved in
 this choice Kevin, it's been very helpful.

 I'm curious though, could you elaborate on why NRPE is unsuitable if
 communication with my remote hosts is going to go via the Internet? Is
 it not sufficient that NRPE uses SSL? This may be more of a network
 security question than a Nagios one, but I've no real experience in
 either area unfortunately, so I appreciate any info you can give here.
   
 No, you are right. I wasn't aware that NRPE could use SSL. In that case, 
 NRPE would be pretty much the same in terms of performance as SSL.
 
 That said, I am generally concerned from a security standpoint about any 
 kind of active checks going over the Internet. This is because if you 
 are monitoring, in your example, 200 hosts, you have to poke holes into 
 200 firewalls (or into one firewall, and then set up SSL or SSH keys on 
 200 hosts). That's 200 potential security holes all over the place with 
 little or no control, and on machines that may not necessarily be 
 hardened for access from the outside world. Worse - active checks, by 
 nature, cause a program to be launched and executed on the monitored 
 client, and usually with very high permissions. You said that you check 
 2000 services, so that's 2000 plugins (give or take a few). What if a 
 hacker found a way to compromise one of your 2000 plugins? You'd have a 
 privilege escalation issue along with remote-launch capability. On 200 
 clients.
 

Very high permissions are normally not needed. I prefer using NRPE because
of two reasons:
1. It provides a rather simple way of specifying exactly which commands
   can be run, and with which arguments (don't enable argument parsing
   in nrpe if the receiving end isn't duly protected by firewalls etc)
2. If someone breaks into the Nagios server, he or she does not get the
   public keys required for running commands on the remote servers.

 Because of these concerns, I am using passive checks almost exclusively 
 over the Internet (except for publicly available services such as HTTP 
 or SMTP, of course); I wrote an agent that resides on the client as a 
 wrapper around the excellent NSClient++ and performs the actual checks. 
 It then forwards the checks to the Nagios server via NSCA over HTTPS. A 
 second benefit is that this agent collects about 40 or so check results, 
 and then sends all of them at once through a single SSL connection. That 
 reduces the overhead of establishing a secure connection by a factor of 
 40. BTW, the agent is available as Open Source. Go to 
 http://www.tntmonitoring.com .
 

Sounds like a rather neat solution, although I suppose it has to be
configured in both ends before it's actually useful (although all other
agents require some configuration anyways, so perhaps it's not such a big
deal). I'm not too fond of relinquishing the re-check logic of Nagios
though, but I guess you can't get everything.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.


Re: [Nagios-users] Nagios and MySQL

2009-03-26 Thread Andreas Ericsson
Mark Weaver wrote:
 Hi List,
 I was wondering if anyone has connected their standard Nagios installation up 
 to 
 a MySQL backend?
  
 I'm looking at this from a purely disaster recovery aspect. It's easy enough 
 to 
 backup the configuration, but the data is another matter.
  

Why you would want to move data from flat files to a database for disaster
recovery purposes is beyond me. Care to explain?

 My current Nagios installation is version 3 installed on a CentOS 4.7 server. 
 The installation was done via Yum from the rpmforge repo. Much cleaner 
 install 
 than compiling the tarballs and much easier to manage.
  
 Yes, I could archive all the installation paths, but because they were 
 installed 
 via the RPM method things are spread out all over the file system. It would 
 be 
 real nice if I had all the data contained within a MySQL backend. That way if 
 I 
 had to restore should my Nag server die or suffer some horrible fate it could 
 be 
 a matter of reinstalling the packages, restoring the /etc/nagios directory 
 where 
 all the configs live, restore the database and I'm of again.
  

This should work reasonably well:

tar czf nagios-backup-$(date +%Y-%m-%d.%T).tar.gz $(rpm -ql nagios)

 I've looked at a few suggested Nagios front ends including Centreon and they 
 turned out to be bad experiences. Mostly due to the fact that being
 a Nagios noobie I didn't know what the hell the front was doing. The worst of 
 it 
 though was the way those front ends kept over-writing my
 configuration files in favor of it's own. Made a real mess of things and 
 after 
 the third reinstallation of Nagios on my sandbox I decided they
 weren't worth the effort or the misery. (Centreon and NagiosQL)
  
 Groundworks is out of the question because as soon as it was installed and 
 running it disabled my current instance of MySQL in favor of it's
 own. Not such a big deal as at the time it was on my sandbox machine, but had 
 that been a production machine where I've got web applications
 running I'd have been seriously pissed! Funny that... Groundworks didn't 
 mention 
 anything about that rather rude behavior.
  
 No, I'd much rather be able to connect a standard Nagios install to a MySQL 
 backend and use everything else as is.
  

What particular data is it you want to put in the database? We have plenty of
scripts, eventbrokers and programs written specifically to gather Nagios data
and put it in a database. If you use our webconfiguration tool, it will
overwrite your configuration files when you click save, but that's sort of
expected behaviour, I guess.

Still, I think you're going about this the wrong way if you only want it for
backup reasons.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Dependent service checks don't fail when depended-on service check fails

2009-03-26 Thread Andreas Ericsson
Jarrod Moore wrote:
 Hello everyone,
 
 I have a couple of related questions regarding service dependencies in
 Nagios and their limitations. I have two service checks (let's call
 them A and B) and service A depends on service B to function
 correctly. I want to set Nagios up so that if service B crashes then
 both services A and B are put into the critical state in Nagios. I've
 tried using service dependencies in Nagios to represent this behaviour
 but have yet to be successful. I can only get it to suppress
 notifications of service A if both services go down.
 

This is expected behaviour. If A is truly dependant on B, then A will
turn into a non-ok state of its own volition rather than as a result
of any dependency magic. Dependencies are designed as a means of
suppressing notifications. Otherwise, you would *always* get a
notification for B first, and a minute or so later from A (actually,
without the dependency you could get from A first).

 Is there a way to do what I'm trying to do here? I'd have thought it
 would be logical that if a service depends on another service and the
 service depended on dies then all services depending on it would fail
 their checks as well, but there;s probably some scenario where it
 doesn't work so well. I've had a look through the mailing list
 archives and found someone had asked a similar question to the
 nagios-devel list about 2.5 years ago and didn't end up getting an
 answer, so I thought I might ask whether solutions to this type of
 problem had been developed since then.
 

They haven't. You're using dependencies the wrong way, really. If
A is truly dependent on B and doesn't go into a non-ok state after
B has crashed, then your check isn't doing what it's supposed to do,
or you've misunderstood the relationship somehow.

If you were to explain what the two services actually are, it would
be easier to point you to a solution that works.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] detecting rogue dhcp server

2009-03-26 Thread Andreas Ericsson
Chris wrote:
 Is there any plugin exist to detect rogue dhcp servers across all vlans/ 
 subnet?
 
 I have had a look at two programs (not Nagios plugins): dhcp_probe and
 roguedetect - none suitable for human consumption yet. There is an
 excellent program on Windows call dhcploc.exe but I'm on Linux.
 

check_dhcp can sort of do what you want. Give it a whirl.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Max concurrent service checks

2009-03-26 Thread Gian Paolo Buono
Hi,

I set this limit because my server go in hang ...and I don'have any error
log in /var/log/messages ...
I have think that my server there are too many processes ...
I have set max_concurrent_checks=0 and my server go in hanq more often...

My server is a FreeBSD 7.1-RELEASE-p2 with 950 host and 4900 service, Nagios
3.0.3

Anothre problem is that  sometimes nagios don't update the status and when i
try to stop nagios don't dies, i try to kill -9 the process but don't dies
so I have to reboot the server.

Any idea ? thank you for the support bye..

On Wed, Mar 25, 2009 at 2:09 PM, Ricardo Maraschini 
ricardo.marasch...@opservices.com.br wrote:




 - Gian Paolo Buono gpbu...@gmail.com escreveu:
  from nagios.log I receive this message ?
 
  [1237973726] Max concurrent service checks (400) has been reached.
  Delaying further checks until previous checks are complete...
 [...]
  any idea ? I is this a problem?

 Search by max_concurrent_checks on
 http://nagios.sourceforge.net/docs/3_0/configmain.html

 -rm

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Max concurrent service checks

2009-03-26 Thread Andreas Ericsson
Gian Paolo Buono wrote:
 Hi,
 
 I set this limit because my server go in hang ...and I don'have any error
 log in /var/log/messages ...
 I have think that my server there are too many processes ...
 I have set max_concurrent_checks=0 and my server go in hanq more often...
 
 My server is a FreeBSD 7.1-RELEASE-p2 with 950 host and 4900 service, Nagios
 3.0.3
 
 Anothre problem is that  sometimes nagios don't update the status and when i
 try to stop nagios don't dies, i try to kill -9 the process but don't dies
 so I have to reboot the server.
 
 Any idea ? thank you for the support bye..
 

The only way I'm aware of that a process can become unkillable is when it's
in uninterruptable IO (ie, the kernel is waiting for response from a piece
of hardware in such a way that everything else is more or less locked down).

Are you using network-mounted drives to store any of Nagios' output files?
If so, stop doing that immediately. Network filesystems perform extremely
poorly with files that are being frequently updated.

Apart from that, it seems as if your system isn't quite up to scratch for
handling the workload you want to put on it.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] LOG2NDO doesn't insert anythigng in database

2009-03-26 Thread MAD
Hi all,

I'm using NDOUtils to extract logs from Nagios to a MySQL database. 
Unfortunately, the ndo2db daemon crashed this week-end and I noticed it only on 
yesterday. I tried to re-inject logs via the log2ndo utility and data where 
inserted in logentries table but nothing inside the other tables.

Is there a way to insert it in the servicechecks table, hostcheck table 
etc...?

Marc-André

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Max concurrent service checks

2009-03-26 Thread Giorgio Zarrelli
Ciao,

issue a top and look for wa. Your troubles make me think about some i/o
problems and so too many wait cycles on cpu.

Giorgio

Gian Paolo Buono (gpbu...@gmail.com) scritto:

 Hi,

 I set this limit because my server go in hang ...and I don'have any error
 log in /var/log/messages ...
 I have think that my server there are too many processes ...
 I have set max_concurrent_checks=0 and my server go in hanq more often...

 My server is a FreeBSD 7.1-RELEASE-p2 with 950 host and 4900 service, Nagios
 3.0.3

 Anothre problem is that  sometimes nagios don't update the status and when i
 try to stop nagios don't dies, i try to kill -9 the process but don't dies
 so I have to reboot the server.

 Any idea ? thank you for the support bye..

 On Wed, Mar 25, 2009 at 2:09 PM, Ricardo Maraschini 
 ricardo.marasch...@opservices.com.br wrote:

 
 
 
  - Gian Paolo Buono gpbu...@gmail.com escreveu:
   from nagios.log I receive this message ?
  
   [1237973726] Max concurrent service checks (400) has been reached.
   Delaying further checks until previous checks are complete...
  [...]
   any idea ? I is this a problem?
 
  Search by max_concurrent_checks on
  http://nagios.sourceforge.net/docs/3_0/configmain.html
 
  -rm
 



--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Max concurrent service checks

2009-03-26 Thread Gian Paolo Buono
Hi,
I don't have any nfs mount on this server, and can not find the problem..
I think that the problem is the raid controller...

[r...@server /usr/local/etc/nagios]# dmesg | grep -i raid
aac0: IBM ServeRAID-8k port 0x5000-0x50ff mem
0xc9e0-0xc9ff,0xc7fe-0xc7ff irq 17 at device 0.0 on pci4
aac0: ServeRAID 8k-l  , aac driver 2.0.0-1
aacd0: RAID 1 (Mirror) on aac0

but i dont have any log on this ..any suggest ?


On Thu, Mar 26, 2009 at 10:57 AM, Andreas Ericsson a...@op5.se wrote:

 Gian Paolo Buono wrote:

 Hi,

 I set this limit because my server go in hang ...and I don'have any error
 log in /var/log/messages ...
 I have think that my server there are too many processes ...
 I have set max_concurrent_checks=0 and my server go in hanq more often...

 My server is a FreeBSD 7.1-RELEASE-p2 with 950 host and 4900 service,
 Nagios
 3.0.3

 Anothre problem is that  sometimes nagios don't update the status and when
 i
 try to stop nagios don't dies, i try to kill -9 the process but don't dies
 so I have to reboot the server.

 Any idea ? thank you for the support bye..


 The only way I'm aware of that a process can become unkillable is when it's
 in uninterruptable IO (ie, the kernel is waiting for response from a piece
 of hardware in such a way that everything else is more or less locked
 down).

 Are you using network-mounted drives to store any of Nagios' output files?
 If so, stop doing that immediately. Network filesystems perform extremely
 poorly with files that are being frequently updated.

 Apart from that, it seems as if your system isn't quite up to scratch for
 handling the workload you want to put on it.

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Auto-Discovery

2009-03-26 Thread Christopher McAtackney
2009/3/25 Russell Adams rlad...@adamsinfoserv.com:
 It doesn't appear to me that the configuration syntax has changed
 significantly, but I haven't used 3.x yet.

 NACE updates your templates, so the syntax shouldn't matter much. If
 you write Nagios 3.x configs, and let NACE reuse them with variable
 replacement, there shouldn't be a problem.

 Thanks.

Thanks for the info Russel, I'll give it a shot and see how I get on.

Cheers,
Chris

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and MySQL

2009-03-26 Thread Ricardo Maraschini

- Mark Weaver mwea...@compinfosystems.com escreveu:
 I was wondering if anyone has connected their standard Nagios
 installation up to a MySQL backend?

You can take a look to www.opmon.org
We make available a configuration tool called OpCfg and an event 
broker(including performance data parse), so every information will be in a 
mysql db.

Regarding to backup issue, you can run a rpm -ql package name to get gathered 
all Nagios instalation files and make a backup of them.

HTH.

-rm

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Auto-Discovery

2009-03-26 Thread Richard Savage


Christopher McAtackney wrote:
 2009/3/25 Russell Adams rlad...@adamsinfoserv.com:
   
 It doesn't appear to me that the configuration syntax has changed
 significantly, but I haven't used 3.x yet.

 NACE updates your templates, so the syntax shouldn't matter much. If
 you write Nagios 3.x configs, and let NACE reuse them with variable
 replacement, there shouldn't be a problem.

 Thanks.
 

 Thanks for the info Russel, I'll give it a shot and see how I get on.

 Cheers,
 Chris

   
There is a new config / discovery manager for nagios called Lilac.

http://www.lilacplatform.com/

Have a look and let us know how you get on.

Rich

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Auto-Discovery

2009-03-26 Thread Christopher McAtackney
 There is a new config / discovery manager for nagios called Lilac.

 http://www.lilacplatform.com/

 Have a look and let us know how you get on.

 Rich

Hi Rich,

I have looked at Lilac before, and it looks great, unfortunately the
only supported database platform I'll be using is Oracle. Is there a
way to configure Lilac so that it does not have to use MySQL, or is it
required for the core functionality?

Cheers,
Chris

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Uniquely Identifiable Events in Nagios.log?

2009-03-26 Thread Christopher McAtackney
Hi all,

I was wondering if it is possible to configure Nagios to produce
uniquely identifiable entries in the nagios/var/nagios.log file?

The reason I ask, is that I would like to parse this log file for
service check results and perform further processing based on the
values discovered there. The trouble is, that as far as I can see,
Nagios uses a time-stamp which is only accurate to the second, and so
my log files have lines which all have the same time-stamp. Is there a
way to increase the accuracy of this time-stamp perhaps? Or any other
suitable solution to the general problem of identifying log entries?

Cheers,
Chris

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Uniquely Identifiable Events in Nagios.log?

2009-03-26 Thread Marc Powell

On Mar 26, 2009, at 9:22 AM, Christopher McAtackney wrote:

 Hi all,

 I was wondering if it is possible to configure Nagios to produce
 uniquely identifiable entries in the nagios/var/nagios.log file?

No, the log format isn't configurable.


  Or any other suitable solution to the general problem of  
 identifying log entries?

NDO seems like a good fit. It's all in an query-able database and each  
row has a unique id.

--
Marc

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Uniquely Identifiable Events in Nagios.log?

2009-03-26 Thread Christopher McAtackney
2009/3/26 Marc Powell m...@ena.com:

 On Mar 26, 2009, at 9:22 AM, Christopher McAtackney wrote:

 Hi all,

 I was wondering if it is possible to configure Nagios to produce
 uniquely identifiable entries in the nagios/var/nagios.log file?

 No, the log format isn't configurable.


  Or any other suitable solution to the general problem of
 identifying log entries?

 NDO seems like a good fit. It's all in an query-able database and each
 row has a unique id.

 --
 Marc

I have come across NDO before, but unfortunately I cannot run MySQL in
the production environment, and so I have to find a different solution
to this problem. I found an NDOUtils Oracle, but I believe that it is
still in a pre-alpha state, and so probably not suitable for a
production environment with thousands of checks occurring.

Chris

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] NDO update delay

2009-03-26 Thread Rodney Ramos
Hi everybody.

I´ve installed NDOUtils (nodutils-1.4b7) whith Nagios (nagios-3.0.6) and
it´s working.

However I´ve noticed that it´s taking more than 30 minutes to update the
MySQL tables.

I´ve detected the problem making a query on the field last_check on
nagios_hoststatus table.

Can anyone help me? There is a parameter to make the update process faster?

Thanks,
Rodney.
--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Users able to see services their not authenticated for

2009-03-26 Thread Tore Lønøy
Hello,

I have a service I just want a certain group to be able to view, here is the
object def. of the service from objects.cache:

define service {
host_name   hostname
service_description description
check_period24x7
check_command   check_nrpe!service
contacts
contact_groups
notification_period 18x7
initial_state   o
check_interval  2.00
retry_interval  5.00
max_check_attempts  4
is_volatile 0
parallelize_check   1
active_checks_enabled   1
passive_checks_enabled  0
obsess_over_service 0
event_handler_enabled   0
low_flap_threshold  0.00
high_flap_threshold 0.00
flap_detection_enabled  1
flap_detection_options  o,w,u,c
freshness_threshold 0
check_freshness 0
notification_optionsc,r
notifications_enabled   0
notification_interval   60.00
first_notification_delay0.00
stalking_optionsn
process_perf_data   1
failure_prediction_enabled  1
icon_image  graph.png
retain_status_information   1
retain_nonstatus_information1
}

I've zeroed out the values for contacts and contactgroups, which should
effectivly hide this service unless cgi.cfg tells otherwise for that user.

But, any user is able to view this service, why?

Ive tried to add myself to the contacts line and remove the contact_groups,
so it would look like this:
contacts  tore
#contact_groups

But this still doesnt work, are regular operators are still able to view the
service, why? Operators are members of a operators group, which is a
contactgroup for all hosts.

Any tips would be gratefull :)

Running Nagios 3.0.1
--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] NRPE vs. check_by_ssh

2009-03-26 Thread Kevin Keane
Andreas Ericsson wrote:
 Kevin Keane wrote:
 Christopher McAtackney wrote:
 2009/3/25 Kevin Keane subscript...@kkeane.com:
  
 I think you are comparing apples and oranges here, because in most
 situations that I can think of, the decision is dictated by the 
 network
 topology. If you are exclusively on a trusted private network,
 check_by_ssh really doesn't offer any benefits. Conversely, if your
 topology involves the Internet or some other untrusted network (WiFi),
 then you wouldn't want NRPE in the first place.

 The only exception to the above that I can think of is when it 
 comes to
 deciding between using check_by_ssh over an untrusted network, vs. 
 NRPE
 through some other kind of tunnel or VPN. But in that case, you'd 
 incur
 encryption overhead either way, and the comparison is very different
 from the question you asked.

 All that said: I don't have any first-hand experience, but I suspect
 that the impact of establishing 2200 ssh connections in a five-minute
 span (assuming that you are using a five-minute check interval) is
 pretty substantial. The main impact actually lies in establishing and
 tearing down the connections, key negotiations etc.; the encryption
 during the data phase probably has only limited impact because most
 checks only transmit a few bytes back and forth.

 SSH does much better with longer-duration connections when the keys 
 are
 already exchanged. This is even more true if you have a router-based
 VPN, because in that case the overhead is offloaded to a different 
 machine.

 So if you have the option of sending the checks as NRPE through one 
 or a
 few long-term VPNs: you are probably going to be better off. Of 
 course,
 in the big picture, your mileage may vary.
 
 Firstly, thanks for the detailed explanation of the issues involved in
 this choice Kevin, it's been very helpful.

 I'm curious though, could you elaborate on why NRPE is unsuitable if
 communication with my remote hosts is going to go via the Internet? Is
 it not sufficient that NRPE uses SSL? This may be more of a network
 security question than a Nagios one, but I've no real experience in
 either area unfortunately, so I appreciate any info you can give here.
   
 No, you are right. I wasn't aware that NRPE could use SSL. In that 
 case, NRPE would be pretty much the same in terms of performance as SSL.

 That said, I am generally concerned from a security standpoint about 
 any kind of active checks going over the Internet. This is because if 
 you are monitoring, in your example, 200 hosts, you have to poke 
 holes into 200 firewalls (or into one firewall, and then set up SSL 
 or SSH keys on 200 hosts). That's 200 potential security holes all 
 over the place with little or no control, and on machines that may 
 not necessarily be hardened for access from the outside world. Worse 
 - active checks, by nature, cause a program to be launched and 
 executed on the monitored client, and usually with very high 
 permissions. You said that you check 2000 services, so that's 2000 
 plugins (give or take a few). What if a hacker found a way to 
 compromise one of your 2000 plugins? You'd have a privilege 
 escalation issue along with remote-launch capability. On 200 clients.


 Very high permissions are normally not needed.
Depends on the plugin, but I'm not sure that this is generally true. For 
instance, something as simple as log file analysis either requires root 
permission on Linux; log files aren't readable by anybody else, or it 
requires that you relax file permissions or security somewhere else. On 
Windows, I'm running my monitoring agent (by default) as the Local 
System account (most Windows services do that anyway). That has 
basically full access to everything, but nothing on the network.

Of course check_ping, check_tcp etc. don't usually need such high 
permissions.
 I prefer using NRPE because
 of two reasons:
 1. It provides a rather simple way of specifying exactly which commands
   can be run, and with which arguments (don't enable argument parsing
   in nrpe if the receiving end isn't duly protected by firewalls etc)
 2. If someone breaks into the Nagios server, he or she does not get the
   public keys required for running commands on the remote servers.
Can you explain that second statement? I'm not sure I follow what you 
are trying to say here. Why would getting public keys be a bad thing? 
They are, by definition, freely available anyway.
 Because of these concerns, I am using passive checks almost 
 exclusively over the Internet (except for publicly available services 
 such as HTTP or SMTP, of course); I wrote an agent that resides on 
 the client as a wrapper around the excellent NSClient++ and performs 
 the actual checks. It then forwards the checks to the Nagios server 
 via NSCA over HTTPS. A second benefit is that this agent collects 
 about 40 or so check results, and then sends all of them at once 
 through a single SSL connection. That reduces the overhead 

Re: [Nagios-users] Uniquely Identifiable Events in Nagios.log?

2009-03-26 Thread Marc Powell

On Mar 26, 2009, at 9:56 AM, Christopher McAtackney wrote:

 I have come across NDO before, but unfortunately I cannot run MySQL in
 the production environment, and so I have to find a different solution
 to this problem. I found an NDOUtils Oracle, but I believe that it is
 still in a pre-alpha state, and so probably not suitable for a
 production environment with thousands of checks occurring.

If you have any programmers on staff you could create your own  
event_broker to log to a custom file or add custom log entries to  
nagios.log. Someone may have already created one but a quick Google  
doesn't show any obvious candidates.

--
Marc

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Users able to see services their not authenticated for

2009-03-26 Thread Marc Powell

On Mar 26, 2009, at 10:51 AM, Tore Lønøy wrote:


 I've zeroed out the values for contacts and contactgroups, which  
 should effectivly hide this service unless cgi.cfg tells otherwise  
 for that user.

 But, any user is able to view this service, why?

 Ive tried to add myself to the contacts line and remove the  
 contact_groups, so it would look like this:
 contacts  tore
 #contact_groups

 But this still doesnt work, are regular operators are still able to  
 view the service, why? Operators are members of a operators group,  
 which is a contactgroup for all hosts.

http://nagios.sourceforge.net/docs/3_0/objectinheritance.html#implied_inheritance

--
Marc


--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NRPE vs. check_by_ssh

2009-03-26 Thread Charlie Reddington

On Mar 26, 2009, at 11:05 AM, Kevin Keane wrote:

 Andreas Ericsson wrote:
 Kevin Keane wrote:
 Christopher McAtackney wrote:
 2009/3/25 Kevin Keane subscript...@kkeane.com:

 I think you are comparing apples and oranges here, because in most
 situations that I can think of, the decision is dictated by the
 network
 topology. If you are exclusively on a trusted private network,
 check_by_ssh really doesn't offer any benefits. Conversely, if  
 your
 topology involves the Internet or some other untrusted network  
 (WiFi),
 then you wouldn't want NRPE in the first place.

 The only exception to the above that I can think of is when it
 comes to
 deciding between using check_by_ssh over an untrusted network, vs.
 NRPE
 through some other kind of tunnel or VPN. But in that case, you'd
 incur
 encryption overhead either way, and the comparison is very  
 different
 from the question you asked.

 All that said: I don't have any first-hand experience, but I  
 suspect
 that the impact of establishing 2200 ssh connections in a five- 
 minute
 span (assuming that you are using a five-minute check interval) is
 pretty substantial. The main impact actually lies in  
 establishing and
 tearing down the connections, key negotiations etc.; the  
 encryption
 during the data phase probably has only limited impact because  
 most
 checks only transmit a few bytes back and forth.

 SSH does much better with longer-duration connections when the  
 keys
 are
 already exchanged. This is even more true if you have a router- 
 based
 VPN, because in that case the overhead is offloaded to a different
 machine.

 So if you have the option of sending the checks as NRPE through  
 one
 or a
 few long-term VPNs: you are probably going to be better off. Of
 course,
 in the big picture, your mileage may vary.

 Firstly, thanks for the detailed explanation of the issues  
 involved in
 this choice Kevin, it's been very helpful.

 I'm curious though, could you elaborate on why NRPE is unsuitable  
 if
 communication with my remote hosts is going to go via the  
 Internet? Is
 it not sufficient that NRPE uses SSL? This may be more of a network
 security question than a Nagios one, but I've no real experience in
 either area unfortunately, so I appreciate any info you can give  
 here.

 No, you are right. I wasn't aware that NRPE could use SSL. In that
 case, NRPE would be pretty much the same in terms of performance  
 as SSL.

 That said, I am generally concerned from a security standpoint about
 any kind of active checks going over the Internet. This is because  
 if
 you are monitoring, in your example, 200 hosts, you have to poke
 holes into 200 firewalls (or into one firewall, and then set up SSL
 or SSH keys on 200 hosts). That's 200 potential security holes all
 over the place with little or no control, and on machines that may
 not necessarily be hardened for access from the outside world. Worse
 - active checks, by nature, cause a program to be launched and
 executed on the monitored client, and usually with very high
 permissions. You said that you check 2000 services, so that's 2000
 plugins (give or take a few). What if a hacker found a way to
 compromise one of your 2000 plugins? You'd have a privilege
 escalation issue along with remote-launch capability. On 200  
 clients.


 Very high permissions are normally not needed.
 Depends on the plugin, but I'm not sure that this is generally true.  
 For
 instance, something as simple as log file analysis either requires  
 root
 permission on Linux; log files aren't readable by anybody else, or it
 requires that you relax file permissions or security somewhere else.  
 On
 Windows, I'm running my monitoring agent (by default) as the Local
 System account (most Windows services do that anyway). That has
 basically full access to everything, but nothing on the network.

My nagios user only checks basic system stuff, and I haven't run into  
a permission error situation yet, and I check the following by default  
- load, users, disk, swap, memory, processes, databases, raid.



 Of course check_ping, check_tcp etc. don't usually need such high
 permissions.
 I prefer using NRPE because
 of two reasons:
 1. It provides a rather simple way of specifying exactly which  
 commands
  can be run, and with which arguments (don't enable argument parsing
  in nrpe if the receiving end isn't duly protected by firewalls etc)
 2. If someone breaks into the Nagios server, he or she does not get  
 the
  public keys required for running commands on the remote servers.
 Can you explain that second statement? I'm not sure I follow what you
 are trying to say here. Why would getting public keys be a bad thing?
 They are, by definition, freely available anyway.

What you CAN do, though it's kind of a p.i.t.a is, is have a key per  
command. So if you have something like check_disk, you can put a  
single key for just that command. On all the servers you roll this out  
to, you can secure it up 

[Nagios-users] Nagios customization question...

2009-03-26 Thread Michael . Washington

The gurus have asked me to include the following scripting in our branch
office Nagios server configuration in order to monitor a remote co-location
site DNS server .   I understand how/where to include the
host/service/contact/command definitions scripting, but where (what
file(s)) do I include the company_check_dns scripting command set procedure
listed near the bottom?  I am running v3.0.2 on Fedora with the quick
installation setup.


***

We need to monitor DNS server at ip x.x.x.x both for ICMP and DNS.


Here are definitions for nagios that used to use,  they will have to be
modified  by you to work with the DNS server.

linux-admins group would have 3 names Larry, Moe and Curly.

***

define host {
use generic-host
host_name   dns20
alias   ns20
address x.x.x.x
max_check_attempts  3
notification_period 24x7
notification_optionsd,u,r
check_command   check-host-alive
contact_groups  linux-admins
}




define service {
use generic-service
host_name  dns20
service_description PING
is_volatile 0
check_period24x7
max_check_attempts  8
normal_check_interval   10
retry_check_interval2
contact_groups  linux-admins
notification_period 24x7
notification_optionsc,r
check_command   check_ping!100.0,20%!900.0,60%
event_handler   networkdevice-notify-by-epager
event_handler_enabled   1
}


define service {
use generic-service
host_name  dns20
service_description dns_check
is_volatile 0
check_period24x7
max_check_attempts  8
normal_check_interval   20
retry_check_interval5
contact_groups  linux-admins
notification_period 24x7
notification_optionsc,r
check_command   company_check_dns
event_handler   networkdevice-notify-by-epager
event_handler_enabled   1
}

define command{
command_namecompany_check_dns
command_line$USER2$/company_check_dns $HOSTADDRESS$
}

### company_check_dns
#!/bin/bash

PATH=$PATH:/usr/lib/nagios/company-plugins

DOMAINSTOCHECK=ftp.company1.com company.com company2.com comp\
any3.com company4.com company5.com

for i in `echo $DOMAINSTOCHECK`; do
RESULT=`check_dns -H $i -s $1 | grep CRITICAL`
if [ x$RESULT != x ]
then
echo DNS CRITICAL - $1 - $i
exit 2
fi
done

echo DNS ok - $1;
exit 0
###end of company_check_dns





# 'mkozlows' contact definition
define contact{
contact_namemkozlows
alias   Mariusz Kozlowski
service_notification_period 24x7
host_notification_period24x7
service_notification_optionsc,r
host_notification_options   d,r
service_notification_commands   notify-by-email,notify-by-epager
host_notification_commands
host-notify-by-email,host-notify-by-epager
email   mariusz.kozlow...@fitchratings.com
pager   9178053...@cingularme.com
}

define contact{
contact_namektai
alias   Ken Tai
service_notification_period 24x7
host_notification_period24x7
service_notification_optionsc,r
host_notification_options   d,r
service_notification_commands   notify-by-email,notify-by-epager
host_notification_commands
host-notify-by-email,host-notify-by-epager
email   kenneth@fitchratings.com
pager   9178053...@cingularme.com
}



# 'mbarker' contact definition
define contact{
contact_namembarker
alias   Matt Barker
service_notification_period 24x7
host_notification_period24x7
service_notification_optionsc,r
host_notification_options   d,r
service_notification_commands   notify-by-email,notify-by-epager
host_notification_commands
host-notify-by-email,host-notify-by-epager
email
matthew.bar...@fitchratings.com,mbarker9...@gmail.com
pager
6462564...@cingularme.com,5167493...@cingularme.com



Re: [Nagios-users] Nagios customization question...

2009-03-26 Thread Marc Powell

On Mar 26, 2009, at 1:58 PM, michael.washing...@fitchratings.com wrote:


 The gurus have asked me to include the following scripting in our  
 branch
 office Nagios server configuration in order to monitor a remote co- 
 location
 site DNS server .   I understand how/where to include the
 host/service/contact/command definitions scripting, but where (what
 file(s)) do I include the company_check_dns scripting command set  
 procedure
 listed near the bottom?  I


mmm. tastes like homework. Crunchy ;) It's a custom plugin. It should  
be created in the directory where you normally store nagios plugins  
(is that it referenced in the script?) and made executable by the  
nagios user.

--
Marc


--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NDO update delay

2009-03-26 Thread Jim Avery
2009/3/26 Rodney Ramos rodne...@gmail.com:
 Hi everybody.

 I´ve installed NDOUtils (nodutils-1.4b7) whith Nagios (nagios-3.0.6) and
 it´s working.

 However I´ve noticed that it´s taking more than 30 minutes to update the
 MySQL tables.

 I´ve detected the problem making a query on the field last_check on
 nagios_hoststatus table.

 Can anyone help me? There is a parameter to make the update process faster?


You can try reducing the number of days data you keep and adding some
indexes.  The solution I think helps most though was contributed by
Marc DeTrano here in a thread here on 3rd/4th March which I summarised
thus:

I simply added a file nagios.cnf
under /etc/mysql/conf.d like so:

 [mysqld]
 innodb_flush_log_at_trx_commit = 2

and it reduced my Nagios startup time from more than 3 minutes to only
30 seconds!

I already had quite a few of the data_processing_options disabled in
ndomod.cfg and had reduced all of the max_*_age parameters in
ndo2db.cfg to 24 hours (before those my startup time was more than 5
minutes).

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios customization question...

2009-03-26 Thread Jim Avery
2009/3/26  michael.washing...@fitchratings.com:

 The gurus have asked me to include the following scripting in our branch
 office Nagios server configuration in order to monitor a remote co-location
 site DNS server .   I understand how/where to include the
 host/service/contact/command definitions scripting, but where (what
 file(s)) do I include the company_check_dns scripting command set procedure
 listed near the bottom?  I am running v3.0.2 on Fedora with the quick
 installation setup.

[snip stuff I assume you're happy with]

 define command{
        command_name    company_check_dns
        command_line    $USER2$/company_check_dns $HOSTADDRESS$
        }

Above is the command definition which hopefully you're happy with.
Note the use of $USER2$ rather than the usual $USER1$.  This
refers to a macro containing a path to the location of your script.
You should find this in your resource file.  See
http://nagios.sourceforge.net/docs/3_0/configmain.html#resource_file

 ### company_check_dns
 #!/bin/bash

 PATH=$PATH:/usr/lib/nagios/company-plugins

 DOMAINSTOCHECK=ftp.company1.com company.com company2.com comp\
 any3.com company4.com company5.com

 for i in `echo $DOMAINSTOCHECK`; do
        RESULT=`check_dns -H $i -s $1 | grep CRITICAL`
        if [ x$RESULT != x ]
        then
                echo DNS CRITICAL - $1 - $i
                exit 2
        fi
 done

 echo DNS ok - $1;
 exit 0
 ###end of company_check_dns

The above is what you should create as your plugin - copy this in to a
file under the directory which $USER2$ points to and make it
executable.
[ snip contact definitions which I assume you're happy with]

hth,

Jim

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Dependent service checks don't fail when depended-on service check fails

2009-03-26 Thread Jarrod Moore
On Thu, Mar 26, 2009 at 7:57 PM, Andreas Ericsson a...@op5.se wrote:
 Jarrod Moore wrote:

 Hello everyone,

 I have a couple of related questions regarding service dependencies in
 Nagios and their limitations. I have two service checks (let's call
 them A and B) and service A depends on service B to function
 correctly. I want to set Nagios up so that if service B crashes then
 both services A and B are put into the critical state in Nagios. I've
 tried using service dependencies in Nagios to represent this behaviour
 but have yet to be successful. I can only get it to suppress
 notifications of service A if both services go down.


 This is expected behaviour. If A is truly dependant on B, then A will
 turn into a non-ok state of its own volition rather than as a result
 of any dependency magic. Dependencies are designed as a means of
 suppressing notifications. Otherwise, you would *always* get a
 notification for B first, and a minute or so later from A (actually,
 without the dependency you could get from A first).

 Is there a way to do what I'm trying to do here? I'd have thought it
 would be logical that if a service depends on another service and the
 service depended on dies then all services depending on it would fail
 their checks as well, but there;s probably some scenario where it
 doesn't work so well. I've had a look through the mailing list
 archives and found someone had asked a similar question to the
 nagios-devel list about 2.5 years ago and didn't end up getting an
 answer, so I thought I might ask whether solutions to this type of
 problem had been developed since then.


 They haven't. You're using dependencies the wrong way, really. If
 A is truly dependent on B and doesn't go into a non-ok state after
 B has crashed, then your check isn't doing what it's supposed to do,
 or you've misunderstood the relationship somehow.

 If you were to explain what the two services actually are, it would
 be easier to point you to a solution that works.

 --
 Andreas Ericsson                   andreas.erics...@op5.se
 OP5 AB                             www.op5.se
 Tel: +46 8-230225                  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.


Well basically I have a map (similar to Google Maps) embedded in a
website, which hits a URL to retrieve maps. So I have one check using
check_http to check that the website itself is up and another check on
that URL to make sure that the map service is available. Now if the
map service goes down, the website is still up but the maps won't
appear, which means the website's functionality is significantly
affected. However, it is still up and viewable so doing a check on the
website URL still passes.

Now of course I could just write a script or something to check both
URLs and set that as the check command. There is a problem for me with
this approach, however, because I have some other instances where a
web service depends on other web services. When I want to use these
services in websites, I'd then have to write a check for each script,
each containing every service in the chain that is needed to display
the website correctly. This way of doing things just seems a bit
repetitive to me, especially when I have a check for these web
services already.

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios with snmp

2009-03-26 Thread Satish Patel
only put snmp community passwd and get any information from SNMP device 
  - Original Message - 
  From: Anirudh Srinivasan 
  To: nagios-users@lists.sourceforge.net 
  Sent: Tuesday, March 24, 2009 11:18 AM
  Subject: [Nagios-users] Nagios with snmp


  Hey Folks,

  I want to test a Switch to be monitored using snmp plugin in nagios. Right 
now i am just monitoring the ping but i need to find out , what other parameter 
can be monitored on a switch using snmp. Also i have a windows server with 
NC_net installed , and i need to test using check_nt as well as check_snmp 
plugin the parameters . I am looking forward to some help with this. Can i get 
some link or some tutorials to start up with this please . 

  I have snmpd running on my linux , i need to know what changes i need to make 
for my snmpd.conf file 

  Thanks 

  -- 
  Anirudh Srinivasan




--


  --
  Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are
  powering Web 2.0 with engaging, cross-platform capabilities. Quickly and
  easily build your RIAs with Flex Builder, the Eclipse(TM)based development
  software that enables intelligent coding and step-through debugging.
  Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com


--


  ___
  Nagios-users mailing list
  Nagios-users@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/nagios-users
  ::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
  ::: Messages without supporting info will risk being sent to /dev/null--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null