date:20090408

Re: [Nagios-users] serious performance issue

2009-04-08 Thread fancyrabbit

i met almost the same issue.
after tweaking enable_embedded_perl=0, the load average was brought up but
latencies became lower.

On Wed, Apr 8, 2009 at 11:54 AM, shadih rahman shadhi...@gmail.com wrote:

 I am seeing a ton of orphaned error message for both services and hosts.  I
 am running nagios on a quad core 2.2 GHZ machine running 4 GHZ memory.  I
 will paste my configuration file below.  I have the machine sending ndo to a
 local database sitting on a 170 GB Hard drive.  nagios is obcessing on both
 host and services and sending data to a machine with identical
 configuration.  I am doing failover using NSCA.  Please advise on this.





 nagios.cfg



 log_file=/var/log/nagios/nagios.log
 cfg_file=/etc/nagios/commands.cfg
 cfg_file=/etc/nagios/contacts.cfg
 cfg_file=/etc/nagios/timeperiods.cfg
 cfg_file=/etc/nagios/templates.cfg
 cfg_dir=/etc/nagios/hosts
 cfg_dir=/etc/nagios/services
 object_cache_file=/var/log/nagios/objects.cache
 precached_object_file=/var/log/nagios/objects.precache
 resource_file=/etc/nagios/resource.cfg
 status_file=/var/log/nagios/status.dat
 status_update_interval=60
 nagios_user=nagios
 nagios_group=nagios
 check_external_commands=1
 command_check_interval=-1
 command_file=/var/log/nagios/rw/nagios.cmd
 external_command_buffer_slots=8192
 lock_file=/var/log/nagios/nagios.lock
 temp_file=/var/log/nagios/nagios.tmp
 temp_path=/tmp
 event_broker_options=8
 broker_module=/usr/lib64/nagios/ndomod.o config_file=/etc/nagios/ndomod.cfg
 log_rotation_method=m
 log_archive_path=/var/log/nagios/archives
 use_syslog=1
 log_notifications=1
 log_service_retries=1
 log_host_retries=1
 log_event_handlers=1
 log_initial_states=0
 log_external_commands=1
 log_passive_checks=1
 service_inter_check_delay_method=n
 max_service_check_spread=30
 service_interleave_factor=s
 host_inter_check_delay_method=s
 max_host_check_spread=30
 max_concurrent_checks=0
 check_result_reaper_frequency=2
 max_check_result_reaper_time=10
 check_result_path=/var/log/nagios/spool/checkresults
 max_check_result_file_age=3600
 cached_host_check_horizon=15
 cached_service_check_horizon=15
 enable_predictive_host_dependency_checks=1
 enable_predictive_service_dependency_checks=1
 soft_state_dependencies=1
 auto_reschedule_checks=1
 auto_rescheduling_interval=30
 auto_rescheduling_window=180
 sleep_time=0.25
 service_check_timeout=30
 host_check_timeout=20

 event_handler_timeout=30
 notification_timeout=60
 ocsp_timeout=5
 perfdata_timeout=5
 retain_state_information=1
 state_retention_file=var/log/nagios/retention.dat
 retention_update_interval=60
 use_retained_program_state=1
 use_retained_scheduling_info=1
 retained_host_attribute_mask=0
 retained_service_attribute_mask=0
 retained_process_host_attribute_mask=0
 retained_process_service_attribute_mask=0
 retained_contact_host_attribute_mask=0
 retained_contact_service_attribute_mask=0
 interval_length=60
 use_aggressive_host_checking=0
 execute_service_checks=1
 accept_passive_service_checks=1
 execute_host_checks=1
 accept_passive_host_checks=1
 enable_notifications=1
 enable_event_handlers=1
 process_performance_data=0
 obsess_over_services=1
 ocsp_command=send_service_check
 ochp_command=send_host_check
 obsess_over_hosts=1
 translate_passive_host_checks=0
 passive_host_checks_are_soft=0
 check_for_orphaned_services=1
 check_for_orphaned_hosts=1
 check_service_freshness=1
 service_freshness_check_interval=60
 check_host_freshness=0
 host_freshness_check_interval=60
 additional_freshness_latency=15
 enable_flap_detection=1
 low_service_flap_threshold=5.0
 high_service_flap_threshold=20.0
 low_host_flap_threshold=5.0
 high_host_flap_threshold=20.0
 date_format=us
 enable_embedded_perl=1
 use_embedded_perl_implicitly=1
 illegal_object_name_chars=`~!$%^*|'?,()=
 illegal_macro_output_chars=`~$|'
 use_regexp_matching=0
 use_true_regexp_matching=0
 admin_email=sr2...@columbia.edu
 daemon_dumps_core=0
 use_large_installation_tweaks=1
 enable_environment_macros=1
 debug_level=-1debug_verbosity=2
 debug_file=/var/log/nagios/nagios.debug
 max_debug_file_size=100




 my nagiostats output







 [sr2690nagiostats

 Nagios Stats 3.0.6
 Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
 Last Modified: 12-01-2008
 License: GPL

 CURRENT STATUS DATA
 --
 Status File:/var/log/nagios/status.dat
 Status File Age:0d 0h 0m 19s
 Status File Version:3.0.6

 Program Running Time:   0d 2h 5m 28s
 Nagios PID: 12139
 Used/High/Total Command Buffers:0 / 0 / 8192

 Total Services: 2783
 Services Checked:   2783
 Services Scheduled: 2782
 Services Actively Checked:  2783
 Services Passively Checked: 0
 Total Service State Change: 0.000 / 52.830 / 0.263 %
 Active Service Latency: 1.304 /

Re: [Nagios-users] Interesting problem while trying to monitor Oracle RAC services [Solved]

2009-04-08 Thread Kumar, Ashish

 check the environment of the users launching the script. Which user do you
 use to launch the script locally? And which one from remote?

 On nagios server I have tried executing it as root user as well as
 nagios user but the problem remains.



Hello all,

I made a mistake in the plug-in.  Both Perl script and KSH script were
residing in the same directory.  When executing locally it knew where
to look for external shell script. The code was as follows:

my $PIPED = qx# ksh check_oracle_services.sh $SERVICE #;

But when executed from nagios server, NRPE daemon on monitored host
wouldn't know where to look for the shell script hence the wrong
output.  Adding absolute path to the check_oracle_services.sh fixed
the problem

my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh
$SERVICE #;

The new code is as follows (may be someone would find it useful):

check_oracle_services.pl


#!/usr/bin/env perl

use strict;
use Getopt::Std;

my %return_value = (
OK = 0,
CRIT = 2,
UNKNOWN = 3
);

my $message = nagios;
my $exit_status;

my %opt=();
getopts(p:h, \%opt);

sub usage(){
print Usage: $0 -p service_name\n;
exit $return_value{'UNKNOWN'};
}

usage() if defined $opt{'h'};

my $SERVICE = $opt{'p'} if defined $opt{'p'} || usage();

my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh
$SERVICE #;

if ($PIPED =~ /OFFLINE/g) {
$exit_status = $return_value{'CRIT'};
$message = Critical: $SERVICE is not running.;
} else {
$exit_status = $return_value{'OK'};
$message = OK: $SERVICE is running.;
}

print $message\n;
exit $exit_status;




check_oracle_services.sh


#!/usr/bin/ksh

RSC_KEY=$1

/oracle/crs_home/bin/crs_stat -u | awk \
'BEGIN { FS==; state = 0; } \
$1~/NAME/  $2~/'$RSC_KEY'/ {appname = $2; state=1}; \
state == 0 {next;} \
$1~/TARGET/  state == 1 {apptarget = $2; state=2;} \
$1~/STATE/  state == 2 {appstate = $2; state=3;} \
state == 3 {printf %-45s %-18s\n, appname, appstate; state=0;}'



Sorry for the inconvenience caused.

Thanks

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] monitor primergy servers with esx

2009-04-08 Thread Natxo Asenjo

hi,

I have done some tests. One needs the RAID.mib file and this file is
only in the windows package for serverviewraid (tsk, tsk).

Anyway, once we have this file we can test stuff.

Important OIDS:
.1.3.6.1.4.1.231.2.49.1.5.2.1 -
.iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrPhysicalDeviceInfo.svrPhysicalDeviceTable.svrPhysicalDeviceEntry

.1.3.6.1.4.1.231.2.49.1.3 -
.iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrStatus

Situation 1: disks are online and working fine:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3

FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: ok(1)

[j...@pc2668-210307 mibs]$ snmpwalk izvm01 -c public -v 1 -m
FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1

FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.2.3.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.2.3.0 = INTEGER: 2
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.2.3.0 = INTEGER: 3
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.2.3.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.1.0.0 = STRING: ST373455SS
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.2.3.0 = STRING: ST373455SS
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.1.0.0 = STRING: SEAGATE
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.2.3.0 = STRING: SEAGATE
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.1.0.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.2.3.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.1.0.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.2.3.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceType.1.1.0.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceType.1.2.3.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.1.0.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.2.3.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.1.0.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.2.3.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.1.0.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.2.3.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.1.0.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.2.3.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.1.0.0 = STRING: 1651
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.2.3.0 = STRING: 1651
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.1.0.0 = STRING: 3LQ0DA03
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.2.3.0 = STRING: 3LQ0DAD7
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.1.0.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.2.3.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.1.0.0 = INTEGER: 11
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.2.3.0 = INTEGER: 12
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.1.0.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.2.3.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.1.0.0 = INTEGER: 70007
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.2.3.0 = INTEGER: 70007

Disks are online

Situation 2: I remove one disk from its bay

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: failure(3)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

Everything is 'prefailure', except for physicaldevices, it's a
'failure' (disk is physically removed from the bay). I forgot to check
the other OID for this one, I'll post the results later.

Situation 3: 'failed' disk is back in bay, rebuilding starts:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3

FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

everthing is 'prefailure' except for svrStatusPhysicalDevices.0, it is
'ok', disks is in bay.

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.0.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.3.0.3 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.0.0.0 =

[Nagios-users] Nagios and Cacti

2009-04-08 Thread Christopher McAtackney

Hi all,

I've been looking into making use of Cacti to act as an SNMP
management tool which runs alongside my Nagios instance.

Ideally, what I would like to do is have Cacti monitor various
SNMP-exposed metrics on my hosts, and then have a service check in
Nagios which parses Cacti's results (which I believe are RRD files)
and send alerts etc.

Nagios itself will still be used for running directly checks for
services running, errors in log files etc.

Does this approach make sense?

One issue that I can think of is the difficulty in keeping the config
files of Nagios and Cacti synchronised.  I was planning on using Lilac
Platform to act as my Nagios config file management tool, but how that
is kept in synch with Cacti is a problem. Has anyone ever set up an
arrangement like this before?

Cheers,
Chris

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto

can you provide the relevant conf so we may understand more about your problem ?



On Wednesday 08 April 2009 14:41:27 Thierry Lavallée wrote:
 Hi,
 I am getting the following about 50 times per day:

 HTTP  CRITICAL04-08-2009 09:37:29 0d 0h 1m 50s1/4 Name or 
 service not
 known SSH UNKNOWN 04-08-2009 09:34:39 0d 0h 4m 40s1/4 
 Usage:check_ssh
 [-46] [-t timeout] [-r remote version] [-p port] host

 I am somewhat at lost here...
 Can anyone help with this?



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée

or maybe you mean:


check_http

check-host-alive$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c
5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp   $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load$USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf$USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w
$ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs   $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap$USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users   $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt$USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop   $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp   $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp   $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email/usr/bin/printf %b * Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\nHost:
$HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo:
$HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n | /bin/mail -s **
$NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **
$CONTACTEMAIL$
notify-service-by-email /usr/bin/printf %b * Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService:
$SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState:
$SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional
Info:\n\n$SERVICEOUTPUT$ | /bin/mail -s ** $NOTIFICATIONTYPE$
Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **
$CONTACTEMAIL$
process-host-perfdata   /usr/bin/printf %b
$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n
 /usr/local/nagios/var/host-perfdata.out
process-service-perfdata/usr/bin/printf %b
$LASTSERVICECHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$SERVICESTATETYPE$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\n
 /usr/local/nagios/var/service-perfdata.out



check_SSH


check-host-alive$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c
5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp   $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load$USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf$USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w
$ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs   $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap$USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users   $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt$USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop   $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp   $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp   $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email/usr/bin/printf %b * Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\nHost:
$HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo:
$HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n | /bin/mail -s **
$NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **
$CONTACTEMAIL$
notify-service-by-email /usr/bin/printf %b * Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService:
$SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState:
$SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional
Info:\n\n$SERVICEOUTPUT$ | /bin/mail -s ** $NOTIFICATIONTYPE$
Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **
$CONTACTEMAIL$
process-host-perfdata   /usr/bin/printf %b
$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n
 /usr/local/nagios/var/host-perfdata.out
process-service-perfdata

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto

Thierry

The information you need to look into is the configuration files of the nagios 
, located in the etc 
directory of the nagios installation directory .

which version of nagios are you running ?
did you install from source or packages ?
what distro are you using ?

how did you preform the initial installation ?

Assaf


On Wednesday 08 April 2009 15:29:02 Thierry Lavallée wrote:
 thanks a lot for your reply Assaf,

 I am not sure which conf you mean but here are a few things in the
 meantime. Maybe you could point this Nagios newbie what you need? :/

 thanks!
 --
 Thierry

  --
  Assaf Flatto
  SSP Ops Team
  Linux System Administrator
  169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto

Thierry

please send all your replies to the list and not to individual members - so 
other may learn from the 
information gathered by other members .

second :
this is a command declaration in the commands file , have you associated a 
service to the host ?

and even before that - did you install the nagios-plugins ? 
the nagios-plugins are the actual check scripts executed to preform the checks 
, and without them 
you will not be able to use any of nagios capabilities.

Assaf

 check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Marc Powell


On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:

 Here are the important stats:

 Nagios Version: Version 3.1.0
 Proficiency Level: Pretty damned high

 While the first command works fine, and sets the service to an OK  
 state,
 the next two (which I've tried in various combinations) show up in the
 Nagios logs as having been sent, but do nothing. The check that  
 appears
 in the config files keeps running instead of my check_ok check.

 Here's how it shows up in the logs:

 [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
 host;DNS;check_ok
 [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
 host;DNS;check_ok

 I've noticed the message is different if I use an invalid command, so
 I'm relatively sure I'm using the right ones; they just don't do
 anything.

 Event handlers are enabled for these services, but even if they  
 weren't
 the check command should change, right?

 Am I doing something wrong here, or have I run into a bug?

I'm not using 3.x yet but just to provide some feedback, what you're  
doing looks reasonable from my reading of the documentation. I do see  
this in 3.1.0's commands.c though --

 /* SECURITY PATCH - disable these for the time being */
 switch(cmd){
 case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
 case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
 case CMD_CHANGE_HOST_EVENT_HANDLER:
 case CMD_CHANGE_SVC_EVENT_HANDLER:
 case CMD_CHANGE_HOST_CHECK_COMMAND:
 case CMD_CHANGE_SVC_CHECK_COMMAND:
 return ERROR;
 }

That's in the right section and my reading of the code is that it does  
exactly that; prevent changing of those values... Maybe it's something  
being worked on in the development branch?

--
Marc


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Andrew Davis

And just an FYI from my own experience... putting Nagios  Cacti on the 
same server has been somewhat problematic for us. We have over 400 
network devices between switches, routers, WAPs, etc. We also have about 
300 monitored servers. Initially I had Nagios and Cacti both on one 
server with Cacti running via cron every 5 minutes. About every 5 
minutes, my shells would become unresponsive for roughly 30 to 90 
seconds. Turning off either Nagios or Cacti resolved the issue. Running 
both seems to have hammered the server a bit (4Gb of RAM, 2 x dual core 
2.x Ghz CPUs). We don't integrate Cacti and Nagios, however. Nagios does 
both trending and alerts of all servers. Cacti does trending only of all 
network devices/ports. Once I moved Cacti to its own server, all was 
fine as far as load/latency went.


 A. Davis
 Email: ncc...@gmail.com

 There is no limit to what a man can accomplish
  if he doesn't care who gets the credit. - Ronald Reagan



Marco Tirado wrote:

Hello:

There are a couple of examples in the nagios exchange page of 
different approachs for integrating nagios and cacti. You should check 
that out.


I believe the synchronization is going to cost you time and money, a 
better approach is to use nagios + pnp4naigos (this generates nice 
graphs) + check_snmp_int.pl (this for bandwidth tests). That way you 
have only one place to place your configuration.  There are tons of 
other snmp plugins you can use for other tests (CPU, Memory, etc),


//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney 
crist...@gmail.com mailto:crist...@gmail.com wrote:


Hi all,

I've been looking into making use of Cacti to act as an SNMP
management tool which runs alongside my Nagios instance.

Ideally, what I would like to do is have Cacti monitor various
SNMP-exposed metrics on my hosts, and then have a service check in
Nagios which parses Cacti's results (which I believe are RRD files)
and send alerts etc.

Nagios itself will still be used for running directly checks for
services running, errors in log files etc.

Does this approach make sense?

One issue that I can think of is the difficulty in keeping the config
files of Nagios and Cacti synchronised.  I was planning on using Lilac
Platform to act as my Nagios config file management tool, but how that
is kept in synch with Cacti is a problem. Has anyone ever set up an
arrangement like this before?

Cheers,
Chris


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
mailto:Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null




--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com


___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Christopher McAtackney

2009/4/8 Andrew Davis ncc...@gmail.com:
 And just an FYI from my own experience... putting Nagios  Cacti on the same
 server has been somewhat problematic for us. We have over 400 network
 devices between switches, routers, WAPs, etc. We also have about 300
 monitored servers. Initially I had Nagios and Cacti both on one server with
 Cacti running via cron every 5 minutes. About every 5 minutes, my shells
 would become unresponsive for roughly 30 to 90 seconds. Turning off either
 Nagios or Cacti resolved the issue. Running both seems to have hammered the
 server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't integrate
 Cacti and Nagios, however. Nagios does both trending and alerts of all
 servers. Cacti does trending only of all network devices/ports. Once I moved
 Cacti to its own server, all was fine as far as load/latency went.

That's useful to know Andrew, thanks.

Regarding the trending of network devices - is there any reason why
this can't be done by Nagios? I intend to install PNP4Nagios to take
care of graphing anyway, but I think it would be nice to have all my
monitored resources under the one system (for notifications and ease
of administration).

Is there some major advantage that Cacti provides when it comes to
SNMP monitoring of network devices that cannot be achieved with Nagios
and the various SNMP plug-ins available for it (e.g. like these ones
http://nagios.manubulon.com) ?

Cheers,
Chris

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée

thanks Assaf,
But I cannot get passed step #1 Relax - it's going to take some time. ;)
hehe.

Really, I don't want to redo the whole install as I am not THAT server
techy and I try to rely on a service called Supreme Support that are
supposed to make my life better. I need to point to Supreme Support
where the problem is.

Do you think you can point it out quickly from the config files I sent?

Because most seems installed correctly, it looks there are loose handles.

thanks again!
-- 
Thierry

2009/4/8 Assaf Flatto assaf.fla...@ssp-intl.com:

 In that case I suggest you start here :
 http://nagios.sourceforge.net/docs/3_0/beginners.html


 read the documentation  and most of your questions will be answered.


 Assaf


 On Wednesday 08 April 2009 16:00:35 you wrote:
 thanks Assaf.
 I did not do the installation, but the support who did the
 installation (Supreme Support) do not seem knowledgeable enough.

 I am attaching my config files (please tell me if not secure to send like
 this) I am runnig 3.0.6
 No idea about Distro

 hoping you can still help me.
 thanks!


 --
 Assaf Flatto
 SSP Ops Team
 Linux System Administrator
 169 Euston Road, London, NW1 2AE





 IMPORTANT . this email and the information in it may be confidential, legally
 privileged and/or protected by law. It is intended solely for the use of the
 person to whom it is addressed. If you are not the intended recipient, please
 notify the sender immediately and do not disclose the contents to any other
 person, use it for any purpose, or store or copy the information in any 
 medium.
 Please also delete all copies of this email and any attachments from your
 system.

 We cannot guarantee the security or confidentiality of email communications. 
 We
 do not accept any liability for losses or damages that you may suffer as a
 result of your receipt of this email including but not limited to computer
 service or system failure, access delays or interruption, data non-delivery or
 mis-delivery, computer viruses or other harmful components.

 Copyright in this email and any attachments belong to Select Service Partner 
 UK
 Limited. Should you communicate with anyone at Select Service Partner UK 
 Limited by
 email, you consent to us monitoring and reading any such correspondence.

 Nothing in this email shall be taken or read as suggesting, proposing or
 relating to any agreement concerted practice or other practice that could
 infringe UK or EC competition legislation.

 Select Service Partner UK Limited is a company registered in England and Wales
 (company number 05687183) whose registered office is at 1 The Heights, 
 Brooklands, Weybridge. Surrey. KT13 0NY



--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] problems with log rotation

2009-04-08 Thread Eric Doutreleau

hi

i used nagios3.1
i have configured a daily log rotation

but after several days i suspect there s a problem with the log rotation

i found several instance of nagios which the majority are defunct ones
and they all begun at 00:00 but on different days.

and the scheduler is going mad.
a lot of check are not launched and i got a lot of these messages

[1239201355] Warning: The check of service 'memoire disponible' on host
'www-tp' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'nombre total de process' on
host 'www-tp' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'HTTP' on host 'yum' looks
like it was orphaned (results never came back).  I'm scheduling an
immediate check of the service...
[1239201355] Warning: The check of service 'charge' on host 'yum' looks
like it was orphaned (results never came back).  I'm scheduling an
immediate check of the service...
[1239201355] Warning: The check of service 'nombre d utilisateur' on
host 'yum' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'ciscoswitch' on host
'Indicateurs' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'smtpsout' on host
'Indicateurs_detail' looks like it was orphaned (results never came
back).  I'm scheduling an immediate check of the service...

Does someone knows how to solve that problems?

thanks in advance for any help


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto


From the first glance it looks like they just installed it and never bothered 
to configure anything 
in it .

not nagios is a great tool , but it requires quite a bit of initial 
configuration and setup , which 
looks like the didn't do .

Assaf



On Wednesday 08 April 2009 16:20:06 Thierry Lavallée wrote:
 thanks Assaf,
 But I cannot get passed step #1 Relax - it's going to take some time. ;)
 hehe.

 Really, I don't want to redo the whole install as I am not THAT server
 techy and I try to rely on a service called Supreme Support that are
 supposed to make my life better. I need to point to Supreme Support
 where the problem is.

 Do you think you can point it out quickly from the config files I sent?

 Because most seems installed correctly, it looks there are loose handles.

 thanks again!



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE






IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Daniel Emmanuel Feinsmith


It depends on the intensity of your snmp usage. Cacti has a native  
daemon to do large scale snmp getting, and it does a great job of it.  
So if u have hundreds of devices, each with a lot of interfaces, u  
will probably like cacti. The user interface is also well done for  
graphing snmp data and thresholding on it using the threshold plugin.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:15 AM, Christopher McAtackney  
crist...@gmail.com wrote:

 2009/4/8 Andrew Davis ncc...@gmail.com:
 And just an FYI from my own experience... putting Nagios  Cacti on  
 the same
 server has been somewhat problematic for us. We have over 400 network
 devices between switches, routers, WAPs, etc. We also have about 300
 monitored servers. Initially I had Nagios and Cacti both on one  
 server with
 Cacti running via cron every 5 minutes. About every 5 minutes, my  
 shells
 would become unresponsive for roughly 30 to 90 seconds. Turning off  
 either
 Nagios or Cacti resolved the issue. Running both seems to have  
 hammered the
 server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't  
 integrate
 Cacti and Nagios, however. Nagios does both trending and alerts of  
 all
 servers. Cacti does trending only of all network devices/ports.  
 Once I moved
 Cacti to its own server, all was fine as far as load/latency went.

 That's useful to know Andrew, thanks.

 Regarding the trending of network devices - is there any reason why
 this can't be done by Nagios? I intend to install PNP4Nagios to take
 care of graphing anyway, but I think it would be nice to have all my
 monitored resources under the one system (for notifications and ease
 of administration).

 Is there some major advantage that Cacti provides when it comes to
 SNMP monitoring of network devices that cannot be achieved with Nagios
 and the various SNMP plug-ins available for it (e.g. like these ones
 http://nagios.manubulon.com) ?

 Cheers,
 Chris

 --- 
 --- 
 --- 
 -
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Daniel Emmanuel Feinsmith

If you move your mysql instance to another server, you can get much  
better performance on a nagios/cacti server. Check top while cacti is  
running a large install and you will see that mysql is hoarding CPU  
and memory resources not leaving much for nagios.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis ncc...@gmail.com wrote:

 And just an FYI from my own experience... putting Nagios  Cacti on  
 the same server has been somewhat problematic for us. We have over  
 400 network devices between switches, routers, WAPs, etc. We also  
 have about 300 monitored servers. Initially I had Nagios and Cacti  
 both on one server with Cacti running via cron every 5 minutes.  
 About every 5 minutes, my shells would become unresponsive for  
 roughly 30 to 90 seconds. Turning off either Nagios or Cacti  
 resolved the issue. Running both seems to have hammered the server a  
 bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't integrate  
 Cacti and Nagios, however. Nagios does both trending and alerts of  
 all servers. Cacti does trending only of all network devices/ports.  
 Once I moved Cacti to its own server, all was fine as far as load/ 
 latency went.
   A. Davis
   Email: ncc...@gmail.com

   There is no limit to what a man can accomplish
if he doesn't care who gets the credit. - Ronald Reagan


 Marco Tirado wrote:

 Hello:

 There are a couple of examples in the nagios exchange page of  
 different approachs for integrating nagios and cacti. You should  
 check that out.

 I believe the synchronization is going to cost you time and money,  
 a better approach is to use nagios + pnp4naigos (this generates  
 nice graphs) + check_snmp_int.pl (this for bandwidth tests). That  
 way you have only one place to place your configuration.  There are  
 tons of other snmp plugins you can use for other tests (CPU,  
 Memory, etc),

 //Marco

 On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney crist...@gmail.com 
  wrote:
 Hi all,

 I've been looking into making use of Cacti to act as an SNMP
 management tool which runs alongside my Nagios instance.

 Ideally, what I would like to do is have Cacti monitor various
 SNMP-exposed metrics on my hosts, and then have a service check in
 Nagios which parses Cacti's results (which I believe are RRD files)
 and send alerts etc.

 Nagios itself will still be used for running directly checks for
 services running, errors in log files etc.

 Does this approach make sense?

 One issue that I can think of is the difficulty in keeping the config
 files of Nagios and Cacti synchronised.  I was planning on using  
 Lilac
 Platform to act as my Nagios config file management tool, but how  
 that
 is kept in synch with Cacti is a problem. Has anyone ever set up an
 arrangement like this before?

 Cheers,
 Chris

 --- 
 --- 
 --- 
 -
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/ 
 null


 --- 
 --- 
 --- 
 -
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com

 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/ 
 null

 --- 
 --- 
 --- 
 -
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Max

On Wed, Apr 8, 2009 at 11:52 AM, Daniel Emmanuel Feinsmith
dan...@danielemmanuelfeinsmith.com wrote:

 It depends on the intensity of your snmp usage. Cacti has a native
 daemon to do large scale snmp getting, and it does a great job of it.
 So if u have hundreds of devices, each with a lot of interfaces, u
 will probably like cacti. The user interface is also well done for
 graphing snmp data and thresholding on it using the threshold plugin.

With parallel checks in Nagios 3 and some configuration tuning and
well-written SNMP checks, I'd argue that Nagios is as good if not a
better poller than cactid :).   our instance is not huge, but
currently we do 7000+ SNMP-based checks in 3 minutes on a dual
quad-core Linux-based server.

Before PNP I used to use Cacti and Nagios.  I like Cacti, but with PNP
around I would never go back to that combination again .. Nagios + PNP
really does simplify life for Nagios administrators and provides a lot
of flexibility as far as how you scale your graphing as your node base
grows.

- Max

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Andrew Davis

I agree. Initially I had Nagios doing all the trending. But with 400+ 
network devices and many of them with multiple 48 port blades, I found 
Cacti was easier to configure... it scaled a lot better. For a smaller 
network, you could easily do just Nagios. I've had no issues at all with 
Nagios + PNP for alerts and trending. In fact, Nagios still watches my 
core network devices (but not all the ports of them... ie: Nagios 
watches that switch1 is up and available and trends its CPU and memory 
usage... however I use Cacti for trending the 6 blades each with 48 
ports in switch1). This way, if switch1 fails or utilization is too 
high, Nagios tells me, but if a particular user is hogging all our 
bandwidth or having lots of packet loss, I find that via Cacti.


 A. Davis
 Email: ncc...@gmail.com

 There is no limit to what a man can accomplish
  if he doesn't care who gets the credit. - Ronald Reagan



Daniel Emmanuel Feinsmith wrote:
It depends on the intensity of your snmp usage. Cacti has a native  
daemon to do large scale snmp getting, and it does a great job of it.  
So if u have hundreds of devices, each with a lot of interfaces, u  
will probably like cacti. The user interface is also well done for  
graphing snmp data and thresholding on it using the threshold plugin.


=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:15 AM, Christopher McAtackney  
crist...@gmail.com wrote:


  

2009/4/8 Andrew Davis ncc...@gmail.com:

And just an FYI from my own experience... putting Nagios  Cacti on  
the same

server has been somewhat problematic for us. We have over 400 network
devices between switches, routers, WAPs, etc. We also have about 300
monitored servers. Initially I had Nagios and Cacti both on one  
server with
Cacti running via cron every 5 minutes. About every 5 minutes, my  
shells
would become unresponsive for roughly 30 to 90 seconds. Turning off  
either
Nagios or Cacti resolved the issue. Running both seems to have  
hammered the
server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't  
integrate
Cacti and Nagios, however. Nagios does both trending and alerts of  
all
servers. Cacti does trending only of all network devices/ports.  
Once I moved

Cacti to its own server, all was fine as far as load/latency went.
  

That's useful to know Andrew, thanks.

Regarding the trending of network devices - is there any reason why
this can't be done by Nagios? I intend to install PNP4Nagios to take
care of graphing anyway, but I think it would be nice to have all my
monitored resources under the one system (for notifications and ease
of administration).

Is there some major advantage that Cacti provides when it comes to
SNMP monitoring of network devices that cannot be achieved with Nagios
and the various SNMP plug-ins available for it (e.g. like these ones
http://nagios.manubulon.com) ?

Cheers,
Chris

--- 
--- 
--- 
-

This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when  
reporting any issue.

::: Messages without supporting info will risk being sent to /dev/null




--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
  
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios Warning Bug/Misconfiguration

2009-04-08 Thread Thomas Donnelly

Got the issue. Turns out someone installed the Cisco MIBS along with the 
net snmp mibs. Thank you very much for your help.

For anyone searching and finds this check

/usr/local/share/snmp/mibs

and see if you have some-snmp-mib.txt and some-snmp-mib.my


Thanks Again!
-=Tom Donnelly


Patrick Morris wrote:
 On Tue, 07 Apr 2009, Thomas Donnelly wrote:

   
 Thanks for the quick reply!

 Ran from the command line got:

 # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95: -c 75:
 SNMP WARNING - 100 | SNMPv2-SMI::mib-2.33.1.2.4.0=100

 # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95 -c 75:
 SNMP WARNING - *100* | SNMPv2-SMI::mib-2.33.1.2.4.0=100

 So by intentionally triggering it again (remove :), it shows the *'s
 

 How about if you add a -v to get verbose output?

 Also, you may want to check the return code from the manual run on an OK
 resultx (for example, by running echo $? aafter your check_snmp
 command to make sure it matches what you see in the output).

 What happens in my case occasionally is that I install a screwed-up MIB
 for an unrelated service. It won't show any obvious errors, but it will
 cause check_snmp to return a warning result code regardless of whether
 the SNMP result falls within my thresholds.

 In effect, it's warning me that my MIBs are hosed, based on
 the fact that it got a non-OK result from snmpget (which is what
 check_snmp calls to do the actual SNMP getting).



   
 Not really sure what they mean by:

 1. Prevent check_snmp from loading the MIBs (default behaviour) by using 
 numeric oids AND using the -m : option
 

 If you a numeric OID rather than mib-2.33.1.2.4.0 and pass the -m :
 then check_snmp (and, by extension, snmpget) don't need to load the MIBs
 at all, so you don't get an error if you've got a bad MIB.

   


 Patrick Morris wrote:
 
 What happens when you run it manually? 

 This, maybe?

 http://www.nagios.org/faqs/viewfaq.php?faq_id=208

 On Tue, 07 Apr 2009, Thomas Donnelly wrote:

   
   
 Hi all,

 I am having an issue with all of the devices I added showing warning all 
 the time. It is a simple snmp check to see if the amps are above 
 160warn/180critical. They always say warning even though they are less 
 than the specified 160. One thing to note is once it hits the 160 mark 
 it gets the * value * in turn, showing that it is actually in the 
 warning range. I have shown the neccesary data I hope below. Any/all 
 help is greatly appreciated.



 # uname -a
 FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue 
 Oct  9 22:39:13 EST 2007 
 r...@server.example.net:/usr/obj/usr/src/sys/MONITOR  i386

 Nagios
 Version 2.0b3


 # ./check_snmp -V
 check_snmp (nagios-plugins 1.4.3) 1.58



  From the webui

 APC-RR-R3-1.hou
  check_rr_amp
  WARNING  04-07-2009 13:00:17   8d 3h 17m 2s  10/10  SNMP WARNING - 90

  APC-RR-R3-2.hou
  check_rr_amp
  WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*

 ^note the * 160 * for the one that actually is in the warning range.


 from checkcommands.cfg

 define command {
command_name check_rr_amp
command_line$USER1$/check_snmp -H $HOSTADDRESS$ -o 
 mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
 }


  From the hosts config file.

 define service{
host_name   APC-RR-R1-1.hou
service_description check_rr_amp
check_command   check_rr_amp!159!179
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  backup-admins
 }



 Thanks!
 -=Tom


 --
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when 
 reporting any issue. 
 ::: Messages without supporting info will risk being sent to /dev/null
 
 


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS

Re: [Nagios-users] Children unreachable on soft down?

2009-04-08 Thread Marc Powell


On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:

 So is this just something I'll have to live with? I don't seem to be
 getting much feedback on the subject. :(

Well, my response would be to fix the problem that's causing the  
outages in the first place or adjust the way you're monitoring the  
parents so that the plugin used recognizes when this temporary event  
is occurring. What you're asking for is that nagios track that the  
child went from down-unreachable-down without an intermediate OK  
state and suppress notifications in that case. That would appear to be  
a code change and would be better discussed on nagios-devel but I  
would encourage the check plugin approach first.

--
Marc


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread gmarzot


 Is there some major advantage that Cacti provides when it comes to
 SNMP monitoring of network devices that cannot be achieved with Nagios
 and the various SNMP plug-ins available for it (e.g. like these ones
 http://nagios.manubulon.com) ?

Also does anyone have some nagios config examples integrating PNP and
these SNMP plugins...

I have been trying to get an idea how to create the commands.cfg and
services.cfg using these parts... Any examples of host based checks
would be great. 

I have tried to read the relevant docs but have not found explicit
nagios .cfg examples... if they exist a gentle pointer would also be
great.

thank you, Giovanni


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Children unreachable on soft down?

2009-04-08 Thread Israel Brewster





On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:


 On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:

 So is this just something I'll have to live with? I don't seem to be
 getting much feedback on the subject. :(

 Well, my response would be to fix the problem that's causing the
 outages in the first place or adjust the way you're monitoring the
 parents so that the plugin used recognizes when this temporary event
 is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I  
explained in one of my e-mail, they are an artifact of the connection  
type), so that leaves us with adjusting the monitoring. Now I thought  
that the recheck options were there exactly for this reason: to catch  
brief outages and not alert. And for the parent host that seems to be  
the case, but apparently that logic doesn't carry on to the child  
hosts. As such, somehow things would need to be adjusted so it never  
even sees the outages, even enough to go into a soft down state.  
Anyone have any suggestions for how I can accomplish this? Adjusting  
the timeout or using, say, an ssh check rather than icmp won't do it -  
the packets are still lost, and the ssh check would still timeout..  
Perhaps if I sent more pings at longer intervals (so that if it  
doesn't get a response the single check retries at 15 second intervals  
or so before returning a response), but then the check would start  
taking several seconds or more to complete, and that wouldn't be a  
good thing. Assuming nagios even allowed a check to run that long -  
doesn't it have a mechanism to kill a check that doesn't return in a  
given time frame? I'm a little stumped here how I can adjust things.

 What you're asking for is that nagios track that the
 child went from down-unreachable-down without an intermediate OK
 state and suppress notifications in that case. That would appear to be
 a code change and would be better discussed on nagios-devel but I
 would encourage the check plugin approach first.

Ok. I know there is code in there that know who it sent down messages  
to and doesn't send up messages to people that didn't get a down  
(primarily dealing with escalations) so I was hoping that maybe there  
would be something similar for this, i.e. seeing that the last  
notification sent was a down notification, and as such there is no  
need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

 --
 Marc


 --
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] multiple parents

2009-04-08 Thread Lori Adams

We have several hosts that have multiple parents.

Will the child notify as down if only one of the parents is down?  Or will the 
child suppress notifications because one of its parents is down?

-Lori
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Children unreachable on soft down?

2009-04-08 Thread Christopher Burke

I wonder if there is something you can do with notification escalations?
I know you can control how the notifications are sent out, but I don't
know if a state change from down to unreachable to down will cause the
escalation to reset.

 

 

From: Israel Brewster [mailto:isr...@frontierflying.com] 
Sent: Wednesday, April 08, 2009 2:32 PM
To: Marc Powell
Cc: nagios-users@lists.sourceforge.net Users
Subject: Re: [Nagios-users] Children unreachable on soft down?

 





On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:


 On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:

 So is this just something I'll have to live with? I don't seem to be
 getting much feedback on the subject. :(

 Well, my response would be to fix the problem that's causing the
 outages in the first place or adjust the way you're monitoring the
 parents so that the plugin used recognizes when this temporary event
 is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I 
explained in one of my e-mail, they are an artifact of the connection 
type), so that leaves us with adjusting the monitoring. Now I thought 
that the recheck options were there exactly for this reason: to catch 
brief outages and not alert. And for the parent host that seems to be 
the case, but apparently that logic doesn't carry on to the child 
hosts. As such, somehow things would need to be adjusted so it never 
even sees the outages, even enough to go into a soft down state. 
Anyone have any suggestions for how I can accomplish this? Adjusting 
the timeout or using, say, an ssh check rather than icmp won't do it - 
the packets are still lost, and the ssh check would still timeout.. 
Perhaps if I sent more pings at longer intervals (so that if it 
doesn't get a response the single check retries at 15 second intervals 
or so before returning a response), but then the check would start 
taking several seconds or more to complete, and that wouldn't be a 
good thing. Assuming nagios even allowed a check to run that long - 
doesn't it have a mechanism to kill a check that doesn't return in a 
given time frame? I'm a little stumped here how I can adjust things.

 What you're asking for is that nagios track that the
 child went from down-unreachable-down without an intermediate OK
 state and suppress notifications in that case. That would appear to be
 a code change and would be better discussed on nagios-devel but I
 would encourage the check plugin approach first.

Ok. I know there is code in there that know who it sent down messages 
to and doesn't send up messages to people that didn't get a down 
(primarily dealing with escalations) so I was hoping that maybe there 
would be something similar for this, i.e. seeing that the last 
notification sent was a down notification, and as such there is no 
need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---

 --
 Marc



--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread jmoseley

I agree with Daniel's post below.  We have Nagios and Cacti running on the
same system; Nagios monitors 691 hosts and 1800 services while Cacti is
pulling stats for about the same number of hosts, but something like 3200
data sources.  They run on a dual Xeon 2.8 Ghz box with only 2 Gb or RAM
(no swapping going on).  Average load is about 1.5 and peaks at 3 about 3-4
times a day.
The key is that mysql operations are on a dedicated box with 15k SCSI
drives and RAID 10.


James Moseley




   
 Daniel Emmanuel   
 Feinsmith 
 dan...@danielemm  To 
 anuelfeinsmith.co ncc...@gmail.com  
 mncc...@gmail.com  
cc 
 04/08/2009 10:36  Nagios Users
 AMNagios-users@lists.sourceforge.net 
  
   Subject 
   Re: [Nagios-users] Nagios and Cacti 
   
   
   
   
   
   




If you move your mysql instance to another server, you can get much better
performance on a nagios/cacti server. Check top while cacti is running a
large install and you will see that mysql is hoarding CPU and memory
resources not leaving much for nagios.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis ncc...@gmail.com wrote:

  And just an FYI from my own experience... putting Nagios  Cacti on
  the same server has been somewhat problematic for us. We have over
  400 network devices between switches, routers, WAPs, etc. We also
  have about 300 monitored servers. Initially I had Nagios and Cacti
  both on one server with Cacti running via cron every 5 minutes. About
  every 5 minutes, my shells would become unresponsive for roughly 30
  to 90 seconds. Turning off either Nagios or Cacti resolved the issue.
  Running both seems to have hammered the server a bit (4Gb of RAM, 2 x
  dual core 2.x Ghz CPUs). We don't integrate Cacti and Nagios,
  however. Nagios does both trending and alerts of all servers. Cacti
  does trending only of all network devices/ports. Once I moved Cacti
  to its own server, all was fine as far as load/latency went.
A. Davis
Email: ncc...@gmail.com

There is no limit to what a man can accomplish
 if he doesn't care who gets the credit. - Ronald Reagan



  Marco Tirado wrote:
Hello:

There are a couple of examples in the nagios exchange page of
different approachs for integrating nagios and cacti. You
should check that out.

I believe the synchronization is going to cost you time and
money, a better approach is to use nagios + pnp4naigos (this
generates nice graphs) + check_snmp_int.pl (this for bandwidth
tests). That way you have only one place to place your
configuration.  There are tons of other snmp plugins you can
use for other tests (CPU, Memory, etc),

//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney 
crist...@gmail.com wrote:
  Hi all,

  I've been looking into making use of Cacti to act as an SNMP
  management tool which runs alongside my Nagios instance.

  Ideally, what I would like to do is have Cacti monitor
  various
  SNMP-exposed metrics on my hosts, and then have a service
  check in
  Nagios which parses Cacti's results (which I believe are RRD
  files)
  and send alerts etc.

  Nagios itself will still be used for running directly checks
  for
  services running, errors in log files etc.

  Does this approach make sense?

  One issue that I can think of is the difficulty in keeping
  the config
  files of Nagios and Cacti synchronised.  I was planning on
  using Lilac
  Platform to act as my

Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Patrick Morris

On Wed, 08 Apr 2009, Marc Powell wrote:

 
 On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
 
  Here are the important stats:
 
  Nagios Version: Version 3.1.0
  Proficiency Level: Pretty damned high
 
  While the first command works fine, and sets the service to an OK  
  state,
  the next two (which I've tried in various combinations) show up in the
  Nagios logs as having been sent, but do nothing. The check that  
  appears
  in the config files keeps running instead of my check_ok check.
 
  Here's how it shows up in the logs:
 
  [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
  host;DNS;check_ok
  [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
  host;DNS;check_ok
 
  I've noticed the message is different if I use an invalid command, so
  I'm relatively sure I'm using the right ones; they just don't do
  anything.
 
  Event handlers are enabled for these services, but even if they  
  weren't
  the check command should change, right?
 
  Am I doing something wrong here, or have I run into a bug?
 
 I'm not using 3.x yet but just to provide some feedback, what you're  
 doing looks reasonable from my reading of the documentation. I do see  
 this in 3.1.0's commands.c though --
 
  /* SECURITY PATCH - disable these for the time being */
  switch(cmd){
  case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
  case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
  case CMD_CHANGE_HOST_EVENT_HANDLER:
  case CMD_CHANGE_SVC_EVENT_HANDLER:
  case CMD_CHANGE_HOST_CHECK_COMMAND:
  case CMD_CHANGE_SVC_CHECK_COMMAND:
  return ERROR;
  }
 
 That's in the right section and my reading of the code is that it does  
 exactly that; prevent changing of those values... Maybe it's something  
 being worked on in the development branch?

Thanks! I should have done some code-diving, because that goes pretty
far toward explaining why those commands don't work for me as currently
documented.



--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] how to add Q1,Q2,Q3,Q4 reports

2009-04-08 Thread XYZ XYZ

Any idea how do i add few custom reports to nagios report period drop down 
list in availability report like Q1(first quarter), Q2(second quarter)... etc.





  --
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] how to add Q1,Q2,Q3,Q4 reports

2009-04-08 Thread Andy Shellam

XYZ XYZ wrote:
 Any idea how do i add few custom reports to nagios report period 
 drop down list in availability report like Q1(first quarter), 
 Q2(second quarter)... etc.




Edit the source code and recompile?

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Andreas Ericsson

Marc Powell wrote:
 On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
 
 Here are the important stats:

 Nagios Version: Version 3.1.0
 Proficiency Level: Pretty damned high
 
 While the first command works fine, and sets the service to an OK  
 state,
 the next two (which I've tried in various combinations) show up in the
 Nagios logs as having been sent, but do nothing. The check that  
 appears
 in the config files keeps running instead of my check_ok check.

 Here's how it shows up in the logs:

 [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
 host;DNS;check_ok
 [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
 host;DNS;check_ok

 I've noticed the message is different if I use an invalid command, so
 I'm relatively sure I'm using the right ones; they just don't do
 anything.

 Event handlers are enabled for these services, but even if they  
 weren't
 the check command should change, right?

 Am I doing something wrong here, or have I run into a bug?
 
 I'm not using 3.x yet but just to provide some feedback, what you're  
 doing looks reasonable from my reading of the documentation. I do see  
 this in 3.1.0's commands.c though --
 
  /* SECURITY PATCH - disable these for the time being */
  switch(cmd){
  case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
  case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
  case CMD_CHANGE_HOST_EVENT_HANDLER:
  case CMD_CHANGE_SVC_EVENT_HANDLER:
  case CMD_CHANGE_HOST_CHECK_COMMAND:
  case CMD_CHANGE_SVC_CHECK_COMMAND:
  return ERROR;
  }
 
 That's in the right section and my reading of the code is that it does  
 exactly that; prevent changing of those values... Maybe it's something  
 being worked on in the development branch?
 

It's not. That snippet comes from Nov 30 2008 as a measure to prevent
CVE-2008-5027 (cmd.cgi authorization bypass vulnerability) and
CVE-2008-5028 (cross-site request forgery) from becoming remote command
execution vulnerabilities.

Ethan added that snippet as an extra security measure. It's been in
Nagios since 3.0.4.

Assuming both the patches I sent are applied, it's safe to remove that
particular snippet and recompile Nagios.


I wrote about the two vulnerabilities here in case anyone needs to
refresh their memory:
http://blogs.op5.org/blog4.php/2008/11/11/nagios-cmd-cgi-authorization-bypass-vuln
http://blogs.op5.org/blog4.php/2008/11/11/cross-site-request-forgery-vulnerability-6

The patches to prevent them are available here:
http://git.op5.org/git/?p=nagios.git;a=shortlog;h=refs/heads/security

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] multiple parents

2009-04-08 Thread Andreas Ericsson

Lori Adams wrote:
 We have several hosts that have multiple parents.
 
 Will the child notify as down if only one of the parents is down?  Or
 will the child suppress notifications because one of its parents is
 down?
 

All parents of a host has to be down for it to become unreachable.
If you have configured the host (and your contacts) to send notifications
on unreachable states, you may still get notifications for it, but not
HOST DOWN ones.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Hostnames and regex

2009-04-08 Thread Niall O Broin

I recently changed a Nagios configuration to use a regex with  
hostnames but it hasn't been entirely satisfactory. I have e.g.

define service{
 use generic-service
 host_name host*
 service_description Total Processes
 check_command snmp_procs!120!150

but for some hosts, I need to use different thresholds. I tried to use

define service{
 use generic-service
 host_name hostNN
 service_description Total Processes
 check_command snmp_procs!200!250

but hostNN still alerts based on the host* thresholds. I tried placing  
the definition for host NN before AND after the host* definition - it  
made no difference. From what I read of Nagios regex, they're not full  
regex so it wouldn't be possible to write one which matched host* but  
didn't match hostNN.

Is there a way of doing what I want, apart from the obvious one of  
renaming the hosts which I don't want to match the regex?


__
Kindest regards,


Niall  O Broin
MakaluMedia Group | http://makalumedia.com


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] NSClient not providing results

2009-04-08 Thread Murali Krishnan S

Hi,

  When I set NSClient++ to start automatically in services, its running but
not providing the results. All I'm getting is

could not fetch information from server .

Even when I run the check_nt plugin manually also, I'm getting the same
result.

[r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119
-p 12489 -v MEMUSE  -w 80 -c 90
could not fetch information from server

After searching and reading mailing lists I've tweaked certain things like
disabling firewall,  enabling DEP, etc. Still the same result.

Only one way I could make the NSClient to respond is,  running it in Command
line with -test option.

c:\NSClient++NSClient++.exe -test

When I run this, I'm getting the CHECKS GREEN..

[r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119
-p 12489 -v MEMUSE  -w 80 -c 90
Memory usage: total:4308.71 Mb - used: 1606.41 Mb (37%) - free: 2702.30 Mb
(63%) | 'Memory usage'=1606.41Mb;3446.97;3877.84;0.00;4308.71

Everytime I reboot the machine. I need to start this in command line.  Any
solution, inputs ?   Please provide...

Thanks.

--
Regards
Mkrish
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Monitor netstat connection states using nagios.

2009-04-08 Thread asam30

Hi All,

I am using check_tcp to check status of a particular port on the server.
This is working good. I would also need to monitor LISTEN status (SYN_RECV)
of that port. for ex,

I have a ldap service running on port 3890, so the command

netstat  -anp  | grep 3890

tcp0  0 0.0.0.0:3890 0.0.0.0:*
   LISTEN 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:8831
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:61052
ESTABLISHED 16029/java
tcp  228  0 10.121.30.121:3890  10.121.6.1:49440
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:11664
  SYN_RECV16029/java

The establish connections are ok to allow, but we need to monitor SYN_RECV
status. If there is any such(SYN_RECV) connection appears, we immediately
get an alert from nagios. Is there any way to monitor such states with
nagios or check_tcp?

I have written some shell script to monitor such events, but also I would
like to integrate that scripts into nagios? Is that possible?

Please help me

Thanks


-- 
Shankar
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Monitor netstat connection states using nagios.

2009-04-08 Thread asam30

Hi All,

I am using check_tcp to check status of a particular port on the server.
This is working good. I would also need to monitor LISTEN status (SYN_RECV)
of that port. for ex,

I have a ldap service running on port 3890, so the command

netstat  -anp  | grep 3890

tcp0  0 0.0.0.0:3890 0.0.0.0:*
   LISTEN 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:8831
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:61052
ESTABLISHED 16029/java
tcp  228  0 10.121.30.121:3890  10.121.6.1:49440
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:11664
  SYN_RECV16029/java

The establish connections are ok to allow, but we need to monitor SYN_RECV
status. If there is any such(SYN_RECV) connection appears, we immediately
get an alert from nagios. Is there any way to monitor such states with
nagios or check_tcp?

I have written some shell script to monitor such events, but also I would
like to integrate that scripts into nagios? Is that possible?

Please help me or provide some suggestions

-- 
Shankar
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Nagios for IRIX

2009-04-08 Thread amol.bute

Greeting.

 

 

I have IRIX 6.5 servers which I like to monitor using Nagios 3.0. I am
able to monitor other servers like Linux  Windows but not able to
configure nagios on IRIX OS servers.

 

Please guide me how can I install nagios plug-in on IRIX OS and
configure. Where do we will get nagios plug-ins for IRIX.

 

 

 

Thanks and Regards

Amol Bute

 

**
Email Disclaimer:

Information contained and transmitted by this e-mail (including any 
attachments) is confidential, proprietary and legally privileged data of Tata 
Technologies that is intended for use only by the addressee. If you are not the 
intended recipient, you are notified that any review, use, dissemination, 
distribution, copying or printing of this e-mail is strictly prohibited. You 
are requested to delete this e-mail or any copies immediately and notify the 
sender by reply email. Internet communications cannot be guaranteed to be 
timely, secure, error or virus-free.  Tata Technologies does not accept any 
liability for virus infected email or errors or omissions or consequences which 
may arise as a result of this e-mail transmission. To know more about Tata 
Technologies please visit http://www.tatatechnologies.com

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] serious performance issue

Re: [Nagios-users] Interesting problem while trying to monitor Oracle RAC services [Solved]

Re: [Nagios-users] monitor primergy servers with esx

[Nagios-users] Nagios and Cacti

Re: [Nagios-users] Name or service not known

Re: [Nagios-users] Name or service not known

Re: [Nagios-users] Name or service not known

Re: [Nagios-users] Name or service not known

Re: [Nagios-users] Adaptive Monitoring: Broken?

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Name or service not known

[Nagios-users] problems with log rotation

Re: [Nagios-users] Name or service not known

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Nagios Warning Bug/Misconfiguration

Re: [Nagios-users] Children unreachable on soft down?

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Children unreachable on soft down?

[Nagios-users] multiple parents

Re: [Nagios-users] Children unreachable on soft down?

Re: [Nagios-users] Nagios and Cacti

Re: [Nagios-users] Adaptive Monitoring: Broken?

[Nagios-users] how to add Q1,Q2,Q3,Q4 reports

Re: [Nagios-users] how to add Q1,Q2,Q3,Q4 reports

Re: [Nagios-users] Adaptive Monitoring: Broken?

Re: [Nagios-users] multiple parents

[Nagios-users] Hostnames and regex

[Nagios-users] NSClient not providing results

[Nagios-users] Monitor netstat connection states using nagios.

[Nagios-users] Monitor netstat connection states using nagios.

[Nagios-users] Nagios for IRIX

35 matches

Site Navigation

Mail list logo

Footer information