[Nagios-users] High Availabilty with Nagios

2013-05-09 Thread Steve Shipway
/active Nagios setup? Would be interested in hearing all ideas... Steve Steve Shipway University of Auckland ITS UNIX Systems Design Lead s.ship...@auckland.ac.nz Ph: +64 9 373 7599 ext 86487

[Nagios-users] Monitoring Cisco Ironport

2010-04-19 Thread Steve Shipway
Hello all. Here, we have recently acquired a Cisco Ironport Email gateway appliance, and this makes all sorts of useful data available via an XML interface. Therefore, I have created a plugin for Nagios and MRTG that can collect and threshold this data using the HTTPS/XML interface. The

Re: [Nagios-users] Check_multipath

2010-03-25 Thread Steve Shipway
My money is on Requiretty. Locally you have a TTY, while NRPE does not. The Requiretty setting in /etc/sudoers must be turned off. Comment out this line in /etc/sudoers: Defaults requiretty I agree -- this one had me stumped for days when I was producing a different plugin (for

Re: [Nagios-users] Virtual Machines - define as parent or as host dependency...

2010-01-26 Thread Steve Shipway
This is the way we do it, with Parents (not host dependencies). First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's

Re: [Nagios-users] Installing Nagios Server on a Virtual Machine

2009-11-12 Thread Steve Shipway
I would be very very wary of running Nagios on a VM (we use VMware here). The reason for this is Clock Skew. Clock Skew causes the virtual clock on the guest OS to lag behind then skip forward depending on the loading and sleep times of the guest. Note that this will not affect

Re: [Nagios-users] Distributed Monitoring Parents

2009-07-05 Thread Steve Shipway
If you want the satellites to suppress host/service checks when hosts are unreachable, then yes. Otherwise, your central Nagios master will correctly suppress notifications (as it knows about the dependencies, and the satellites don't do notifications). On our system, I've defined the

Re: [Nagios-users] Changing check_http.c

2009-04-21 Thread Steve Shipway
From: Jim Avery [mailto:j...@jimavery.me.uk] Sent: Wednesday, 22 April 2009 9:38 a.m. 2009/4/21 Andrew Davis ncc...@gmail.com: So far, so good, but what I really want to see is the URL in the output. ... I agree, but since the plugin doesn't do that, I sometimes put the full url in the

[Nagios-users] Version 0.9beta of check_vmware available

2009-02-25 Thread Steve Shipway
no longer has an SNMP interface. Thanks for your attention, Steve --- Steve Shipway UNIX Systems Administration, University of Auckland, New Zealand +64 9 3737 599 x 86487 s.ship...@auckland.ac.nz

[Nagios-users] Windows Eventlog agent v1.9.0 released

2008-10-08 Thread Steve Shipway
Windows. I am hoping that some members of the community who have offered to help will be able to provide this. Thank you for your time, Steve

[Nagios-users] Monitoring VMware

2008-09-23 Thread Steve Shipway
stable. Anyone attending LISA08 may be interested in seeing a brief demo of the sort of stats it retrieves at the MRTG BoF session on Wednesday. Steve

Re: [Nagios-users] creating a nagios cluster

2008-06-28 Thread Steve Shipway
We have achieved this here by having a shared external storage unit, and then using LinuxHA with the nagios service and filesystem (on the external disk unit) being defined as HA services. Works well - at the same time we also fail over the NSCA, SNMPtrapdaemon, and mysql database on the same

Re: [Nagios-users] Problem: Server is UP??

2008-06-17 Thread Steve Shipway
I really need some help with this guys. My setup was working for the longest time and I can't think of any changes made to it that would cause this behavior. I get random (and quite annoying!) Nagios alerts that say Problem: $SERVER is UP! (with the actual host name) or sometimes it just

Re: [Nagios-users] SMS notifications

2008-06-17 Thread Steve Shipway
On Jun 16, 2008, at 11:06 AM, Luc MAIGNAN wrote: Isn't there a free way to send SMS via Nagios ? Most cell phone companies have an e-mail - sms gateway service. Count yourselves lucky, here in New Zealand they charge you to use it in either direction. You need to pay to allow people

Re: [Nagios-users] check_vmfs

2008-06-11 Thread Steve Shipway
Check on nagiosexchange.org. There's the check_esx2 plugin to check ESX servers and their guests. It's currently being reworked to support ESX3 better. I also have a plugin (run via NRPE) that checks vmem and vcpu on a linux/windows guest using the VMWare library API. Also check_vmfs that

Re: [Nagios-users] send_nsca

2008-05-22 Thread Steve Shipway
On May 22, 2008, at 12:19 PM, Nair wrote: Can some one please help me in integrating send_nsca command with my Perl script for passive monitoring system. Say how can we integrate plugin output without writing to any file and then piping thru send_nsca. You can also take a look at
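One way to do what the question asks, sketched here in Python rather than Perl: build the tab-separated record that send_nsca reads on standard input and hand it over through a pipe, so no temporary file is involved. The binary path, config path, host name and service name are placeholders for illustration, not values taken from this thread.

```python
import subprocess

def nsca_line(host, service, code, output):
    """Format one passive service result the way send_nsca reads it on stdin:
    host<TAB>service<TAB>return code<TAB>plugin output<NEWLINE>."""
    # Tabs or newlines inside the output would break the record, so flatten them.
    clean = " ".join(output.split())
    return f"{host}\t{service}\t{code}\t{clean}\n"

def submit(line, nsca_host,
           send_nsca="/usr/local/nagios/bin/send_nsca",    # assumed install path
           config="/usr/local/nagios/etc/send_nsca.cfg"):  # assumed config path
    """Pipe the record straight into send_nsca and return its exit status."""
    proc = subprocess.run([send_nsca, "-H", nsca_host, "-c", config],
                          input=line, text=True)
    return proc.returncode
```

A caller would run the check, then call something like submit(nsca_line("web01", "HTTP", 2, "CRITICAL - timeout"), "nagios.example.org"); both host names here are made up.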

Re: [Nagios-users] help required- oracle monitoring using nagios

2008-04-09 Thread Steve Shipway
Your service definition should instead say: check_command check_oracle_generic!test2!system!manage!18!12!select count(*) from v$session where username is not null and your command definition should have command_line /usr/local/nagios/libexec/check_oracle_generic -SID '$ARG1$'

Re: [Nagios-users] Too stupid? Services are available, but nagios reports host to be down!

2008-04-08 Thread Steve Shipway
I've a list of hosts, these hosts are not available for ping, but normal service checks (SSH, SMTP, ...) work. Nagios reports these hosts being down! Ugly! On our system, we too have a small subset of hosts which cannot be pinged. However, they can be SSH'ed. So, I defined a new test,

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-22 Thread Steve Shipway
As I'd mentioned in a previous message, I'm in the process of converting from a centralized Nagios 2.10 setup all running on a single host to a distributed setup running on at least 3 hosts (3 to start anyway). The centralized setup has 572 hosts and 2900 services 99.9% of which are active

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-22 Thread Steve Shipway
Active Service Latency: 0.000 / 7267.198 / ... The only possible cause is the OCSP command slowing things down somehow. ... But if the submit_check_result is running slowly, that would only affect the service execution time wouldn't it? My understanding of check latency is

Re: [Nagios-users] graphical mapping tool

2008-01-03 Thread Steve Shipway
Here, we use weathermap -- http://www.network-weathermap.com/ -- which can take data feeds from MRTG and Nagios as well as from Cacti. I have a home-grown remote editor 'weatherman' available from http://www.steveshipway.org/software/weathermap-3.5.zip (perl/Tcl for windows/linux/mac/etc) which

Re: [Nagios-users] Linux Software Raid Plugin Recommendation?

2007-12-19 Thread Steve Shipway
There are several plugins for this already on nagios exchange. I would take a look there as well. http://www.nagiosexchange.org/RAID_Controller.58.0.html?tx_netnagext_pi1[p_view]=224 This is the one we use - it supports several hardware RAID, plus software raid on linux, solaris and AIX.

Re: [Nagios-users] Cannot get check_http to authenticate

2007-12-13 Thread Steve Shipway
This may be obvious, but have you checked for sure that you have the correct username and password, and that this particular username is authorised for the URL? You'll get a 401 if you have a valid username/password but the directory has a require directive that excludes the user (or excludes the

Re: [Nagios-users] How to get reboot messages

2007-11-29 Thread Steve Shipway
Here, we do this by checking the uptime of the host/device. For switches etc, this is in the SNMP counter. For windows hosts, it is via check_nt and the UPTIME object. For unix, you just create an appropriate script to run via nrpe. We then do a critical if uptime < 10min. Since hosts are
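A minimal sketch of the threshold logic described here, assuming the uptime has already been collected in seconds (from SNMP, check_nt or an NRPE script). The function name and message wording are illustrative only; this is not the script used in the thread.

```python
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3  # standard Nagios plugin exit codes

def check_uptime(uptime_seconds, threshold_minutes=10):
    """Return (exit_code, message): CRITICAL if the host booted within the threshold."""
    if uptime_seconds is None or uptime_seconds < 0:
        return UNKNOWN, "UPTIME UNKNOWN - no usable uptime value"
    minutes = uptime_seconds // 60
    if uptime_seconds < threshold_minutes * 60:
        return CRITICAL, f"UPTIME CRITICAL - rebooted {minutes} min ago"
    return OK, f"UPTIME OK - up {minutes} min"
```

A wrapper plugin would print the message and exit with the returned code.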

Re: [Nagios-users] notification_interval seems to be ignored

2007-11-21 Thread Steve Shipway
Just to wrap up this topic, I finally defeated this problem by changing the output of my ambient temperature monitor to not return the actual temperature in the server room. This made the message static and unchanging and prevented repeated notifications from going out. I'm coming into this a

Re: [Nagios-users] Notification Problems

2007-11-21 Thread Steve Shipway
Check the definitions for your contact. Do you have host_notification_options d,r set in there as well as in the host definition? If not, then the notifications will be filtered out by your contact definition. Steve notifications on Nagios 3.0a3. I have all hosts configured for DOWN and
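The filtering described here can be pictured as a two-stage test: a host state must be enabled both in the host's notification options and in the contact's host_notification_options before that contact hears about it. The sketch below is a deliberate simplification (it ignores time periods, downtime and flapping) and is not Nagios source code.

```python
def host_notification_sent(state, host_options, contact_options):
    """A host notification for `state` reaches a contact only if its letter
    is enabled in both the host's and the contact's option lists."""
    letter = {"DOWN": "d", "UNREACHABLE": "u", "RECOVERY": "r"}[state]
    enabled_on_host = letter in host_options.replace(" ", "").split(",")
    enabled_on_contact = letter in contact_options.replace(" ", "").split(",")
    return enabled_on_host and enabled_on_contact
```

With "d,r" on the host but only "r" on the contact, DOWN notifications are dropped while recoveries still go out.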

Re: [Nagios-users] Nagios - MySql

2007-10-18 Thread Steve Shipway
I want to integrate Nagios with MySQL and i was checking in nagiosexchange and found this: http://www.nagiosexchange.org/Misc.36.0.html?tx_netnagext_pi1[p_view]=462 Ah - this is one of mine. We use this here, of course - it was written originally for Nagios 1.x (which we use) but is now

Re: [Nagios-users] looking for help with NSClient

2007-10-01 Thread Steve Shipway
The old NSClient is notorious for not reporting problems correctly. Usually, when this happens, I find the issue is that the TCP port has been stolen by another application. Exchange is frequently guilty of this as it starts getting temporary port numbers from 1024 upwards and quickly takes

[Nagios-users] Monitoring Windows CPU/Mem under VMWare

2007-09-23 Thread Steve Shipway
://webdropoff.auckland.ac.nz/cgi-bin/pickup/945661545ec0ebe8e9cae137a553c127/356932 The linux one is better tested than the windows one. Treat these as alpha code: if you put them into a production environment, then you're on your own. Steve

Re: [Nagios-users] nagios server inside vmware

2007-09-02 Thread Steve Shipway
Mels Said: Cook, Garry wrote: IIRC, the solution given in that thread was 'Don't use VMware'. I run three different Ubuntu servers (Nagios, MRTG, and NeDi) on VMware, and have no issues whatsoever with time (or anything else). ... I have Suse 10.2 and Nagios, MRTG, Netdirector in

Re: [Nagios-users] negative check latency with Nagios as VM?

2007-08-21 Thread Steve Shipway
VMWare themselves advise not to perform any monitoring which is rate-based on the guest, and further say that any monitoring which polls hardware (eg network card traffic) will cause performance problems, and also that monitoring CPU and Memory on the guest is pointless and misleading

Re: [Nagios-users] negative check latency with Nagios as VM?

2007-08-20 Thread Steve Shipway
Does the comment not to run Nagios on VMWare also apply to a Distribution server? Yes, if any of the checks are rate-based with the rate being calculated by the plugin. Doing anything on a VMWare guest which is sensitive to the clock is not a good idea. So, you can check that (eg) http is up

[Nagios-users] Looking for Nagios experience exchange

2007-08-19 Thread Steve Shipway
email me. Similarly for anyone who'd like to drop by and talk Nagios. I'll even try and revive my previous idea of a New Zealand Nagios Users group, now I have management funding :-) Steve

Re: [Nagios-users] negative check latency with Nagios as VM?

2007-08-19 Thread Steve Shipway
We run a lot of VMWare here, although we're running our Nagios on a physical box for performance reasons. I've spent a lot of time researching how to monitor virtual hosts and the potential pitfalls... We're testing our Nagios 2.9 implementation on a VMWare server. This box does have the

Re: [Nagios-users] Suggestions needed for VMWare Guest OS'es

2007-07-17 Thread Steve Shipway
As we are currently consolidating hosts to VMWare i need some way to define reasonable parent/child relations ships in Nagios as well as defining host dependencies. This is a really awkward one. What we do here is to first define the guests as individual hosts for monitoring, although we

Re: [Nagios-users] using NDOUtils with Nagios

2007-07-12 Thread Steve Shipway
As you heard, NDOUtils is only for sending status data from Nagios to a database. However, if performance is your issue, then since Nagios 2 the daemon has written a cache file containing pre-processed configuration data to a temporary file as it runs. The CGIs can read this for massive performance

Re: [Nagios-users] best place to put NSClient++ on a Windows server?

2007-07-04 Thread Steve Shipway
When you run the /install option, this configures a Windows service which points at the nsclient++.exe executable. After you run this, you cannot move the .exe file, else the service will not be able to start! Here, I have made a windows install package, which installs the stuff into

Re: [Nagios-users] Distributed setups

2007-07-02 Thread Steve Shipway
We're currently looking at creating a distributed setup using NSCA. One thing that I've found no mention of is how the host and service commands are forwarded. I think they are not. Even if the central machine does all the notifications (as we're planning), completely dis/enabling

Re: [Nagios-users] nrpe configuration on solaris.

2007-07-02 Thread Steve Shipway
1. On solaris, make sure you have the SSL libraries installed if you are going to use SSL. Solaris does not seem to have these by default. 2. If you disable SSL, you need to do it on the client as well. So, your inetd definition uses -n to disable SSL, and you must also give the -n option to

Re: [Nagios-users] Double monitoring.

2007-07-01 Thread Steve Shipway
I'm admin of network with nagios, the network has like 30 servers, and I'm trying to do a double monitoring. ie: Sounds like what you want is either High Availability or Standby Nagios. We use both. For our 'live' Nagios, we have two servers running Linux-HA which are both connected to the same

Re: [Nagios-users] Recurring Downtime

2007-07-01 Thread Steve Shipway
I think this one is one of mine. We're in the process of moving to Nagios 2.x here, and in doing so I fixed a couple of bugs in this script, and improved it considerably. Until now I've not been able to properly test under Nagios 2.x. I have just uploaded the v2.0beta1 of these scripts to

[Nagios-users] Context-sensitive help pages?

2007-06-12 Thread Steve Shipway
to use? Alternatively, if Ethan has made some, where can I obtain them? Thanks for any help, Steve

Re: [Nagios-users] Monitoring Drupal CMS with Nagios and WebInject

2007-05-28 Thread Steve Shipway
Does anybody using Nagios for monitoring Drupal CMS deployments? Nagios already has check_http and check_curl plugins but I can only validate index.php. I was thinking about test tool called WebInject. It has a We use webinject to monitor our CMS system, although it is a different CMS to

Re: [Nagios-users] Trap reset script

2007-03-13 Thread Steve Shipway
It will look up the status and command files from the Nagios config, parse the status and then force an active check for services matching these criteria: 1. Should not be scheduled to be checked 2. Has active checks enabled Is there a reason why you are not using the Nagios freshness

[Nagios-users] check_rbl update: NJABL change

2007-03-08 Thread Steve Shipway

Re: [Nagios-users] nsclient++ and nrpe commands

2007-02-28 Thread Steve Shipway
quoted and check nsclient++ for definitions of illegal characters... Steve -- Steve Shipway ITSS, University of Auckland (09) 3737 599 x 86487 [EMAIL PROTECTED] From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Mies, Christian

Re: [Nagios-users] What to Monitor

2007-02-18 Thread Steve Shipway
certificate expiry, active users, active connections, time taken for a test email to pass through the system, number of mail processes (postfix, amavis, gld), virus scanner daemons, mail cluster activity, performance, and failover. Plus most of this is also graphed where possible. Steve

Re: [Nagios-users] Request new functionality: Off Hours state.

2007-02-15 Thread Steve Shipway
for us - I can send a copy of the modified status.c if you want, but we're using nagios 1.4... Steve

Re: [Nagios-users] NsClient 2.0.1

2007-01-16 Thread Steve Shipway
installing nsclient++ or nc_net and see if they work, or if it still fails then the problem lies with your check_nt call. Double check that you are really passing meaningful parameters to check_nt. Steve

Re: [Nagios-users] NsClient 2.0.1

2007-01-15 Thread Steve Shipway
and not compatible? We use nagios-plugins 1.4 and this works with pnsclient2.0.1 correctly. Steve

Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage

2006-12-19 Thread Steve Shipway
that you have given the -n option (non-SSL) in your init.d definition, but not in your inetd.conf definition! Are you using -n in your check_nrpe? Pick one and use it throughout... Steve

Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage

2006-12-18 Thread Steve Shipway
Sounds like the daemon is comparing the client's IP against the list of permitted connections, and not getting a match. For some reason the query of the source IP on the connection is returning 0.0.0.0 instead of the source IP - maybe you have some special wrapper or intervening agent. I'd

Re: [Nagios-users] check_nt and MSSQL$SQLEXPRESS

2006-11-30 Thread Steve Shipway
will still expand $ARG1$, but the shell will not subsequently try to expand the $ in the argument. Steve

Re: [Nagios-users] timeouts when using secondary dns

2006-11-09 Thread Steve Shipway
We dealt with this by installing a local caching-only nameserver on the Nagios host itself. This also took a lot of the load off of the main nameservers. So, resolv.conf was set to use 127.0.0.1 by default and have our normal name servers as secondaries. A nice side effect was that it vastly

Re: [Nagios-users] Esx checks

2006-11-08 Thread Steve Shipway
Derek Balling [mailto:[EMAIL PROTECTED] wrote: On Nov 7, 2006, at 4:45 PM, Steve Shipway wrote: As far as I know, there is no way to find out the IP address of an ESX guest OS without connecting to it (eg by logging in to the virtual ... That can't be true. VirtualCenter routinely tells

Re: [Nagios-users] Esx checks

2006-11-07 Thread Steve Shipway
) and then you'll be able to use DNS to identify it and SNMP query it directly. Steve

Re: [Nagios-users] No data was received from host!

2006-11-05 Thread Steve Shipway
This nicely explains why my mrtg-pnsclient script (which allows MRTG to directly query pnsclient, NC_Net etc) has occasional problems when querying two variables at once, and the second query fails. It is interesting to note that the original pnsclient doesn't have this limitation :) If

Re: [Nagios-users] Open Relay Check Plugin?

2006-10-29 Thread Steve Shipway
) and alerts you if this happens. Of course, this is not quite what you were asking for, but (once you fix your mail server) it lets you know if you ever get incorrectly blocked. We check it every few hours. Available from nagiosexchange.org. Steve

[Nagios-users] Testing host notification recipients

2006-10-23 Thread Steve Shipway
Nagios 1.4, at the moment. Can't go to 2.x until next year. Thanks in advance, Steve

Re: [Nagios-users] how to make check_snmp timeout CRITICAL

2006-10-23 Thread Steve Shipway
I achieve this by using the check_snmp options to test for a known value in the response string (eg: '.'). This way, I can alert if anyone stops the SNMP daemon or changes the community strings. Steve

Re: [Nagios-users] Monitoring VMWare

2006-10-19 Thread Steve Shipway

Re: [Nagios-users] custom fields

2006-10-15 Thread Steve Shipway
-Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Az Sent: Sunday, 15 October 2006 9:34 a.m. Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] custom fields Kyle Vorster wrote: What i am trying to do is give a more detailed

Re: [Nagios-users] custom fields

2006-10-15 Thread Steve Shipway
and you can have a copy. The name 'buster' comes from 'ghostbuster', since it is supposed to use this extra text to tell the night ops who they're gonna call... Steve

Re: [Nagios-users] check_mailq

2006-10-08 Thread Steve Shipway
I'm guessing that this plugin is designed to run on the mail server itself which is running nagios /and/ qmail? Yes. Checking the mail queue remotely is maybe not possible? Maybe I'm just missing something? Run it remotely via NRPE. That's what we do. Steve

Re: [Nagios-users] Monitoring IIS server...

2006-10-04 Thread Steve Shipway
and use the loadbalancer to keep session-id's from being eaten. 4) Buy more RAM for the old server. 5) Buy a new, beefier, server. You must be psychic. Actually, they'd already done (3) and (4) before they decided on doing (2). Doing (1) would have been too easy. Steve

Re: [Nagios-users] Monitoring IIS server...

2006-10-02 Thread Steve Shipway

Re: [Nagios-users] Nagios On VMWare machines

2006-10-01 Thread Steve Shipway
We use Nagios to monitor the service console (via check_esx2, from nagiosexchange) and the standard host MIB checks. Then we use NRPE or pNSclient on the guests to monitor them (but not CPU, memory or network IO). check_esx2 takes care of CPU use, readytime, and memory use checks. Works

Re: [Nagios-users] how to loop event-handlers

2006-09-28 Thread Steve Shipway
like to try playing with the is_volatile=1, and stalking_options=c to see if this results in multiple calls to the event handler on every critical check rather than just on state changes? Steve

Re: [Nagios-users] Service Checks in Distributed mode

2006-08-21 Thread Steve Shipway

Re: [Nagios-users] check_http can't follow redirect properly, no cookies!

2006-08-17 Thread Steve Shipway
Check out the WebInject Nagios plugin. I have it setup to check several pages in a secure website and it handles cookies. It could save you the hassle of creating your own script. We use webinject here for precisely this reason, and it works well. It also supports MRTG format output

Re: [Nagios-users] New check script: check_nsca

2006-08-16 Thread Steve Shipway
Great idea, I never thought of it for some reason. I've just added check_tcp on port 5667 to make this test - we use NSCA for receiving SNMP traps and messages from NagEventLog, so these are reset to 'ok' by a freshness check rather than to 'unknown'. Checking NSCA is alive is a good idea. Steve
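For readers unfamiliar with what a plain TCP check on the NSCA port amounts to, here is a rough Python sketch of the idea. It is not the check_tcp plugin, only an illustration of "can a connection be opened on port 5667"; the function name and message text are invented for this example.

```python
import socket

OK, CRITICAL = 0, 2  # Nagios plugin exit codes

def check_tcp_port(host, port=5667, timeout=10.0):
    """Return (exit_code, message) depending on whether a TCP connect succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return OK, f"TCP OK - port {port} accepting connections"
    except OSError as err:
        return CRITICAL, f"TCP CRITICAL - port {port}: {err}"
```

This only proves the daemon is listening; it does not confirm that submitted results are being accepted and processed.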

Re: [Nagios-users] How to get stdout from event handlers into Nagiosemail messages?

2006-08-15 Thread Steve Shipway
Hi, I'm running Nagios 2.5 and my question is: is there any way to capture the stdout of an event handler and get it into the email Nagios sends out? If you think about it, this must be impossible. Since the event handler is called at the same time as the notifications, the output (if

Re: [Nagios-users] Tracking dynamic parents?

2006-07-30 Thread Steve Shipway
I am monitoring several hundred virtual machines and they move from time to time to different VMware host machines. My question is how can I easily update the parents for these? We have this situation as well. The way I get around it is to reconfigure the scripts daily. I have a

Re: [Nagios-users] Why do I get these socket timeouts?!?!

2006-07-28 Thread Steve Shipway
CHECK_NRPE: Socket timeout after 10 seconds. This is the check_nrpe timing out. There are 3 or 4 timeouts to check: 1) Service checks have a global timeout in nagios.cfg. This is usually about 30sec. 2) check_nrpe has a timeout specified by -t. This is usually 10sec. 3) The remote nrpe agent

Re: [Nagios-users] bind nagios server to a certain IP

2006-07-17 Thread Steve Shipway
Is it possible to get a nagios server to bind to an IP? ... I want to get nagios to use one IP on a cluster so that the other host (active-passive) can take it over in the event of a failure. ... I've set allowed_hosts in nrpe.conf - so the clients will only accept checks

Re: [Nagios-users] Reporting Transactions

2006-07-13 Thread Steve Shipway
Does anyone have any recommendations for using Nagios to consume or plug in something that can test end user experience and report transaction steps to Nagios? I would need this to test drilling down into web sites and Win32 apps. Kinda like a transaction testing solution like Rational

Re: [Nagios-users] syslog-ng process monitor plugin

2006-07-13 Thread Steve Shipway
check_procs will only check for processes on the local machine. Use the -h option to see all the parameters you can give it to check for different things. You will probably want to use NRPE to run the check_procs plugin on the remote server (this is how we do it here) check_nrpe -H

[Nagios-users] Filtering display of current status in CGI

2006-07-05 Thread Steve Shipway
Hi -- I need to get a display of the current problems to go up on the Operations' status screen. So far, I can do this with: status.cgi?hostgroup=x-production&style=detail&servicestatustypes=248&serviceprops=8202&hostprops=8202&noheader&sorttype=2&sortoption=3 which limits display to just the

Re: [Nagios-users] Check Exchange Queues

2006-07-02 Thread Steve Shipway
Sometimes, windows servers have problems with service order. Make sure you have started exchange before you start perfmon, so that perfmon can verify the counters from exchange. Or is it the other way around? We have a problem with SQLServer where if you restart the (SQL) service all the

Re: [Nagios-users] Check Exchange Queues

2006-06-29 Thread Steve Shipway
Here's my checkcommands.cfg section:
## Exchange Server #
define command {
    command_name nt_exch_throughput
    command_line $USER1$/check_nt -H $HOSTADDRESS$ -v COUNTER -l "\\MSExchangeMTA\\Messages/Sec","%.2f messages/sec" -w $ARG1$ -c $ARG2$
}
define command {
    command_name nt_exch_inq

Re: [Nagios-users] Distributed Monitoring - Redundancy

2006-06-25 Thread Steve Shipway
I'm running Nagios in a distributed environment which is working very well. I would like to add a little redundancy to the picture now that I have everything working. ;-) ... It seems that a secondary cold spare might be the best solution. Then there are maintenance issues with keeping

Re: [Nagios-users] Switch Port Monitoring

2006-06-25 Thread Steve Shipway
Anyone like to share how they monitor switch ports? Here, we use MRTG to graph the switch ports, and the routers2 frontend to MRTG to make it pretty. Also, the routers2 frontend has a Nagios plugin to allow the Nagios data to be displayed as well -- and a portstatus plugin to show the current

Re: [Nagios-users] Monitoring Cisco 3750 stacks - OIDs or traps ?

2006-06-25 Thread Steve Shipway
If you want the switches to let you know when something happens, use traps. If you want to pull data at regular intervals, use polling. There's nothing to stop you from doing both. We use both, although not on precisely this hardware. On our foundry load balancer and SAN, for example,

Re: [Nagios-users] avaibility stats on Nagios server crash...

2006-06-11 Thread Steve Shipway
When the server which hosts nagios2.x crashes (for instance, a power cut), availability statistics are totally wrong: the stop period is not caught by nagios (the last state stored by Nagios is used to fill this time period...) and nothing appears in the Undetermined section of statistics. Too

[Nagios-users] check_esx version 2 released on Nagiosexchange

2006-05-28 Thread Steve Shipway
Version 2 of check_esx (to allow Nagios to check the health of a VMware ESX server via SNMP) is now uploaded on Nagiosexchange. This is a major change on the old v1.4 and can check for many health problems such as balloon memory, CPU ready time, VM swapping, and so on. Also has an add-on for

RE: [Nagios-users] monitoring esx vmfs

2006-05-19 Thread Steve Shipway
VMWare is a bit strange with this one. Basically, vdf lists normal filesystems plus the vmfs, but the NRPE check_disk plugin is not vmware-aware and so cannot check them. I'm doing it here via the SNMP which seems to work, but I've needed to write a special plugin. You might also like to notice

RE: [Nagios-users] How to reduce a very high latency number

2006-05-17 Thread Steve Shipway
[Trask wrote] I am still butting up against very high latency issues with my Nagios setup. I feel like I must be missing something obvious because it doesn't seem like I have so many services that the servers cannot keep up. I've noticed we get this problem when there are more than one or

RE: [Nagios-users] Hide certain 'Criticals'

2006-05-16 Thread Steve Shipway
... You could set your view to filter out acknowledged alerts? This is an undocumented option to status.cgi. ... Thanks for the reply Steve, how would I filter out the acknowledged alerts? After filtering them out, how would I know that they have recovered, does it reset the

RE: [Nagios-users] Monitore Postfix queue

2006-05-16 Thread Steve Shipway
I've written a small script to monitor the postfix queue, using postqueue to count the number of emails in my queue: postqueue -p | tail -1 | cut -d ' ' -f 5 The problem: sometimes my queue has more than 20,000 emails, and I can't count this number of emails in less than 1 minute :-) We
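The pipeline quoted above takes the fifth space-separated field of the last line that postqueue -p prints. A Python sketch of the same parsing follows, assuming the usual summary line of the form "-- 12 Kbytes in 3 Requests." and "Mail queue is empty" for an empty queue. It reads the same output, so it does nothing about the slowness on very large queues; the function name is invented for illustration.

```python
import re

def queued_messages(postqueue_output):
    """Return the request count from the summary line of `postqueue -p` output."""
    lines = postqueue_output.strip().splitlines()
    if not lines:
        return None
    last = lines[-1]
    if "queue is empty" in last.lower():
        return 0
    # Summary line looks like "-- 12 Kbytes in 3 Requests."
    match = re.search(r"in (\d+) Requests?\.", last)
    return int(match.group(1)) if match else None
```

A plugin wrapper would compare the returned count against warning and critical thresholds.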

RE: [Nagios-users] Hide certain 'Criticals'

2006-05-11 Thread Steve Shipway
I would like to be able to acknowledge that the device is down and be able to remove it from our view. Only when the device comes back online then again goes offline it would reset the acknowledgment and again show it on the screen as 'CRITICAL' until it is acknowledged again. I have

RE: [Nagios-users] Check Printer QUEUE on Windows w2k /2003

2006-05-10 Thread Steve Shipway
I'm looking for a method to check the printer queue on a Windows server (for example, warning or critical if more than 10 jobs are in the queue). I tried the SNMP check /usr/local/nagios/libexec/check_snmp_win.pl (very good), but I can only test the "Service" Spooler. I looked at nagiosexchange.org and on

RE: [Nagios-users] Failover nagios server

2006-03-07 Thread Steve Shipway
I have a standalone server with Nagios running on it. I want to set up another server for distributed monitoring. But the main task is to provide failover for the whole Nagios configuration: in case of failure of one of the servers, the other server must provide data collection from

RE: [Nagios-users] Potential bug in avail.cgi/Nagios 2.0.

2006-03-06 Thread Steve Shipway
The problem is the too large values of time_down_unscheduled and the fact that the total_time_down is not the sum of sched and unsched downtime. We get this as well, and it is a big nuisance. It looks very much like someone is using signed integers when they should be using unsigned, or

RE: [Nagios-users] pNSClient.exe port used?

2006-03-06 Thread Steve Shipway
pnsclient                      1248/tcp  inbound
nrpe and nrpe_NT               5666/tcp  inbound
Eventlog monitor (uses nsca)   5667/tcp  outbound
Steve
I have a silly question about pNSClient.exe for monitoring my windows servers. I added guarddog to my nagios server to better control my firewall, but when I

[Nagios-users] Using Nagios plugins via UCD-SNMPD

2006-03-05 Thread Steve Shipway
Hi, I thought I'd see if anyone has done this before. I'm installing the standard Nagios check_* plugins on a server with only the UCD SNMP daemon (i.e., the one you get with Linux). This takes generic extensions, which are compatible with the Nagios plugins. So, I would add something like this

RE: [Nagios-users] NSClient++ Counters

2006-02-22 Thread Steve Shipway
Brian Huffman: I'm trying to set up counters with NSClient++ and I am having trouble with counters that have an instance. For example, Processor-%Processor Usage has either an instance of _Total or 0 for the machine that I'm using, but I can't get either of these to work. No matter what

RE: [Nagios-users] check_nt binary available?

2006-02-22 Thread Steve Shipway
You could use NSCA. This is what I'm using to monitor Windows servers that are behind other machines. The key requirement is that the Windows machines can make outgoing connections. Your Nagios server will be passive, only listening for the incoming connections. I've been using the

RE: [Nagios-users] MySql Status Data

2006-02-20 Thread Steve Shipway
I have uploaded to www.nagiosexchange.org a utility (in Perl) that will regularly eat an archived Nagios log file from Nagios 1.x and install it into a mysql database for subsequent reporting. We're using it here in beta. Feel free to download it and try it, or modify for your own use. You

[Nagios-users] check_nt binary available?

2006-02-20 Thread Steve Shipway
Hopefully someone out there can help me. Has anyone managed to compile the check_nt program under Windows? If so, can you send me a copy of the binary or (preferably) the modified source code? We have a server that is hidden behind another server, and so to monitor it, I hope to use NRPE_NT on

RE: R: [Nagios-users] I: NSClient++ configuration problems

2006-02-19 Thread Steve Shipway
(Danny Allard) A quick test to make sure that the port is open is to telnet to it. telnet YourHostname 1248 On www.nagiosexchange.org you can download my checkagent utility, which will probe a remote host for various agents (NRPE, NRPE/SSL, pNSclient (including NC_Net, NSClient++),
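The telnet test mentioned above only checks that a TCP connection can be opened. A minimal Python sketch of the same check follows; the function name and the timeout value are illustrative assumptions, and this is not the checkagent utility referred to in the post.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host name and attempts the connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False
```

For example, `port_is_open("YourHostname", 1248)` mirrors `telnet YourHostname 1248`. A True result shows only that something accepts connections on that port, not that the agent behind it is answering requests correctly.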

RE: [Nagios-users] Nagios and NRPE daemon for monitoring

2006-02-19 Thread Steve Shipway
If you are using the UCD SNMP daemon (the one shipped with Linux, but you can compile it for other UNIXes) then this allows you to monitor for instances of a process. Steve From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Naveen C Joshi Sent: Saturday, 18 February
