[Nagios-users] Getting graphs for one host, but not another... but both in same hostgroup
I've stumbled across something odd. We're using Nagios and PNP4Nagios to get trending and graphs. I have two clients in the same hostgroup. The hostgroup name is linux-servers; its members are osiris1 and imhotep. I'm using NRPE to check each. Relevant config files are below.

In a nutshell, I'm getting disk graphs for Osiris1, but not for Imhotep, and I can't figure out why. To clarify: the disk graphs are there for both hosts. For Osiris1 they are populated with data, but for Imhotep they are blank (the graphs exist, but they are empty). Running the check manually from the command line of the Nagios server shows the pipe and the subsequent perfdata for both hosts (tests at the bottom of this email).

I could understand it if I were monitoring the hosts individually and had a typo in the config for one of them, but I'm monitoring them as a hostgroup and the command-line tests work, yet one host gives graphs with results and the other gives blank graphs. Any thoughts?

hosts.cfg (relevant portion):

    # Linux Hosts
    define host {
        name                    Linux-Servers
        register                0
        hostgroups              linux-servers
        check_command           check-host-alive
        icon_image              tux.png
        statusmap_image         tux.png
        icon_image_alt          Linux Server
        max_check_attempts      3
        check_period            24x7
        contact_groups          unixadmins
        notification_interval   120
        notification_period     24x7
        notification_options    d,u,r,f,s
    }

    define host {
        host_name               imhotep
        use                     Linux-Servers
        alias                   Imhotep (SLES)
        address                 10.1.1.57
        parents                 *changed_for_security_reasons*
    }

    define host {
        host_name               osiris1
        use                     Linux-Servers
        alias                   Osiris1 (SLES)
        address                 10.1.1.52
        parents                 *changed_for_security_reasons*
    }

hostgroups.cfg (relevant portion):

    # Linux Servers
    define hostgroup {
        hostgroup_name          linux-servers
        alias                   Linux Servers (Checked via NRPE)
        members                 imhotep, osiris1
    }

checkcommands.cfg (relevant portion):

    define command {
        command_name            check_nrpe
        command_line            $USER1$/check_nrpe -H $HOSTADDRESS$ -t 60 -c $ARG1$
    }

services.cfg (relevant portion):

    define service {
        hostgroup_name          linux-servers
        service_description     DISK
        check_command           check_nrpe!check_disks
        max_check_attempts      3
        normal_check_interval   15
        retry_check_interval    5
        check_period            24x7
        notification_interval   120
        notification_period     24x7
        notification_options    w,u,c,r,f,s
        contact_groups          unixadmins
        action_url              /nagios/pnp/index.php?host=$HOSTNAME$&srv=$SERVICEDESC$
    }

Manual tests from the Nagios command line:

    [nag...@nephilim ]$ /usr/local/nagios/libexec/check_nrpe -H imhotep -c check_disks
    DISK OK| /=37431MB;62067;65515;0;68964 /dev/shm=0MB;7187;7586;0;7986 /srv=487666MB;857825;905482;0;953139 /Process=771063MB;1151634;1215614;0;1279594 /home=484743MB;974463;1028600;0;1082737 /mnt/store=906386MB;1715990;1811323;0;1906656 /data=397590MB;837857;884405;0;930953 /mnt/archives=2715115MB;4618679;4875272;0;5131866

    [nag...@nephilim ]$ /usr/local/nagios/libexec/check_nrpe -H osiris1 -c check_disks
    DISK OK| /=5753MB;62067;65515;0;68964 /dev/shm=0MB;7013;7403;0;7793 /mnt/store=906386MB;1715990;1811323;0;1906656 /mnt/newxray=651767MB;837621;884155;0;930690 /mnt/incell=1503651MB;1715990;1811323;0;1906656 /mnt/newdata=788776MB;878368;927166;0;975965 /share=560786MB;851256;898548;0;945840 /data=397590MB;837857;884405;0;930953 /home=484743MB;974463;1028600;0;1082737 /mnt/archives=2715115MB;4618679;4875272;0;5131866 /Peptide=452914MB;85;891359;0;938273

Any thoughts? I'm stumped on this one. Does anyone know where PNP keeps its logs? I'm thinking I might be able to grep through the logs for the host with the blank graphs and find something...

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
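Since both hosts share one service definition but return different filesystems (8 perfdata entries from imhotep, 11 from osiris1), one thing that may be worth comparing is the set of perfdata labels each host reports. The sketch below is only a diagnostic aid, not a confirmed explanation for the blank graphs. The function name perfdata_labels is made up for illustration, the sample strings are shortened from the manual tests above, and labels containing spaces are not handled.

```shell
#!/bin/bash
# Hypothetical helper: print one perfdata label per line from plugin output
# of the form "TEXT| label=value;warn;crit;min;max ...".
perfdata_labels() {
    # Everything after the first "|" is perfdata. Split it on spaces and
    # keep the part of each item before the first "=".
    printf '%s\n' "${1#*|}" | tr ' ' '\n' | sed -n 's/=.*//p'
}

# Sample strings, shortened from the manual tests in the message above.
imhotep_out='DISK OK| /=37431MB;62067;65515;0;68964 /dev/shm=0MB;7187;7586;0;7986 /srv=487666MB;857825;905482;0;953139'
osiris1_out='DISK OK| /=5753MB;62067;65515;0;68964 /dev/shm=0MB;7013;7403;0;7793 /share=560786MB;851256;898548;0;945840'

# Report labels that osiris1 returns but imhotep does not.
for label in $(perfdata_labels "$osiris1_out"); do
    perfdata_labels "$imhotep_out" | grep -Fxq -- "$label" ||
        echo "only on osiris1: $label"
done
```

Feeding the function the full, untruncated check_nrpe output for each host would give the complete label lists to compare against whatever PNP has stored for each service.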
Re: [Nagios-users] How do you wake up an iPhone user???
We put in our own SMS gateway and have it configured so that all alerts go through email unless the alert is related to the network or to email itself (which the email alerts depend on), in which case it goes out by SMS instead. We then set up the alerts so that any SMS alert repeats every 5 minutes until it is acknowledged. That solved the problem for us...

A. Davis
Email: ncc...@gmail.com

On 9/10/10 5:40 AM, zarre...@linux.it wrote:
> http://growl.info/about.php

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios for iPhone
I didn't want to expose our Nagios URL to the world: if it were compromised, it would give an attacker way too much inside knowledge of our network. I believe Nagios is fairly secure, but it's a lot of PHP and other open source scripting languages, as well as third-party addons that may not hold up too well to brute-force attempts or might even have some lazy coding. As such, I have an outside server with a hardened access rule that acts as a proxy to the inside Nagios URL, and I use iNag to monitor through it. Best $10 I've ever spent.

A. Davis
Email: ncc...@gmail.com

On 7/14/10 11:31 AM, Giorgio Zarrelli wrote:
> Hi,
>
> Anyone using an easy interface for Nagios on iPhone? I do not mean some native iPhone app, I mean something server side like Nagios 4 iPhone.
>
> Any hints?
>
> Ciao,
> Giorgio
[Nagios-users] Acknowledgement text in emails...
For months I've been acknowledging alerts and including a reason why, but I just realized that the acknowledgment emails don't include the reasons that I take the time to write. As a result, my boss sees me ack'ing them, but doesn't know whether it's a non-critical issue, etc.

Is there an option I can add to include the acknowledgment reasons in the acknowledgment emails that go out?

--
A. Davis
Email: ncc...@gmail.com
[Nagios-users] How do I know *who* ack'd an alert?
This is a follow-up to my question about including comments in ack emails. I just tested and ack'd an issue and got an email:

    * Nagios *

    Notification Type: ACKNOWLEDGEMENT
    Host: KVM6
    State: DOWN
    Address: 10.5.12.75
    Info: CRITICAL - 10.1.3.75: Host unreachable @ 10.10.45.170. rta nan, lost 100%

    Date/Time: Fri Feb 12 12:17:58 EST 2010

    Additional Info:

    Comment: Brought down intentionally to test comments in ack emails - From IP:10.5.7.22

How do I see who ack'd this message? Better put, what macro do I use to show who ack'd an alert?

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] How do I know *who* ack'd an alert?
Thanks for the hint. I did, indeed, search the page. I didn't, however, think to look for the string AUTHOR. I was looking at ACK, ADMIN, etc. Thanks much. :)

A. Davis
Email: ncc...@gmail.com

On 2/12/10 12:29 PM, Martin Melin wrote:
> Come on. You got the link to the macro list in the previous thread, couldn't you be bothered to even look at that?
>
> Hint: the macro names end with AUTHOR.
>
> Regards
> Martin Melin
>
> On Fri, Feb 12, 2010 at 6:23 PM, Andrew Davis <ncc...@gmail.com> wrote:
> > This is a follow-up to my question about including comments in ack emails. I just tested and ack'd an issue and got an email:
> >
> >     * Nagios *
> >
> >     Notification Type: ACKNOWLEDGEMENT
> >     Host: KVM6
> >     State: DOWN
> >     Address: 10.5.12.75
> >     Info: CRITICAL - 10.1.3.75: Host unreachable @ 10.10.45.170. rta nan, lost 100%
> >
> >     Date/Time: Fri Feb 12 12:17:58 EST 2010
> >
> >     Additional Info:
> >
> >     Comment: Brought down intentionally to test comments in ack emails - From IP:10.5.7.22
> >
> > How do I see who ack'd this message? Better put, what macro do I use to show who ack'd an alert?
> >
> > --
> > A. Davis
> > Email: ncc...@gmail.com
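As a rough illustration of the hint in this thread, the sketch below builds a notification body that includes the acknowledging user. It assumes Nagios is exporting macros to the notification command as NAGIOS_* environment variables (enable_environment_macros=1) and uses the host-level $HOSTACKAUTHOR$ and $HOSTACKCOMMENT$ macros. The function name ack_body, the message layout, and the author name "adavis" are made up for the example; a service notification would use the SERVICEACK* equivalents.

```shell
#!/bin/bash
# Sketch of a notification body builder. Assumes Nagios exports macros as
# NAGIOS_* environment variables; the function name and layout are
# illustrative only.
ack_body() {
    printf 'Notification Type: %s\n' "${NAGIOS_NOTIFICATIONTYPE:-unknown}"
    printf 'Host: %s\n' "${NAGIOS_HOSTNAME:-unknown}"
    printf 'State: %s\n' "${NAGIOS_HOSTSTATE:-unknown}"
    # The ack author and comment are only meaningful for acknowledgements.
    if [ "${NAGIOS_NOTIFICATIONTYPE:-}" = "ACKNOWLEDGEMENT" ]; then
        printf 'Acknowledged by: %s\n' "${NAGIOS_HOSTACKAUTHOR:-unknown}"
        printf 'Comment: %s\n' "${NAGIOS_HOSTACKCOMMENT:-}"
    fi
}

# Demo with values borrowed from the test notification quoted above.
# The author name "adavis" is made up.
(
    NAGIOS_NOTIFICATIONTYPE=ACKNOWLEDGEMENT
    NAGIOS_HOSTNAME=KVM6
    NAGIOS_HOSTSTATE=DOWN
    NAGIOS_HOSTACKAUTHOR=adavis
    NAGIOS_HOSTACKCOMMENT='Brought down intentionally to test comments in ack emails'
    ack_body
)
```

The same fields can also be written directly into the notification command's command_line using the $...$ macro form instead of going through environment variables.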
Re: [Nagios-users] Virtual Machines - define as parent or as host dependency...
Thanks much for the info. Curious... you mention ESX and VSphere. Do you know if your check_vmware test also works against VMWare Server 2.x (free edition)? We are running ESX internally, but have a single external server running VMWare Server 2.x with three VMs on it. Hence why I ask...

A. Davis
Email: ncc...@gmail.com

On 1/26/10 3:03 AM, Steve Shipway wrote:
> This is the way we do it, with Parents (not host dependencies).
>
> First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's alarms, storage volumes, etc. These services have service dependencies on the VirtualCentre service running on the Virtual Centre host.
>
> The Farm object has ALL of the ESX Servers as Parents.
>
> All the VMs in the farm have the Farm object as a parent. Some of them also use check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare.
>
> This might seem a bit complex if you've only the one server, but as soon as you have multiple servers in the farm, and use DRS, you have to use a farm object for parents/dependencies.
>
> It might make more sense for these relationships to be host dependencies rather than parents in most cases, but we have a SAN mirrored environment to a second ESX farm so that the VMs can be brought up there in the event of a complete farm outage, hence the use of Parents rather than dependencies.
>
> If you have VSphere4 (ESX4.0) with an SNMP-enabled Cisco virtual switch in the farm, you could probably make the virtual switch the parent device rather than having to use a farm object.
>
> The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: http://www.steveshipway.org/forum/viewtopic.php?f=28&t=1648
>
> check_summary is available from nagiosexchange.org (as is check_esx3, which is the forerunner of check_vmware).
>
> Steve
>
> From: Andrew Davis [ncc...@gmail.com]
> Sent: Tuesday, 26 January 2010 9:14 a.m.
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Virtual Machines - define as parent or as host dependency...
>
> > I'm trying to figure out the best way to do this, yet keep things as simple as possible.
> >
> > Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VMs on Saturn: Jupiter, Mars, and Pluto.
> >
> > I want to suppress all host and service alerts on Jupiter, Mars, and Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VMs.
> >
> > Am I better off doing this with a host dependency? Something like:
> >
> >     define hostdependency {
> >         host_name                       Host B
> >         dependent_host_name             Host C
> >         notification_failure_criteria   d,u
> >     }
> >
> > Or am I better off defining Saturn as the parent of the VMs in the host config? Something like:
> >
> >     define host {
> >         host_name   jupiter
> >         use         VMs
> >         alias       jupiter
> >         address     172.26.251.60
> >         parents     saturn, tpdmzsw1
> >     }
> >
> > I've successfully used the parents directive to define network topology, so I would think this would work. What might be the risks of defining both?
> >
> > --
> > A. Davis
> > Email: ncc...@gmail.com
[Nagios-users] Virtual Machines - define as parent or as host dependency...
I'm trying to figure out the best way to do this, yet keep things as simple as possible.

Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VMs on Saturn: Jupiter, Mars, and Pluto.

I want to suppress all host and service alerts on Jupiter, Mars, and Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VMs.

Am I better off doing this with a host dependency? Something like:

    define hostdependency {
        host_name                       Host B
        dependent_host_name             Host C
        notification_failure_criteria   d,u
    }

Or am I better off defining Saturn as the parent of the VMs in the host config? Something like:

    define host {
        host_name   jupiter
        use         VMs
        alias       jupiter
        address     172.26.251.60
        parents     saturn, tpdmzsw1
    }

I've successfully used the parents directive to define network topology, so I would think this would work. What might be the risks of defining both?

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] NDOUtils mysql question?
Okay, perhaps I inverted it then. My build of Cacti stores everything in a MySQL DB, though it could admittedly be a MyISAM DB, with NDOUtils being of the InnoDB type. However it works out, I'm pretty confident that I have two database types on one MySQL server, and I'm trying to figure out how to use a mysql command to list the database names and their types...

    # mysql -u root -p -e "show databases;"
    Enter password:
    +--------------------+
    | Database           |
    +--------------------+
    | information_schema |
    | cacti              |
    | mysql              |
    | nagdb              |
    | nagios             |
    | test               |
    +--------------------+

A. Davis
Email: ncc...@gmail.com

shadih rahman wrote:
> I would double check this information. Cacti runs on RRD (round robin database). I have InnoDB for my ndoutils 1.47b.
>
> On Fri, Aug 7, 2009 at 3:03 PM, Andrew Davis <ncc...@gmail.com> wrote:
> > Our Nagios server also has Cacti on it. I'm pretty sure that Cacti uses an InnoDB database, while NDOUtils uses MyISAM. The two are backed up differently (example: mysqlhotcopy doesn't work on InnoDB databases).
> >
> > Does anyone know what mysql command you run to determine which type of DB is used for various databases, especially considering you can run multiple types at once?
> >
> > I know I can use mysqlshow or the "show databases;" option to show the databases themselves, but it doesn't list the DB type. Using mysqlshow with a -t and a DB name shows the table types, but again, not the database types. I'm sort of stuck on this. I want to make sure I know what DB types I'm dealing with so I can ensure I'm backing them up properly.
> >
> > Google searches aren't helping... the results all go back to the MyISAM versus InnoDB debate...
> >
> > --
> > A. Davis
> > Email: ncc...@gmail.com
>
> --
> Cordially,
> Shadhin Rahman
Re: [Nagios-users] NDOUtils mysql question?
Really? You sure about that? I'm pretty confident that MySQL supports different types of databases (MyISAM and InnoDB being two of them) as well as different table types within a database.

Your reply did help, though: when I ran it for the default mysql DB, I was reminded of the word "engine" in the output. Thus, now I see:

    # mysql -u root -p -e "show table status" mysql
    Enter password:
    | Name         | Engine | Version | Row_format | Rows | Avg_row_length | Data_length | Max_data_length    | Index_length | Data_free | Auto_increment | Create_time         | Update_time         | Check_time | Collation | Checksum | Create_options | Comment           |
    | columns_priv | MyISAM |      10 | Fixed      |    0 |              0 |           0 | 227994731135631359 |         1024 |         0 |           NULL | 2009-02-12 10:21:04 | 2009-02-12 10:21:04 | NULL       | utf8_bin  |     NULL |                | Column privileges |

    mysql> show engines;
    +------------+---------+----------------------------------------------------------------+
    | Engine     | Support | Comment                                                        |
    +------------+---------+----------------------------------------------------------------+
    | MyISAM     | DEFAULT | Default engine as of MySQL 3.23 with great performance         |
    | MEMORY     | YES     | Hash based, stored in memory, useful for temporary tables      |
    | InnoDB     | YES     | Supports transactions, row-level locking, and foreign keys     |
    | BerkeleyDB | YES     | Supports transactions and page-level locking                   |
    | BLACKHOLE  | NO      | /dev/null storage engine (anything you write to it disappears) |
    | EXAMPLE    | NO      | Example storage engine                                         |
    | ARCHIVE    | NO      | Archive storage engine                                         |
    | CSV        | NO      | CSV storage engine                                             |
    | ndbcluster | NO      | Clustered, fault-tolerant, memory-based tables                 |
    | FEDERATED  | NO      | Federated MySQL storage engine                                 |
    | MRG_MYISAM | YES     | Collection of identical MyISAM tables                          |
    | ISAM       | NO      | Obsolete storage engine                                        |
    +------------+---------+----------------------------------------------------------------+
    12 rows in set (0.00 sec)

An engine is another way of saying database type. This is distinct from a table type. Of course, the above only shows me the engines that are already compiled in and available for use, but hopefully I can go forward from here... I'm thinking I can do a "show databases;", then pump that into a "for x in ..." loop to show the engine type for each DB.

Thanks for the assist...

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On Aug 7, 2009, at 2:03 PM, Andrew Davis wrote:
> > Our Nagios server also has Cacti on it. I'm pretty sure that Cacti uses an InnoDB database, while NDOUtils uses MyISAM. The two are backed up differently (example: mysqlhotcopy doesn't work on InnoDB databases).
> >
> > Does anyone know what mysql command you run to determine which type of DB is used for various databases, especially considering you can run multiple types at once?
> >
> > I know I can use mysqlshow or the "show databases;" option to show the databases themselves, but it doesn't list the DB type.
>
> It's not a database level option. It's table level.
>
> > Using mysqlshow with a -t and a DB name shows the table types, but again, not the database types. I'm sort of stuck on this. I want to make sure I know what DB types I'm dealing with so I can ensure I'm backing them up properly.
> >
> > Google searches aren't helping... the results all go back to the MyISAM versus InnoDB debate...
>
> My Google-fu seems to be more powerful.
>
>     mysql -u Username -p -h database.hostname.com -e "show table status" databasename
>
> --
> Marc
Re: [Nagios-users] NDOUtils mysql question?
My apologies... didn't mean to come across as rude. I figured that others may have tools like Cacti along with NDOUtils on the same server and thus might find that they have multiple database types as well, hence my email to the Nagios list instead of the mysql list. In my case, I learned the hard way that mysqlhotcopy doesn't back up InnoDB databases, hence my need to determine the DB type so I knew how to properly back up the various DBs.

In any case, your original reply pointed me in the right direction. I just wrote up the script below. It first lists all the databases, then determines the database type for each, making it easy to parse out:

    #!/bin/bash
    # List the databases (-N suppresses the column header), skipping the
    # ones we don't care about. -x makes grep match whole names only.
    DATABASES=$(mysql -uroot -ppassword -N -e "show databases;" | grep -vxE "information_schema|test|nagdb")

    for x in $DATABASES
    do
        # Column 2 of "show table status" is the engine; sort -u collapses
        # repeats even when a database mixes engines.
        DATABASE_TYPE=$(mysql -uroot -ppassword -N -e "show table status;" "$x" | awk '{print $2}' | sort -u)
        echo $x:$DATABASE_TYPE
    done

    exit 0

Of course, I've also learned it's possible to mix engines within a database, so this isn't exactly foolproof. The output looks something like this:

    # /root/test.sh
    cacti:MyISAM
    mysql:MyISAM
    nagios:InnoDB

Thanks for pointing me in the right direction...

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On Aug 7, 2009, at 3:15 PM, Andrew Davis wrote:
> > Really? You sure about that? I'm pretty confident that MySQL supports different types of databases (MyISAM and InnoDB being two of them) as well as different table types within a database.
>
> If you really wanted to be sure about that and not question the answers you receive, shouldn't you be asking on mysql-users instead of the unrelated nagios-users? That would be the forum for definitive answers about mysql database types, don't you agree?
>
> --
> Marc
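Once the engines per database are known, the choice of backup tool can be scripted too. The sketch below is one possible mapping and is an assumption on my part, not something settled in the thread: mysqlhotcopy for databases whose tables are all MyISAM or ARCHIVE, mysqldump --single-transaction when everything is InnoDB, and a plain locking mysqldump for anything mixed. The function name backup_tool_for is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: given the engines found in one database (one per
# argument), print a suggested backup command family.
backup_tool_for() {
    [ "$#" -gt 0 ] || { echo "unknown"; return 1; }
    has_file_copy=0; has_innodb=0; has_other=0
    for engine in "$@"; do
        case "$engine" in
            MyISAM|ARCHIVE) has_file_copy=1 ;;   # engines mysqlhotcopy can copy
            InnoDB)         has_innodb=1 ;;
            *)              has_other=1 ;;
        esac
    done
    if [ "$has_innodb" = 0 ] && [ "$has_other" = 0 ]; then
        echo "mysqlhotcopy"
    elif [ "$has_file_copy" = 0 ] && [ "$has_other" = 0 ]; then
        # Consistent snapshot without locking, for transactional tables only.
        echo "mysqldump --single-transaction"
    else
        # Mixed or unrecognised engines: fall back to a locking dump.
        echo "mysqldump --lock-tables"
    fi
}

backup_tool_for MyISAM            # e.g. cacti in the output above
backup_tool_for InnoDB            # e.g. nagios in the output above
backup_tool_for MyISAM InnoDB     # a database that mixes engines
```

The three calls at the end print mysqlhotcopy, mysqldump --single-transaction, and mysqldump --lock-tables respectively.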
Re: [Nagios-users] Monitor Wireless Network Bandwidth Health
Can't speak for Nagios, but Cacti has some custom addons for trending Cisco APs and WLCs... search the Cacti forums. I just rolled this out for 3 controllers, two ACS/WCS servers, and roughly 50 APs...

A. Davis
Email: ncc...@gmail.com

Dei Bertine wrote:
> Hi,
>
> I was wondering if there's a way to monitor my wireless router via Nagios? I have a Cisco 520 Series Wireless LAN Controller with 3 APs attached (AIRLAP521G).
>
> Mainly I want to monitor the performance of the wireless connections and, if possible, send alerts if the network is slow.
>
> Any advice would be appreciated.
>
> Thanks,
> DB
Re: [Nagios-users] Problem getting past main page
I recognize this may sound a little terse and I don't mean it to be so... but as I've seen all the emails from you that are indicative of a first-time setup, might I suggest getting a Nagios book to assist you? There are quite a few out there. I personally use Nagios, 2nd Edition, by Wolfgang Barth and it's proven to be very valuable...

Just a thought...

A. Davis
Email: ncc...@gmail.com

Kaplan, Andrew H. wrote:
> This issue is an offshoot of the "Problem in preflight check with commands.cfg" thread. I went ahead and copied the nagios directory from our original server to the new one. Once that was done, I did the upgrade to version 3.1.2 of nagios.
>
> The main page of the application appears without a problem. However, whenever I try to go to any one of the links from the main page, an error message similar to the one below appears on-screen:
>
>     Not Found
>     The requested URL /nagios/cgi-bin/status.cgi was not found on this server.
>
> I noticed that after I inserted the snippets from the nagios.conf file into the httpd.conf file, and subsequently restarted the Apache server, the following errors appeared on-screen:
>
>     Starting httpd: The ScriptAlias directive in /etc/httpd/conf/httpd.conf at line 566 will probably never match because it overlaps an earlier ScriptAlias.
>     [warn] The Alias directive in /etc/httpd/conf/httpd.conf at line 583 will probably never match because it overlaps an earlier Alias.
>
> I have enclosed a copy of the httpd.conf file in its present version. I am at a loss as to why the error message is occurring, and I would really appreciate help on this.
Re: [Nagios-users] Problem getting past main page
Ah, now I understand. My apologies. I missed the part where you were upgrading and reading the emails showed errors that are very common to first time installations. That said, all I can say is read the log files. Quite a few issues I've encountered were solved when consulting various log files (httpd logs, /var/log/messages, mysql logs, etc). A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan Kaplan, Andrew H. wrote: Hi there -- Thank-you for your reply. I did indeed check out several books prior to posting my messages, and while all of them are thorough with doing a scratch installation, there were none that I read which delved deeply into doing an upgrade. The upgrade scenario is what I am involved with at this time. The purpose of this exercise is to be able to preserve the configuration, and have it available in the newer version of the Nagios application. I am hoping to avoid having to rebuild all checks from the beginning. The amount of checks that I have in place would require a great deal of time that I am hoping I will not have to commit. My previous upgrading experince, from Nagios 1.x to Nagios 2.x was not very difficult. I am hoping to accomplish that here. *From:* Andrew Davis [mailto:ncc...@gmail.com] *Sent:* Thursday, June 25, 2009 9:19 AM *To:* nagios-users@lists.sourceforge.net *Subject:* Re: [Nagios-users] Problem getting past main page I recognize this may sound a little terse and I don't mean it to be so... but as I've seen all the emails from you that are indicative of a first time setup, might I suggest getting a Nagios book to assist you? There's quite a few out there. I personally use Nagios 2nd Edition from Wolfgang Barth and its proven to very valuable... Just a thought... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan Kaplan, Andrew H. 
wrote: This issue is an offshoot of the "Problem in preflight check with commands.cfg" thread. I went ahead and copied the nagios directory from our original server to the new one. Once that was done, I did the upgrade to version 3.1.2 of nagios. The main page of the application appears without a problem. However, whenever I try to follow any of the links from the main page, an error message similar to the one below appears on-screen:

*Not Found*
*The requested URL /nagios/cgi-bin/status.cgi was not found on this server.*

I noticed that after I inserted the snippets from the nagios.conf file into the httpd.conf file, and subsequently restarted the Apache server, the following errors appeared on-screen:

*Starting httpd: The ScriptAlias directive in /etc/httpd/conf/httpd.conf at line 566 will probably never match because it overlaps an earlier ScriptAlias.*
*[warn] The Alias directive in /etc/httpd/conf/httpd.conf at line 583 will probably never match because it overlaps an earlier Alias.*

I have enclosed a copy of the httpd.conf file in its present version. I am at a loss as to why the error message is occurring, and I would really appreciate help on this. httpd.conf

The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail.

-- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
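The two Apache warnings quoted in this thread usually mean that an Alias or ScriptAlias for the same URL prefix appears more than once in the loaded configuration, which is what pasting the nagios.conf snippets into an httpd.conf that already had them would produce. That is an assumption, since the attached httpd.conf is not shown here. A minimal Python sketch for spotting such overlaps in a config file follows; the function name is made up for illustration, and the prefix test is only a rough approximation of Apache's own overlap check.

```python
import re

# Matches "Alias /url /path" and "ScriptAlias /url /path" lines, but not
# AliasMatch/ScriptAliasMatch and not commented-out lines.
_ALIAS_RE = re.compile(r'^\s*(ScriptAlias|Alias)\s+"?([^\s"]+)"?\s+\S', re.IGNORECASE)


def find_overlapping_aliases(conf_text):
    """Return (line_no, directive, url_path, earlier_line_no) for every Alias or
    ScriptAlias whose URL path starts with the URL path of an earlier one.

    Hypothetical helper: a simple prefix comparison, not Apache's exact logic.
    """
    seen = []       # (line_no, url_path) in file order
    overlaps = []
    for line_no, line in enumerate(conf_text.splitlines(), start=1):
        match = _ALIAS_RE.match(line)
        if not match:
            continue
        directive, url_path = match.group(1), match.group(2)
        for earlier_no, earlier_path in seen:
            if url_path.startswith(earlier_path):
                overlaps.append((line_no, directive, url_path, earlier_no))
                break
        seen.append((line_no, url_path))
    return overlaps
```

Calling `find_overlapping_aliases(open("/etc/httpd/conf/httpd.conf").read())` would list each later directive together with the line number of the earlier one it collides with. Files pulled in through Include directives are not followed by this sketch, so a duplicate living in a separate included file would have to be checked by concatenating the files first.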
Re: [Nagios-users] Nagios Center implementation
Not sure about you, but w/o a screenshot or something I'd be inclined to skip it. :) A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Assaf Flatto wrote: Hello List Has anyone managed to install and use this nagios add on ? http://nagcen.sourceforge.net/ Assaf
Re: [Nagios-users] Configuration files obfuscation
One thing we do here is chroot the entire nagios dir and all related components (mysql, apache, etc). The parent directory of the chroot is owned by root and set to 700 perms. It requires a bit of tweaking to get it right and lots of reading through log files to work out the initial bugs, but once that's done, it works. For that matter, the server itself is virtualized... so it's chrooted within a virtualized container. Are you concerned with users getting in through the web page, accessing the raw files, and then learning about your inside network, or with employees gaining physical access to the server? The approach to protect against each is different. Or perhaps you're passing username/password combos in some of the cfg files (ie: http testing, etc), so you want to hide that? An easier solution is a nagios user with limited access to use for this purpose. If you can elaborate on your areas of concern, we could probably offer further input. A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Mat W wrote: if the idea is for others not to see it... why not just ensure proper file ownership and limited permissions? Assuming only Root and Nagios users can read the file... I'd think anyone that could become those users should be able to read them anyway. -- Mat W. - http://www.techadre.com

Date: Tue, 16 Jun 2009 11:53:11 -0400 From: ebadd...@gmail.com To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Configuration files obfuscation Hi, is there a way to obfuscate configuration files? edward
Re: [Nagios-users] Nagios Reboot Notification
Same here... we have a few machines that can reboot in less than 2 minutes, and with Nagios only checking them every 5 minutes or so, some reboots can slip through the cracks. We have an uptime test that goes critical at less than 30 minutes and warns at less than 60 minutes, plus resolution notifications enabled. So if a machine panic-reboots and Nagios misses it, within 5 minutes the uptime test will show an uptime of less than 30 minutes and send an alert. A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Allan Clark wrote: Hi! On Jun 11, 2009, at 3:14 PM, Kevin Mitnikc teckad...@gmail.com wrote:

Hello everybody, I'm new to the boards, so this topic might already be covered. But I'm not exactly sure how to fully search the boards. Issue at hand: I have successfully set up Nagios 3 on Ubuntu 8.10. I have a few test servers being monitored, but what I can't get working properly is detecting when the servers reboot. I'm looking for a way to monitor when a server reboots or has an unexpected shutdown. I've tried the host-is-alive check and also ping, but the server comes up too fast. I've also tried just monitoring a service, such as NSClient++, but Nagios doesn't notify me right away once the server reboots. How is everybody monitoring server reboots? I've used Whats Up Gold in the past, and it would notify me right away. There has to be a way to monitor reboots and get a notification of that reboot. Many people have suggested I use scripts, but I don't know how to write scripts.

In past, we monitored uptime 5 min, there might be a contributed script or plugin. I would look, but have spotty Internet right now. Allan -- http://LinkedIn.com/in/goldfish
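Since the original question mentions not knowing how to write scripts, here is a minimal Python sketch of the kind of uptime test described in this thread. It is not the poster's actual plugin: the function names are invented for illustration, the 30 and 60 minute thresholds are assumptions taken from the description above, and reading /proc/uptime only works on Linux. It follows the usual Nagios plugin convention of exit code 0 for OK, 1 for WARNING, 2 for CRITICAL and 3 for UNKNOWN.

```python
# Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify_uptime(uptime_seconds, crit_minutes=30, warn_minutes=60):
    """Map an uptime in seconds to (exit_code, status_line).

    A short uptime means the machine rebooted recently, so *lower* is worse.
    Thresholds are illustrative defaults, not values from any real plugin.
    """
    minutes = uptime_seconds / 60.0
    if minutes < crit_minutes:
        state, code = "CRITICAL", CRITICAL
    elif minutes < warn_minutes:
        state, code = "WARNING", WARNING
    else:
        state, code = "OK", OK
    # Text before the pipe is the status line; text after it is performance data.
    message = "UPTIME %s - up %.0f minutes | uptime=%ds" % (state, minutes, uptime_seconds)
    return code, message


def read_uptime_seconds(path="/proc/uptime"):
    """Read seconds since boot from Linux's /proc/uptime (first field)."""
    with open(path) as handle:
        return float(handle.read().split()[0])


def main():
    """Print one status line and return the exit code for Nagios."""
    try:
        code, message = classify_uptime(read_uptime_seconds())
    except (OSError, ValueError, IndexError) as error:
        code, message = UNKNOWN, "UPTIME UNKNOWN - %s" % error
    print(message)
    return code

# To run it as a plugin, finish the script with: import sys; sys.exit(main())
```

A script like this would run on each monitored machine (for example through NRPE or check_by_ssh) rather than on the Nagios server. Because the critical window is much longer than the check interval, at least one check should land inside it even when the reboot itself finishes between two host checks.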
Re: [Nagios-users] Mailing list mod please remove user...
We're seeing this (below) for this account on just about every email through the list, and have been for more than a week. Would a mod or someone remove this address from the mailing list server already? It's quite infuriating... :) A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

wrote: Erro ao enviar o email para nagios-us...@lists. sourceforge. net Nagios Users mailinglist nagios-users@lists.sourceforge.net A caixa postal do destinatario esta cheia. O email foi recusado The mailbox is full. The email was rejected.
Re: [Nagios-users] Error while configuring NRPE on solaris
Per my previous email... I had the EXACT same issue and went to check_by_ssh for my Solaris hosts... I'm thinking there must be some magic step we're missing that isn't in the current "oh yeah... if you have Solaris then do this" knowledge pool... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

N Patil wrote: This problem is still unresolved. I have tried everything I can think of, but to no avail. I still see an error when I run:

    /usr/local/nagios/libexec/check_nrpe -H localhost
    CHECK_NRPE: Error - Could not complete SSL handshake.

The errors I see when I run dmesg:

    svc:/network/nrpe/tcp:default (chdir: No such file or directory)
    Jun 10 10:15:53 unknown inetd[7268]: [ID 702911 daemon.error] Failed to set credentials for the inetd_start method of instance svc:/network/nrpe/tcp:default (chdir: No such file or directory)
    Jun 10 10:15:59 unknown inetd[7276]: [ID 702911 daemon.error] Failed to set credentials for the inetd_start method of instance svc:/network/nrpe/tcp:default (chdir: No such file or directory)

I am using SunOS 5.10 Generic_120012-14 i86pc i386 i86pc Thanks, Nilesh

*Luc I. Suryo l...@suryo.com* 05/29/2009 10:07 PM Please respond to Luc I. Suryo l...@suryo.com To Eric Pearce epea...@amberpoint.com cc N Patil n.pa...@lntinfotech.com, Nagios Users Mailinglist nagios-users@lists.sourceforge.net Subject Re: [Nagios-users] Error while configuring NRPE on solaris

FYI, I have been using nagios and nrpe for 9-10 years now, on sparc and x86, starting back with Solaris 7 and now Solaris 10, with zero errors across a mix of Solaris, AIX, HP-UX, and Linux. The server has always been Solaris (sparc or x86), using inetd/xinetd/daemon mode, again with zero errors. The one 'problem' I have seen people complain about is SSL and nrpe; read the manual and it should be pretty clear what to do. 99.9% of the time it is the user not having done some RTFM thingy :) The other one is tcp-wrapper and nrpe; nrpe has access control built in, so I never understood why one would need to use tcp-wrapper :) -ls

From: N Patil To: Eric Pearce Cc: Nagios Users Mailinglist Sent: Thursday, May 28, 2009 9:13 PM Subject: Re: [Nagios-users] Error while configuring NRPE on solaris Thanks Eric, I have followed the same article but it didn't help. This problem is something which occurred at the end, I mean while testing connectivity. Thanks, Nilesh

    May 28 19:15:27 solaris10.remotehost.com inetd[24241]: [ID 702911 daemon.error] Failed to set credentials for the inetd_start method of instance svc:/network/nrpe/tcp:default (chdir: No such file or directory)

I'm just guessing, but do you have a home directory for the nagios user (with owner and group set to nagios)? The chdir error might come from this. -e
Re: [Nagios-users] nagios monitoring for hack
I'd look into the various hardening and monitoring tools available (Bastille, Tripwire, chroot, etc). There are different tools for different purposes, obviously. We chroot all our BIND and Apache stuff. Bastille is great for hardening the environment. Tripwire monitors for changes to key files. Each program has its own logging mechanisms. So once you have your tool in place, you can use Nagios to watch the log file(s) and generate alerts based on keywords (ALERT, WARN, CRIT, etc). You can also dump your logs to an alternate server and have Nagios watch them from there, but in the case of a DDoS attack, your bandwidth may be affected for remote syslog and/or Nagios network checks. A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

shadih rahman wrote: our web sites got hacked and we were subjected to ddos for last few days. I wanted to know what can I do for monitoring to find out if I am hacked or not. By the way, we were hacked by php exploits. Please advise on this. Thanks -- Cordially, Shadhin Rahman
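The keyword-based log watching suggested above can be sketched in a few lines of Python. This is only an illustration of the idea, not an existing plugin: the function name and the keyword-to-severity table are assumptions, and the keywords are simply the ones named in the reply (ALERT, WARN, CRIT).

```python
import re

# Keyword -> Nagios exit code (1 = WARNING, 2 = CRITICAL).
# The mapping is an assumption chosen for illustration.
SEVERITY = [("CRIT", 2), ("ALERT", 2), ("WARN", 1)]


def scan_log_lines(lines):
    """Return (exit_code, summary) for an iterable of log lines.

    A line counts once, at the severity of the first keyword it contains;
    the overall result is the worst severity seen.
    """
    worst = 0
    hits = 0
    for line in lines:
        for keyword, code in SEVERITY:
            # \b anchors the keyword at a word start, so "CRIT" also
            # matches "CRITICAL" and "WARN" also matches "WARNING".
            if re.search(r"\b%s" % keyword, line):
                hits += 1
                worst = max(worst, code)
                break
    state = {0: "OK", 1: "WARNING", 2: "CRITICAL"}[worst]
    return worst, "LOG %s - %d matching line(s)" % (state, hits)
```

A real check would also need to remember how far it had already read, so that old entries do not alert again on every run; the sketch leaves that out. The check_log plugin shipped with the standard Nagios plugins is the more usual starting point for this job.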
[Nagios-users] DNS down and false alerts...
I've observed an interesting issue with Nagios. Our environment is a mix of UNIX, Linux, Apple, and Windows. The core of the network is Active Directory, including two AD servers that are both our primary, internal DNS servers. All non-Windows systems have a resolv.conf that looks like:

    nameserver 10.1.1.13
    nameserver 10.1.1.14
    domain int.our.domain
    search int.our.domain

About half of the servers have the nameserver entries inverted (ie: .14 first, .13 second). The issue is that any time one of the nameservers is rebooted (at least once a month if staying current on patches, thanks to Black Tuesdays), whichever hosts have that nameserver listed first in their resolv.conf start throwing the following error:

*CRITICAL - Plugin timed out while executing system call.*

This occurs for multiple tests for each host. Obviously, there's a name resolution correlation here. If the nameserver with .13 is rebooted, all hosts (about half of them) that list this IP first in their resolv.conf time out for multiple tests. If the .14 server is rebooted, all the other hosts time out. Interestingly, none of the Windows clients issue errors... only UNIX, Linux, and Macs... only those with an /etc/resolv.conf. The end result is a host of false positives, but more importantly it looks bad on availability reports and causes phones/pagers to go ballistic with unneeded emails. I'm trying to find a solution and I can't find one that I like:

Solution 1) is to cluster the DNS servers. We have lots of clusters here. This isn't good, though, as you don't normally cluster DNS servers... they're meant to be redundant for a reason... one fails and clients use the next one.

Solution 2) is to set up a service/host dependency. My thought would be either a host dependency that says if either .13 or .14 is down, then don't alert for any other host that uses them. Or a service-to-host dependency... if the DNS service is down, then don't alert on any of these dependent hosts. Honestly, I'm not sure if you can mix host and service dependencies like this... plus... if the DNS server is actually down, then the DNS service is down, so better to use a host dependency. The problem is that now we're not alerting on any dependent hosts, which themselves could have a legitimate issue we want to know about. Plus, what happens if the DNS server actually dies and takes a few hours/days to rebuild/restore? In that case, the dependent hosts aren't watched for a very long time.

Solution 3) is to set up a UNIX/Linux DNS server that slaves all zones from the AD servers and have all UNIX/Linux/Apple clients query this server. This would work except that A) I need two of them to keep redundancy and B) I've now added an extra layer of complication just to suit one application (Nagios)... not exactly good practice.

Solution 4) is to set the timeout value of a host querying a DNS server. Perhaps adjust the client to time out on the first listed nameserver after only 10 seconds, then try the next one? Since most Nagios tests have a minimum timeout value of 30 seconds, if the first DNS query timed out after 10 seconds, the resolver would move on to the next one with, hopefully, enough time to respond. The downside is having to adjust every single server.

Has anyone else seen this? Anyone else using Windows AD servers to provide DNS for *nix servers? -- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
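The arithmetic behind Solution 4 can be made concrete with a small Python sketch. It rests on a simplified model that is an assumption, not a description of any particular resolver: each lookup waits one full timeout on every dead nameserver listed ahead of the first live one. The function names are hypothetical. The 5 second default matches glibc's default resolver timeout, which glibc lets you change with `options timeout:N` in resolv.conf; other platforms may behave differently.

```python
def delay_before_answer(dead_servers_first, timeout=5, lookups=1):
    """Seconds spent waiting on dead nameservers before a live one is asked.

    Simplified model: every lookup pays one full `timeout` for each dead
    server that is listed ahead of the first live server.
    """
    return dead_servers_first * timeout * lookups


def fits_plugin_timeout(plugin_timeout, dead_servers_first=1, timeout=5, lookups=1):
    """True if the DNS waiting time alone stays under the plugin timeout."""
    return delay_before_answer(dead_servers_first, timeout, lookups) < plugin_timeout


# The numbers from Solution 4: a 10 second resolver timeout against a
# 30 second plugin timeout, with the first nameserver rebooting.
one_lookup = fits_plugin_timeout(30, dead_servers_first=1, timeout=10, lookups=1)
three_lookups = fits_plugin_timeout(30, dead_servers_first=1, timeout=10, lookups=3)
```

Under this model a check that performs a single lookup survives with a 10 second resolver timeout, but one that performs three lookups already uses up the whole 30 seconds, so a shorter resolver timeout (or a longer plugin timeout) would leave more headroom. How many lookups a given check really triggers is something to measure on the client.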
Re: [Nagios-users] DNS down and false alerts...
Hey... I'm the OP. We're using a mix of client tools. For Windows systems (which aren't affected by this) we use nsclient++. For our Linux servers, NRPE... for UNIX (Solaris) and OS X we're using check_by_ssh. Both the NRPE and check_by_ssh clients are affected by this. I'm willing to give the caching nameserver on the server a try, but as others have noted, I don't think it will make a difference, as it's the local test on the client that's failing to resolve. I surely cannot do a caching nameserver setup on all clients... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Martin Melin wrote: I don't know if I'm misreading the OP, but if the plugins start timing out on only the boxes whose primary DNS is being rebooted, would adding a caching DNS server to the Nagios box really make a difference? I think the root cause of these timeouts is that the Nagios plugin timeout is happening before the connection to the primary DNS on the target machine has a chance to time out and then connect to the secondary DNS. The correct course of action to resolve this would be to either make sure that the DNS connection on the target machines fails faster, or that Nagios/the plugin waits longer for a result from the check. The DNS failover is working as designed here, but you're not giving it enough time to kick in.

On Tue, Jun 9, 2009 at 5:37 PM, Russell Adams rlad...@adamsinfoserv.com wrote: Really the best choice is to use caching DNS on the Nagios server. I'd recommend dnsmasq; it just does caching locally without needing to do big zone transfers. It has low overhead and simple configuration as a result. Enjoy.
Re: [Nagios-users] nagios -- ndo2db -- centreon
I don't want to come across like a jerk, but wouldn't this be better discussed on the Centreon site? I understand it's built on Nagios, but I'm thinking you might get better help and support on the Centreon site itself... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Romain Le Merlus wrote: Hi, On Thu, Jun 4, 2009 at 6:18 PM, James Pifer j...@obrien-pifer.com wrote: I have two nagios hosts.cfg files: /etc/nagios/hosts.cfg /usr/local/centreon/filesGeneration/nagiosCFG/1/hosts.cfg neither of them contains the host I created... I grepped all of the cfg files and none of them contain it...

It seems that you did not generate and restart your configuration from the main Centreon server (Configuration > Nagios > Nagios)? Did you select the correct poller on the form? Do you get any debug information when you run it from Centreon? Best regards. -- Romain LE MERLUS | Project Director rlemer...@merethis.com Tel. +33 (0)1 49 69 97 12 Mob. +33(0)6 85 05 02 82 MERETHIS is the publisher of the Centreon software.
[Nagios-users] 3D Status Map Java exception error?
Client is a Mac (OS X 10.5.7) with Firefox 3.x. A VRML viewer *is* installed. Clicking on the 3-D Status Map link launches a Java applet that then says: Unable to launch the application. If I click the Details button, the header of the window says: Error: Could not parse launch file. Error at line 0.

In the Launch File field, I see:

    #VRML V2.0 utf8
    WorldInfo{ title Nagios 3-D Network Status View info [Copyright (c) 1999-2002 Ethan Galstad egals...@nagios.org] }
    Background{ skyColor 0.1 0.1 0.15
    ~ I truncated the rest ~

The Exception field shows:

    JNLParseException[ Could not parse launch file. Error at line 0.]
    at com.sun.javaws.jnl.XMLFormat.parse(XMLFormat.java:58)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:52)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:64)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:143)
    at com.sun.javaws.Main.launchApp(Main.java:253)
    at com.sun.javaws.Main.continueInSecureThread(Main.java:213)
    at com.sun.javaws.Main$1.run(Main.java:108)
    at java.lang.Thread.run(Thread.java:637)

And the Wrapped Exception field shows:

    Failed to find the '' charater that marks the end of a CDATA element. Exception parsing xml at line 0
    at com.sun.deploy.xml.XMLParser.skipPCData(XMLParser.java:182)
    at com.sun.deploy.xml.XMLParser.nextToken(XMLParser.java:100)
    at com.sun.deploy.xml.XMLParser.parse(XMLParser.java:58)
    at com.sun.javaws.jnl.XMLFormat.parse(XMLFormat.java:56)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:52)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:64)
    at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:143)
    at com.sun.javaws.Main.launchApp(Main.java:253)
    at com.sun.javaws.Main.continueInSecureThread(Main.java:213)
    at com.sun.javaws.Main$1.run(Main.java:108)
    at java.lang.Thread.run(Thread.java:637)

Any ideas? A. Davis
Re: [Nagios-users] 3D Status Map Java exception error?
Sorry, wasn't trying to be coy. Didn't realize it was relevant. And right now I'm not sure that VRML 2.0 is supported. I'll check that and try another VRML viewer. Curious if anyone else is using Firefox 3.x on Mac OS X with a VRML viewer that works, and which one you're using? A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Marc Powell wrote: On Jun 3, 2009, at 9:47 AM, Andrew Davis wrote: Client is a Mac (OS X 10.5.7) with Firefox 3.x. A VRML viewer *is* installed.

Why so coy about which one you've installed? That's an important bit of information, no? At the very least you can confirm that someone else has successfully used it.

Clicking on the 3-D Status Map link launches a Java applet that then says: Unable to launch the application. If I click the Details button, the header of the window says: Error: Could not parse launch file. Error at line 0.

Sounds like a compatibility issue with the supposed VRML viewer you've chosen. Does it speak VRML V2.0? Have you tried a different viewer? -- Marc
[Nagios-users] Nagios Avaya
We recently had a new Avaya VoIP system installed, including eXtreme switches. The NagiosExchange site has at least one SNMP add-on for the eXtreme switches; however, searches for avaya, lucent, definity, and audix all return zero hits. I understand that Avaya has its own software that competes somewhat with Nagios, but I can't imagine that no one is polling Avaya equipment outside of Avaya's own software.

Is anyone out there querying newish Avaya VoIP phone switches, add-on cards, and/or Modular Messaging environments? I'm interested in info (other URLs if needed), existing scripts, howtos, etc. Pretty much anything, as I'm seeing nothing right now.

Thanks,
--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Nagios Avaya
I agree completely and will do that if necessary, but where possible I prefer not to reinvent the wheel. Seeing nothing on NagiosExchange or on the first page of a Google search, I thought I'd poll the list. I was actually really surprised to see absolutely nothing from Google searches, and my conspiracy-theory mind thought there might be a reason why (Avaya claiming IP on their MIBs/OIDs and sending takedown notices for using them in Nagios add-ons?), but I digress. If I get nothing, I guess I'll just start working on my own.

A. Davis
Email: ncc...@gmail.com

Onotsky, Steve x55328 wrote:
> Apologies for the blank reply, everyone (damned Ctrl-Enter).
> Andrew, does snmpwalk against the switches give back any useful information? Rather than searching for a canned solution that may not exist, you may get off the ground faster by just querying the OID(s) you want.
>
> *From:* Andrew Davis [mailto:ncc...@gmail.com]
> *Sent:* June 3, 2009 12:48
> *To:* nagios-users@lists.sourceforge.net
> *Subject:* [Nagios-users] Nagios Avaya
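Steve's suggestion of querying the OID(s) directly could be wrapped in a small plugin. What follows is only a minimal sketch, assuming the value has already been fetched (for example with snmpget) and that higher values are worse. The function name `evaluate`, the label `trunks_in_use`, and the thresholds are all hypothetical and do not come from any Avaya MIB. The exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and the pipe-separated performance data follow the usual Nagios plugin convention.

```python
# Nagios plugin exit codes (standard plugin convention).
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
STATUS_NAMES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def evaluate(label, raw_value, warn, crit):
    """Map one polled value to (exit_code, plugin_output_line).

    'label', 'warn' and 'crit' are illustrative; pick them per OID.
    """
    try:
        value = float(raw_value)
    except (TypeError, ValueError):
        # Anything that is not a number cannot be compared to thresholds.
        return UNKNOWN, f"UNKNOWN - could not parse {label} value {raw_value!r}"
    if value >= crit:
        code = CRITICAL
    elif value >= warn:
        code = WARNING
    else:
        code = OK
    # Text before the pipe is status output; text after it is performance data.
    line = f"{STATUS_NAMES[code]} - {label} is {value:g} | {label}={value:g};{warn};{crit}"
    return code, line
```

A real plugin would print the returned line and exit with the returned code, so Nagios picks up both the state and the performance data.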
Re: [Nagios-users] Stop Nagios before applying up2date?
It's not required, but after a reboot or a reload of certain libraries you could possibly see issues. In general, you should be fine. However, our Nagios server is also our Cacti server, and Cacti builds against the exact versions of snmp, so when snmp is upgraded as part of up2date or yum, we have to recompile Cacti. Depending on what else you're running on the server, you could see similar issues, especially with MySQL, etc. It's sort of a crapshoot, but once you do it the first time, you'll know the answer for all subsequent times. :)

A. Davis
Email: ncc...@gmail.com

Karen Tsai wrote:
> Hi,
> Should we keep Nagios down while applying Linux up2date on the server where Nagios is installed?
> Thanks, Karen
[Nagios-users] 3D Status Map Java exception
Client is a Mac (OS X 10.5.7) with Firefox 3.x. A VRML viewer *is* installed. Clicking on the 3-D Status Map link launches a Java applet that then says:

    Unable to launch the application.

If I click the Details button, the header of the window says:

    Error: Could not parse launch file. Error at line 0.

In the Launch File field, I see:

    #VRML V2.0 utf8
    WorldInfo{
    title Nagios 3-D Network Status View
    info [Copyright (c) 1999-2002 Ethan Galstad egals...@nagios.org]
    }
    Background{
    skyColor 0.1 0.1 0.15
    ~ I truncated the rest ~

The Exception field shows:

    JNLParseException[ Could not parse launch file. Error at line 0.]
        at com.sun.javaws.jnl.XMLFormat.parse(XMLFormat.java:58)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:52)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:64)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:143)
        at com.sun.javaws.Main.launchApp(Main.java:253)
        at com.sun.javaws.Main.continueInSecureThread(Main.java:213)
        at com.sun.javaws.Main$1.run(Main.java:108)
        at java.lang.Thread.run(Thread.java:637)

And the Wrapped Exception field shows:

    Failed to find the '' charater that marks the end of a CDATA element.
    Exception parsing xml at line 0
        at com.sun.deploy.xml.XMLParser.skipPCData(XMLParser.java:182)
        at com.sun.deploy.xml.XMLParser.nextToken(XMLParser.java:100)
        at com.sun.deploy.xml.XMLParser.parse(XMLParser.java:58)
        at com.sun.javaws.jnl.XMLFormat.parse(XMLFormat.java:56)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:52)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:64)
        at com.sun.javaws.jnl.LaunchDescFactory.buildDescriptor(LaunchDescFactory.java:143)
        at com.sun.javaws.Main.launchApp(Main.java:253)
        at com.sun.javaws.Main.continueInSecureThread(Main.java:213)
        at com.sun.javaws.Main$1.run(Main.java:108)
        at java.lang.Thread.run(Thread.java:637)

Any ideas?

--
A. Davis
Email: ncc...@gmail.com
[Nagios-users] Host dependencies not being picked up...
The bulk of my systems are behind one of three switches. I just went through my hosts.cfg file and added the parents line with the appropriate switch name for each host, but the host dependencies aren't showing up. I use a host template for most of the servers and then add the host-specific settings. Below is a sample of what's defined.

Initially I thought the issue was that the Nagios server itself sits behind one of the switches, but the last entry below shows a host whose parent is a switch that the Nagios server is not behind (Nagios is behind flscrb234srvsw5; the last host is behind flscrb234srvsw6). The VLAN is the same for all the hosts on the three switches: all hosts operate on 10.1.1.x/16 or 10.1.80.x/16, which is all a single VLAN. I'd appreciate any thoughts on why it might not be working:

    # -- Network Switches
    define host {
        name                    Network-Switches
        register                0
        hostgroups              network-switches
        check_command           check-host-alive
        icon_image              cisco.jpg
        icon_image_alt          Cisco Device
        max_check_attempts      3
        check_period            24x7
        contact_groups          networkadmins
        notification_interval   120
        notification_period     24x7
        notification_options    d, u, r, f, s
    }

    define host {
        host_name   flscrb234srvsw5
        use         Network-Switches
        alias       4948-5 - B234
        address     10.100.0.45
    }

    define host {
        host_name   flscrb234srvsw6
        use         Network-Switches
        alias       4948-6 - B234
        address     10.100.0.46
    }

    ...

    # Linux Hosts #
    # -- Linux host
    define host {
        name                    Linux-Servers
        register                0
        hostgroups              linux-servers
        check_command           check-host-alive
        icon_image              tux.png
        icon_image_alt          Linux Server
        max_check_attempts      3
        check_period            24x7
        contact_groups          unixadmins
        notification_interval   120
        notification_period     24x7
        notification_options    d, u, r, f, s
    }

    define host {
        host_name   aten
        use         Linux-Servers
        alias       Aten (SLES)
        address     10.1.80.3
        parents     flscrb234srvsw5
    }

    ...

    define host {
        host_name   delos
        use         Linux-Servers
        alias       Delos (CentOS)
        address     10.1.1.93
        parents     flscrb234srvsw6
    }

The pre-flight check reports:

    Reading configuration data...
    Running pre-flight check on configuration data...
    Checking services...
        Checked 673 services.
    Checking hosts...
        Checked 168 hosts.
    Checking host groups...
        Checked 23 host groups.
    Checking service groups...
        Checked 36 service groups.
    Checking contacts...
        Checked 9 contacts.
    Checking contact groups...
        Checked 12 contact groups.
    Checking service escalations...
        Checked 0 service escalations.
    Checking service dependencies...
        Checked 30 service dependencies.
    Checking host escalations...
        Checked 0 host escalations.
    *Checking host dependencies...
        Checked 0 host dependencies.*
    Checking commands...
        Checked 58 commands.
    Checking time periods...
        Checked 5 time periods.
    Checking for circular paths between hosts...
    Checking for circular host and service dependencies...
    Checking global event handlers...
    Checking obsessive compulsive processor commands...
    Checking misc settings...

    Total Warnings: 0
    Total Errors: 0

    Things look okay - No serious problems were detected during the pre-flight check

--
A. Davis
Email: ncc...@gmail.com
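Since the pre-flight summary does not list parents, one way to confirm that the parents lines are at least present in hosts.cfg is to extract them with a short script. This is only a rough sketch: the function name `parse_hosts` is made up for illustration, the sample text is a cut-down copy of the definitions above, and it ignores template inheritance (`use`) entirely, so it is not how Nagios itself parses objects.

```python
import re


def parse_hosts(config_text):
    """Return {host_name: [parents]} for each 'define host' block with a host_name.

    Template blocks (those with only 'name') are skipped.
    """
    hosts = {}
    # 'host\s*\{' does not match 'hostgroup {' or 'hostdependency{'.
    for block in re.findall(r"define\s+host\s*\{(.*?)\}", config_text, flags=re.DOTALL):
        directives = {}
        for line in block.splitlines():
            line = line.split(";", 1)[0].strip()  # drop inline ';' comments
            if not line or line.startswith("#"):
                continue
            parts = line.split(None, 1)  # directive name, then its value
            if len(parts) == 2:
                directives[parts[0]] = parts[1].strip()
        if "host_name" in directives:
            raw = directives.get("parents", "")
            hosts[directives["host_name"]] = [p.strip() for p in raw.split(",") if p.strip()]
    return hosts


SAMPLE = """
define host {
    host_name   flscrb234srvsw6
    use         Network-Switches
    address     10.100.0.46
}
define host {
    host_name   delos
    use         Linux-Servers
    address     10.1.1.93
    parents     flscrb234srvsw6
}
"""

for name, parents in parse_hosts(SAMPLE).items():
    print(name, "->", ", ".join(parents) or "(no parents)")
```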
Re: [Nagios-users] newbie question...
Marc Powell wrote:
> On Jun 1, 2009, at 3:09 AM, Arnar Þórarinsson wrote:
>> So there is no way of getting just the host down alert when a host goes down? To explain a little, let's say I'm monitoring CPU, memory and disk space on a host. The host goes down and Nagios sends an alert by email for the host down event and also for the CPU, memory and disk space events. All I need to know about this event is that the host is down. I just think that it's not necessary to send an alert email about services on a host that is down.
> And so does nagios. As I said earlier, nagios does this automatically. To restate: when a host is down, nagios suppresses all e-mail notifications about that host's services, but will still display them as down in the GUI. It will only send the host down notification. The first section of http://nagios.sourceforge.net/docs/2_0/networkreachability.html states it best. It still applies to 3.x but I haven't found the section that states it as clearly.
> -- Marc

If I'm interpreting your question correctly, you're saying that when one of your servers actually goes down, you ARE getting alerts (email/SMS/whatever) for more than just the host being down?

I see what Marc's saying: he's telling you this shouldn't happen. Nagios was built to check first that the host is up and reachable and, if it's not, to notify you that the host is down but not to ALERT you about all the host-dependent tests that are now failing. Nagios will still try all tests and fail on them, and the web interface will reflect more than just the HOST DOWN, but the only email/SMS you get should be for the HOST DOWN.

However, you may need to clarify what you mean by *down*. *Down* does not always mean off or 100% non-responsive. On *nix systems I've quite often seen a server hang, fail, or segfault but still be reachable over the network. The reason is that parts of the OS are in memory, and things like pings from remote hosts still get a response even though the overall functionality of the host itself is down (ISPs get this a lot: the host pings, but you can't ssh in, for example). If Nagios can ping the host, it will then try the other tests and alert on them.

Here's a quick way to narrow this down: turn off the server (shut down and pull power). The Nagios web interface should show the host down and all tests as failing, but the only email/SMS you should get is the host down. If you still get emailed/alerted about the services, then you might have a configuration error. Perhaps you didn't properly define your host checks as opposed to service checks? Do you have a check_ping or check_icmp host check for each host?

AD
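The difference between a host that is really down and one that is hung but still answers pings can be shown with a toy model. This is only a sketch of the behavior described above, not Nagios code. The function name and the message strings are invented for illustration, and real Nagios also applies retries, notification periods and so on.

```python
def notifications_to_send(host_state, failing_services):
    """Toy model: which notifications go out for one host.

    host_state is "UP", "DOWN" or "UNREACHABLE"; failing_services is a list
    of service descriptions whose checks currently fail.
    """
    if host_state != "UP":
        # Host problem: service notifications are suppressed, only the host one is sent.
        return [f"HOST {host_state}"]
    # Host check passes (e.g. it still answers ping), so each failing service alerts.
    return [f"SERVICE CRITICAL: {name}" for name in failing_services]


# Powered-off server: a single host notification.
print(notifications_to_send("DOWN", ["CPU", "Memory", "Disk"]))
# Hung server that still pings: one notification per failing service.
print(notifications_to_send("UP", ["CPU", "Memory", "Disk"]))
```

The second call is the "hung but pingable" case: because the host check still passes, nothing is suppressed.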
Re: [Nagios-users] Announcement: Brooklyn for Nagios now available for iPhone and iPod Touch!
You mention VPN support, but what about SSL support for exposed but secured Nagios sites? Can it handle self-signed certs, or only those from a valid, public CA?

Also, personally I don't mind the $5 charge, but you might consider giving it away free for 30 days or so, collecting patches and fixes, and then charging for it after the first bugfix release. You might quickly find that just because it works on your Nagios site doesn't mean it will work on all of them, and some code changes might be needed. Just my $0.02 worth.

A. Davis
Email: ncc...@gmail.com

D. Emmanuel Feinsmith wrote:
> Dear Nagios Community,
> I am pleased to announce that the first and only fully native iPhone and iPod Touch app for Nagios, Brooklyn for Nagios, is now on the iPhone store and can be downloaded at:
> http://itunes.apple.com/WebObjects/MZStore.woa/wa/viewSoftware?id=317040435mt=8
> This app is different from iNag and other similar iPhone applications that use Nagios in that it does not require any changes whatsoever to your Nagios server. It is a complete Nagios standalone native client. Just download it to your iPhone or iPod Touch, set up the settings to point to your nagios server, and you're done. If your server is accessible through VPN, first set up your VPN connection on your iPhone or iPod Touch.
> Enjoy! And please send any feedback or requests to dan...@feinsmithsoftware.com
> Coming soon: Support for other platforms, including Groundwork Open Source.
> Best Regards, Daniel.
>
> On May 30, 2009, at 5:05 PM, Marc Powell wrote:
>> On May 30, 2009, at 11:43 AM, Arnar Þórarinsson wrote:
>>> Hi all, How can I make it so that if a host goes down Nagios will only report the host down and not all the services on that host? All I want to get is a host down alert. Yes I know RTFM... but I have and I can't seem to find this in the docs.
>> Nagios will only send an e-mail for the single Host down event but will always display both the host and services as down (because they are). This is not configurable behavior and I can't think of any creative way to change the status of the services in that event that isn't a pain...
>> -- Marc
Re: [Nagios-users] Host dependencies not being picked up...
Double-checking the docs, I think I just realized this. If I understand it right, the parents option is used by Nagios itself (perhaps better worded as internally) to define the network topology for the core host check. If the switch is down, Nagios won't mark a host behind it as Down, only as Unreachable. For a host dependency, I have to define something like this:

    define hostdependency{
        host_name                       Host A
        dependent_host_name             Host C
        notification_failure_criteria   d
        }

    define hostdependency{
        host_name                       Host B
        dependent_host_name             Host C
        notification_failure_criteria   d,u
        }

Am I understanding this right? What gets me is that what Nagios calls a parent is, to me anyway, a network/host dependency. Hosts are dependent on the network and services are dependent on hosts. I guess I'm missing where a host dependency would actually be of any value.

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On Jun 1, 2009, at 1:46 PM, Andrew Davis wrote:
>> The bulk of my systems are behind one of three switches. I just went through my hosts.cfg file and added the parents line and appropriate switch name for each, but the host dependencies aren't showing up.
> Parents and host dependencies are not the same thing. parents is probably what you want and have configured correctly based on the information you've provided. Nagios doesn't count them for the summary.
> -- Marc
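The Down versus Unreachable distinction that parents provides can be written down as a small decision function. This is a simplified sketch of the behavior discussed in this thread, not the actual Nagios implementation. The function name `classify` and the dictionaries are illustrative, while the host names are the ones from the original post.

```python
def classify(host, check_ok, parents, states):
    """Return "UP", "DOWN" or "UNREACHABLE" for a host.

    parents maps host -> list of parent host names;
    states maps already-evaluated hosts -> their state.
    """
    if check_ok:
        return "UP"
    host_parents = parents.get(host, [])
    # If every parent is itself not UP, the failure is blamed on the path to the host.
    if host_parents and all(states.get(p) != "UP" for p in host_parents):
        return "UNREACHABLE"
    # No parents defined, or at least one parent is UP: the host itself is the problem.
    return "DOWN"


parents = {"delos": ["flscrb234srvsw6"], "aten": ["flscrb234srvsw5"]}
states = {"flscrb234srvsw5": "UP", "flscrb234srvsw6": "DOWN"}

print(classify("delos", False, parents, states))  # its switch is down
print(classify("aten", False, parents, states))   # its switch is up
```

A hostdependency object, by contrast, is an explicit rule between two hosts that need not be on the same network path, which is why the two are configured and counted separately.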
Re: [Nagios-users] Vautour Style on Nagios 3.1.0
FWIW: it says specifically that it's for Nagios 3.0.x, not 3.1.x. We can both hope and assume it would work, but until the dev or someone else makes it work, there are no guarantees.

A. Davis
Email: ncc...@gmail.com

Derek J. Morris wrote:
> I am having an issue installing this neat style to the UI of Nagios 3.1.0. It won't render correctly when I install it; the sidebar is just text with a white background and there is no top bar either. Anybody come across this?
Re: [Nagios-users] Error while configuring NRPE on solaris
FYI: I had lots of issues getting NRPE working on Solaris x86 and SPARC and on OpenSolaris. I had lots of library errors when compiling, and after I got past those I continued to have issues getting NRPE to start from inetd/xinetd. I eventually just used check_by_ssh.

A. Davis
Email: ncc...@gmail.com

N Patil wrote:
> Hi everyone,
> I have successfully installed and configured NRPE on Solaris 10, but when I run
>
>     /usr/local/nagios/libexec/check_nrpe -H localhost
>
> I get:
>
>     CHECK_NRPE: Error - Could not complete SSL handshake.
>
> I compiled with:
>
>     ./configure --with-ssl-lib=/usr/sfw/lib --with-ssl-inc=/usr/sfw/include --with-ssl=/usr/sfw
>
> I can also see the following error on the console as well as in the /var/adm/messages logs:
>
>     May 28 19:08:26 solaris10.remotehost.com inetd[24233]: [ID 702911 daemon.error] Failed to set credentials for the inetd_start method of instance svc:/network/nrpe/tcp:default (chdir: No such file or directory)
>     May 28 19:15:27 solaris10.remotehost.com inetd[24241]: [ID 702911 daemon.error] Failed to set credentials for the inetd_start method of instance svc:/network/nrpe/tcp:default (chdir: No such file or directory)
>
> I have already tried a few solutions after googling, but to no avail. Please help me resolve this problem.
> Thanks, Nilesh
Re: [Nagios-users] Monitoring switches via SNMP --
I would agree: get all the relevant devel and necessary library packages. For some plugins, you can go straight into the plugins directory and compile that one plugin (once all the prerequisites are there), then copy the binary to your libexec folder. I had to do this for a client with MySQL. Initially the machine didn't have MySQL, so the check wasn't compiled. I installed MySQL including the dev packages and libs, then dropped into the ~nagios-plugins/plugins folder and did a make check_mysql, then copied the check_mysql binary to my libexec folder and all was fine.

A. Davis
Email: ncc...@gmail.com

Paul Weaver wrote:
>> checking for init_snmp in -lnetsnmp... no
>> configure: error: snmp library not found
>> I looked through the Ubuntu package website, and tried installing any package I could find that's related to SNMP, but already have them all installed.
> It's probably a -dev package, which you need to compile, but not to run, the plugin. Try installing libsnmp-dev
>> snmp is already the newest version.
>> libsnmp-base is already the newest version.
>> libsnmp15 is already the newest version.
>> snmpd is already the newest version.
>> libsnmp-perl is already the newest version.
>> Is there an inherent incompatibility between Ubuntu 8.10 and the check_snmp plugin? Do I need to go the all Perl/CPAN route? I can perform snmpwalks, but I don't know of any other way to find out what I'm missing.
> No, though I personally prefer using perl to write my own plugins. You rarely need to use cpan with ubuntu.
> --
> The probability of someone watching you is proportional to the stupidity of your action.
> Paul Weaver
> Systems Development Engineer
> News Production Facilities, BBC News
Re: [Nagios-users] Unable to open MRTG log file
Have you checked the ownership of the MRTG log file to make sure your nagios user has at least read access? As your nagios user, try to cat the file and see whether you get a "permission denied" message.

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Dei Bertine wrote:
> Hi folks,
> I've been getting this message on my Nagios page under mrtg:
>
>     check_mrtgtraf: Unable to open MRTG log file
>
> and I'm unable to figure out why. Here's my config file:
>
>     # Monitor Port 1 Bandwidth Usage
>     define service{
>         use                  mrtg-service
>         host_name            ciscoasa-firewall
>         service_description  SDFWPORT1_BAND_USE
>         check_command        check_local_mrtgtraf!/var/www/mrtg/192.168.1.253_1.log!AVG!100,100!500,500!10
>     }
>
> When I launch the command manually, I get the same error. Here's what my command looks like:
>
>     # 'check_local_mrtgtraf' command definition
>     define command{
>         command_name  check_local_mrtgtraf
>         command_line  $USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w $ARG3$ -c $ARG4$ -e $ARG5$
>     }
>
> Please someone advise. Thank you!
> DB
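The permission check suggested in this reply can also be scripted. What follows is only a minimal sketch in Python, assuming it is run as the nagios user (for example through sudo -u nagios); the function name check_readable is made up for illustration, and the path is the one from the quoted service definition.

```python
def check_readable(path):
    """Try to open path for reading as the current user and report what happened."""
    try:
        with open(path, "rb") as handle:
            handle.read(1)  # force an actual read, not just an open
    except FileNotFoundError:
        return "missing"
    except PermissionError:
        return "permission denied"
    except OSError as exc:
        return "error: %s" % exc
    return "readable"


# Path taken from the service definition quoted in this thread.
print(check_readable("/var/www/mrtg/192.168.1.253_1.log"))
```

If this prints "permission denied" when run as the nagios user but "readable" when run as the MRTG user, file ownership or mode is the likely culprit. "missing" would point at the path in the service definition instead.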
[Nagios-users] Monitoring clustered resources with Windows...
One of our admins is actively migrating us from Server 2003 to Server 2008 and using the built-in clustering capabilities of '08 to enable service-level failover. So far, he's done so with Exchange, print server services, and SQL. I'm wondering how best to monitor shared resources on Windows hosts from Nagios.

At present, we use nsclient++ to watch the physical servers. This is good for basic checks of load average, memory, local disk consumption, etc. I can even monitor services that are running. Now, I know I can monitor anything that's accessible from an IP and port, but I'm somewhat stumped on other resources...

For example, we have two physical Exchange servers. They're in a cluster and the various Exchange services are only active on one node at a time. I can watch OWA as it's accessible from an IP and port, but the Exchange services themselves will stop on one server and start on the other if a server fails. Nagios can't dynamically adjust to watch this service on the new node. It will only yell that it's down on the failed node.

Clustered file storage is another example. Again, I can watch the local CPU, memory, local C: drive, etc. But let's say it's sharing a large volume as drive F:. I can watch this fine on the primary node, but if it fails over, it's no longer accessible from that node as it's being shared on the new active node.

I'm curious whether any Nagios users are using clustered resources on the Windows side, and how you handle failover of services that aren't necessarily accessible by IP and port...

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Monitoring clustered resources with Windows...
Thanks. Unfortunately, I doubt our Windows admin is going to install the HP agents on all the systems. I wonder if the MIBs for the cluster service in Server 2008 are public and queryable... off to dig around.

A. Davis
Email: ncc...@gmail.com

James Pratt wrote:
> Apologies - that site is really acting strange lately. I used to use this one, but it requires HP hardware, and since we are mostly a VMware shop, it's no good to me anymore:
> http://www.monitoringexchange.org/cgi-bin/page.cgi?g=Detailed%2F1452.html;d=1
> (Let me know if that link is broken as well. If so, you can try to google for check_mscs_hpma instead.)
> Sorry!
> James
>
> -----Original Message-----
> From: Andrew Davis [mailto:ncc...@gmail.com]
> Sent: Wednesday, May 20, 2009 2:01 PM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Monitoring clustered resources with Windows...
>
> Were you trying to link to a specific project/plugin ID? It just took me to the main nagiosexchange page, which I've already searched, and it's coming up dry for add-ons that would address my question... maybe your URL was bad?
>
> James Pratt wrote:
>> Seems there is a new resource for this since The Fork...
>> http://www.monitoringexchange.org/cgi-bin/page.cgi?d=1
>> hth, regards
>> Jamie
>>
>> -----Original Message-----
>> From: Andrew Davis [mailto:ncc...@gmail.com]
>> Sent: Wednesday, May 20, 2009 1:19 PM
>> To: nagios-users@lists.sourceforge.net
>> Subject: [Nagios-users] Monitoring clustered resources with Windows...
>>
>> [.. snip ..]
Re: [Nagios-users] Monitoring clustered resources with Windows...
Except that we're a 100% Sun and Dell shop (mostly Sun, phasing out Dell... though I guess I'll be calling Sun "Oracle" soon enough). Hence the reluctance to install the HP agents...

Thanks for the link all the same. It offers hope that if I can get at the native Server 2008 SNMP items for the cluster resources, there may be a solution.

A. Davis
Email: ncc...@gmail.com

James Pratt wrote:
> Understood. It would be worth a shot to ask them anyhow, though. IMO, as the HP hardware guy here, it's a *lot* easier to troubleshoot hardware issues with the agents/management pages, even if you don't use Insight Manager/SIM, no matter what OS is on the HP servers. But this is now off-topic, apologies! :(
>
> -----Original Message-----
> From: Andrew Davis [mailto:ncc...@gmail.com]
> Sent: Wednesday, May 20, 2009 3:34 PM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Monitoring clustered resources with Windows...
>
> [.. snip ..]
Re: [Nagios-users] Unable to open MRTG log file
On most systems, MRTG runs as its own user, and so does Nagios. Make sure that the nagios user has permission to read the files owned by the MRTG user...

A. Davis
Email: ncc...@gmail.com

Dei Bertine wrote:
> Hi folks,
> I've been getting this message on my Nagios page under mrtg:
>
>     check_mrtgtraf: Unable to open MRTG log file
>
> [.. snip ..]
Re: [Nagios-users] Both hostgroup_name and host_name in services.cfg
Well, yes and no. The checks of the config files didn't show any errors, but that doesn't mean the developers accounted for the situation or that I won't see any anomalies as a result. I was hoping that another user might have had some experience with it... something like "hey, I tried this and it broke that" or "no problems here, go for it". That's what lists like this are for. I already RTFM'd and it's not mentioned. Hence why I asked the list.

And as for it being faster to try than to wait... not sure about you, but my Nagios server is considered a production system. Thus, for me it pays to wait for a response and play it safe rather than rush ahead and potentially break something... few people in IT ever muck things up by waiting for good input, but quite a few have screwed stuff up by jumping ahead without enough information.

FWIW: I have gone ahead and tried it and it does appear to work, though I'll have to wait to see if any issues result.

Thanks for taking the time to write an answer that didn't provide any input on the actual question I posed.

A. Davis
Email: ncc...@gmail.com

Andreas Ericsson wrote:
> Andrew Davis wrote:
>> If I want to use the same test(s) for a hostgroup and an individual host, am I able to define both in the same entry?
>
> This falls into the try-and-find-out category of questions. It would have been faster for you to do that than it was sending an email and awaiting the response.
Re: [Nagios-users] Both hostgroup_name and host_name in services.cfg
That's a good idea. Thanks. I hadn't really thought of defining a group, then excluding certain hosts.

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On May 18, 2009, at 9:51 AM, Andrew Davis wrote:
>> Well, yes and no. The checks of the config files didn't show any errors, but that doesn't mean the developers accounted for the situation or that I won't see any anomalies as a result.
>
> You'd be able to see the services assigned to the hosts, so that's a strong indicator it's good. In any event:
> http://nagios.sourceforge.net/docs/3_0/objecttricks.html#service - Fourth example.
> --
> Marc
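To make the "define a group, then exclude certain hosts" idea concrete, here is a small Python sketch of how the effective host list of such a service could be worked out. It is a simplified model for illustration only, not how Nagios itself is implemented; the function name effective_hosts is invented, the "!" prefix for exclusion follows the documentation example Marc linked, and the host names are just samples.

```python
def effective_hosts(hostgroups, hostgroup_names, host_names):
    """Union of hostgroup members and named hosts, minus hosts prefixed with '!'."""
    included, excluded = set(), set()
    for group in hostgroup_names:
        included.update(hostgroups[group])
    for host in host_names:
        if host.startswith("!"):
            excluded.add(host[1:])  # explicitly excluded host
        else:
            included.add(host)      # individually added host
    return sorted(included - excluded)


# Sample data: one hostgroup plus one extra host defined outside it.
groups = {"linux-servers": ["imhotep", "osiris1"]}
print(effective_hosts(groups, ["linux-servers"], ["seth"]))      # group plus one host
print(effective_hosts(groups, ["linux-servers"], ["!imhotep"]))  # group minus one host
```

In both cases the resulting list can be compared against the services Nagios actually shows per host, which is the sanity check Marc describes.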
[Nagios-users] OID's missing for check_snmp_cpfw.pl
I'm trying to make use of check_snmp_cpfw.pl against a pair of redundant CP firewalls. For any test I try, I see:

    cannot find oids / CPFW Status : CRITICAL

I've verified all the Perl stuff, including Net::SNMP and Getopt::Long.

    ./check_snmp_cpfw.pl -H 10.1.1.250 -C obfuscated -swm
    FW : cannot find oids / SVN : cannot find oids / MGMT : cannot find oids / CPFW Status : CRITICAL

It appears from the website that the script was last updated in 2007. We're running IP250s with the latest Firewall-1 s/w. Perhaps the script is too old for our h/w or s/w? I'm thinking that maybe the script only works with FW-1 on local UNIX servers? But since it's going over SNMP over the network, I sort of doubt this.

Has anyone else experienced this? Any solution? Is anyone else using the latest, greatest FW-1 s/w on an IP250 with this script working for them?

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Odd check_http error
You were correct. My check command showed:

    # 'check_http' command definition
    define command{
        command_name  check_http
        command_line  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
    }

But my test was passing two arguments (-p and -u), so the second one was never handed to the plugin. Once I updated the command definition to use $HOSTADDRESS$ $ARG1$ $ARG2$, all worked immediately.

Thanks for the help. :)

A. Davis
Email: ncc...@gmail.com

Jon Angliss wrote:
> On Fri, 15 May 2009 16:17:50 -0400, Andrew Davis ncc...@gmail.com wrote:
>> Oddly enough, low-level tests show an HTTP 302, which I expect. But Nagios is offering up a 400 error and a warning:
>> From my workstation, confirming the server is configured correctly:
>>
>>     gentoo:~ adavis$ telnet seth 8080
>>     Trying 10.1.1.23...
>>     Connected to seth.fl.ad.scripps.edu.
>>     Escape character is '^]'.
>>     GET /gp HTTP/1.0
>>     Host: seth
>>     Accept: */*
>>     Connection: Keep-Alive
>>     Pragma: no-cache
>
> Run the check_http as defined in the command definition, appending the -v argument. You'll get to see what the check_http command is executing. I suspect your command definition is incorrect, and you're missing something that is supposed to be triggering the host header.
>
> [.. snip ..]
>
>> From services.cfg:
>>
>>     ## HTTP - alternate port
>>     define service {
>>         host_name              seth
>>         service_description    HTTP
>>         check_command          check_http!-p 8080!-u /gp/pages/login.jsf
>>         notes                  http://$HOSTADDRESS$:8080/gp/pages/login.jsf
>>         max_check_attempts     3
>>         normal_check_interval  15
>>         retry_check_interval   1
>>         check_period           24x7
>>         notification_interval  120
>>         notification_period    24x7
>>         notification_options   w, u, c, r, f, s
>>         contact_groups         unixadmins
>>         action_url             /nagios/pnp/index.php?host=$HOSTNAME$&srv=$SERVICEDESC$
>>     }
>
> Where is your define command?
>
>> And the result in Nagios:
>>
>>     HTTP WARNING: HTTP/1.1 400 No Host matches server name seth
>
> Hrm... this is odd, looks like you are posting the right host entry (would still like to see the command definition). The host entry for seth: is the address defined the same as the IP you are getting when you telnet to the name? ie:
>
>     define host {
>         host_name  seth
>         address    10.1.1.23
>     }
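The fix above comes down to how the "!"-separated arguments in check_command map onto $ARG1$, $ARG2$, and so on. The following Python sketch mimics that substitution to show why the -u argument went missing. It is only an approximation for illustration: expand_command is an invented name, the $USER1$ value is an assumed plugin directory, and details such as escaped "!" characters are ignored.

```python
import re


def expand_command(command_line, check_command, host_address,
                   user1="/usr/local/nagios/libexec"):  # assumed plugin directory
    """Substitute $USER1$, $HOSTADDRESS$ and $ARGn$ into a command_line template."""
    _name, *args = check_command.split("!")
    macros = {"USER1": user1, "HOSTADDRESS": host_address}
    for index, value in enumerate(args, start=1):
        macros["ARG%d" % index] = value
    # Macros without a value expand to an empty string in this sketch.
    expanded = re.sub(r"\$(\w+)\$",
                      lambda match: macros.get(match.group(1), ""),
                      command_line)
    return expanded.strip()


check = "check_http!-p 8080!-u /gp/pages/login.jsf"
# Original definition: only $ARG1$ is referenced, so "-u ..." is silently dropped.
print(expand_command("$USER1$/check_http -I $HOSTADDRESS$ $ARG1$", check, "10.1.1.23"))
# Corrected definition: both arguments reach the plugin.
print(expand_command("$USER1$/check_http -I $HOSTADDRESS$ $ARG1$ $ARG2$", check, "10.1.1.23"))
```

Comparing the two printed command lines shows the same thing Jon's suggestion of running check_http by hand with -v would reveal: with the original definition the request never includes the login page path.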
Re: [Nagios-users] Can't locate Net/SNMP.pm in @INC When Running check_ifstatus
As root, try perl -MCPAN -e shell, then install Net::SNMP. You may want to first do install Bundle::CPAN, then install Net::SNMP.

Also open up the test script (assuming it's not a compiled binary), check all the "use" lines, and make sure those modules are installed. For example, from another script I saw:

    use strict;
    use Net::SNMP;
    use Getopt::Long;

So my steps were:

    sudo su -
    perl -MCPAN -e shell
    install Bundle::CPAN
    reload cpan
    reload index
    install Net::SNMP
    install Getopt::Long

For each, I installed prereqs where prompted...

Also, depending on your OS, you may want to update the OS first before doing the above. For example, with CentOS, a yum -y update is good practice every now and then... keeps you nice and current. But keep in mind that anything built against a specific package may fail (last time I did this, Cacti choked as it was built against a specific version of a library and needed to be recompiled).

Other common Perl modules to consider include DBI, DBD::mysql, etc...

Oh, and put the Net::SNMP line that you changed back... unfortunately, the output of your RPM query is showing the SNMP programs themselves. The script you're using is looking for the Perl module Net::SNMP. net-snmp and Net::SNMP are two completely different things, yet oftentimes codependent.

Hope this helps...

A. Davis
Email: ncc...@gmail.com

Martin Fontanez wrote:
> From: Martin Fontanez jdmfo...@yahoo.com
> Subject: Can't locate Net/SNMP.pm in @INC When Running check_ifstatus
> To: Nagios-users@lists.sourceforge.net
> Date: Monday, May 18, 2009, 3:46 PM
>
> I am getting this error and am not certain what gives. Tried various solutions including creating a link to ../net dir but still get errors. I did yum installs of relevant *snmp* packages. Is there a standard fix for this error?
>
> 1 - The error:
>
>     [r...@localhost libexec]# ./check_ifstatus -H localhost
>     Can't locate Net/SNMP.pm in @INC (@INC contains: /usr/local/nagios/libexec /usr/lib/perl5/site_perl/5.8.8/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.7/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.6/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/site_perl/5.8.8 /usr/lib/perl5/site_perl/5.8.7 /usr/lib/perl5/site_perl/5.8.6 /usr/lib/perl5/site_perl/5.8.5 /usr/lib/perl5/site_perl /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.7/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.6/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi /usr/lib/perl5/vendor_perl/5.8.8 /usr/lib/perl5/vendor_perl/5.8.7 /usr/lib/perl5/vendor_perl/5.8.6 /usr/lib/perl5/vendor_perl/5.8.5 /usr/lib/perl5/vendor_perl /usr/lib/perl5/5.8.8/i386-linux-thread-multi /usr/lib/perl5/5.8.8 .) at ./check_ifstatus line 39.
>     BEGIN failed--compilation aborted at ./check_ifstatus line 39.
>
> 2 - SNMP.pm is at the following location (not on the @INC path, but I created a link to ...Net/):
>
>     [r...@localhost libexec]# find / -name SNMP.pm
>     /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi/SNMP.pm
>
> 3 - Looks like I have the relevant RPMs installed:
>
>     [r...@localhost libexec]# rpm -qa | grep -i snmp
>     net-snmp-libs-5.3.2.2-5.el5
>     net-snmp-perl-5.3.2.2-5.el5
>     net-snmp-5.3.2.2-5.el5
>
> 4 - Edited the file ../check_ifconfig and replaced "use Net::SNMP;" with "use SNMP;" but that created other errors.
[Nagios-users] Odd check_http error
I'm seeing the following:

    HTTP WARNING: HTTP/1.1 400 No Host matches server name seth.my.int.domain

This is what the test looks like in Nagios:

    ## HTTP - alternate port
    define service {
        host_name              seth
        service_description    HTTP
        check_command          check_http!-p 8080!-u /gp/pages/login.jsf
        notes                  http://$HOSTADDRESS$:8080/gp/pages/login.jsf
        max_check_attempts     3
        normal_check_interval  15
        retry_check_interval   1
        check_period           24x7
        notification_interval  120
        notification_period    24x7
        notification_options   w, u, c, r, f, s
        contact_groups         unixadmins
        action_url             /nagios/pnp/index.php?host=$HOSTNAME$&srv=$SERVICEDESC$
    }

Oddly enough, all other tests for this host seem to have no issue at all with the hostname. The URL also works in a web browser (http://seth.my.int.domain:8080/gp/pages/login.jsf works correctly). From the Nagios server, I can hit the URL with lynx and curl without issue. There's proper name resolution (forward and reverse) in place. I'm lost...

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Poller errors
The box does indeed have some load on it... hence my other performance tuning request to the list. At present, it's running Nagios, MySQL, and Cacti, with Cacti polling approx. 1500 devices every 5 minutes. It's a fairly good server, but the disk I/O has been problematic on occasion. I think I'll up the timeout to 30 seconds and watch the load.

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On May 14, 2009, at 8:06 AM, Andrew Davis wrote:
>> I'm seeing items very similar to the one below in our log files quite frequently:
>>
>>     nagios: Warning: Service performance data file processing command '/bin/mv /var/nagios/service-perfdata /var/nagios/perfspool/service-perfdata-1241945165' timed out after 5 seconds
>>
>> Anyone know what the cause is and perhaps the solution?
>
> For some reason, it's taking longer than 5 seconds for /bin/mv to finish. Is the box heavily loaded (top, etc)? Are the disks extraordinarily busy (iostat, sar, etc)?
>
>> It appears there's a 5 second timeout defined somewhere, but I can't seem to find it anywhere...
>
>     [~nagios/etc] $ grep timeout nagios.cfg
>     service_check_timeout=60
>     host_check_timeout=30
>     event_handler_timeout=30
>     notification_timeout=30
>     ocsp_timeout=5
>     perfdata_timeout=5
>
> --
> Marc
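The 5 seconds in the warning appears to line up with perfdata_timeout=5 in Marc's grep output. As a rough illustration, the Python sketch below pulls the *_timeout settings out of nagios.cfg-style text so they can be compared side by side. The function name read_timeouts is hypothetical, and the sample text is simply the grep output quoted above rather than a full config file.

```python
def read_timeouts(config_text):
    """Collect every key ending in _timeout from key=value config text."""
    timeouts = {}
    for line in config_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blanks, comments and non-assignments
        key, value = line.split("=", 1)
        key = key.strip()
        if key.endswith("_timeout"):
            timeouts[key] = int(value.strip())
    return timeouts


# Sample input: the settings shown in the grep output in this thread.
SAMPLE = """\
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5
"""

for name, seconds in sorted(read_timeouts(SAMPLE).items()):
    print("%-24s %3d s" % (name, seconds))
```

Raising the timeout, as planned above, only hides the symptom if the disks stay busy, so it is worth rechecking the log and the I/O load after the change.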
[Nagios-users] Concise list of best overall performance tweaks
If running the latest version(s) of Nagios and the plugins, using MySQL, and gathering trending info, is there a list available of the best performance tweaks one can make?

I seem to read a lot that MySQL is a resource hog when it's also on the Nagios server. Some have commented on seeing significant improvements when moving it elsewhere. Others have mentioned adding lines like innodb_flush_log_at_trx_commit = 2 to the MySQL config file, while others have mentioned database indexes, etc. Of course, for non-DBAs, knowing which databases to create indexes on (or how to do it without mucking it up) is up for debate.

Log file rotation is always a good one, but this seems to be built into Nagios 3.x. That said, I've read of people running Cacti on their Nagios server as well, and Cacti has no default log rotation.

So... perhaps everyone could include the steps they took to resolve various performance issues, including the actual details (ie: if making indexes, which ones and why), and we can cull them into the Nagios wiki on NagiosExchange?

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Nagios 3 - Migration to mysql
Plus, for what it's worth, on most systems these days you can move a hard drive to another system with more CPU/RAM, which would avoid mucking up all the Nagios stuff.

Virtualization is an alternative as well. We run Nagios and Cacti on one server and four instances of Splunk on four other servers. Last month, we used VMware's P2V tool to make all five physical servers virtual, then moved them to a Sun X4100 running ESXi. The only cost was the server. The load on each individual virtual server is under 50%, and the ESXi box itself is sitting at only 30% load.

If disk space is an issue, another alternative is to clone the hard drive(s), then perhaps move to another server (when going from IDE or SATA to SAS or SCSI, this is likely a must).

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On May 14, 2009, at 8:10 AM, Harald Böhmecke wrote:
> > Hi, I have a production Nagios instance with 200 clients and 1200
> > services. PNP4Nagios is also installed and working. Actually everything
> > is working just fine, except the machine has a 75-100% CPU load,
> > although I have already tweaked the settings.
>
> 75-100% CPU load isn't necessarily bad. What's the load average and specs
> for the box? What do you see in top using the CPU the most/most often?
> With 1200 services and assuming checks every 5 minutes, any machine in the
> last decade should be able to handle that. Are you sure it's Nagios that's
> the problem?
>
> > I want Nagios to have a MySQL backend on another server so that the CPU
> > load decreases, as other (also critical) services are currently running
> > on this machine and are starting to show errors.
>
> This won't help since Nagios doesn't use a MySQL backend. If you add
> NDOUtils to do this, it's only adding processing, not replacing any
> processing.
>
> > - How do I migrate the existing performance data to the MySQL database
> >   (I have very little knowledge of databases)
>
> You'd have to dump all the RRD databases (man rrddump), munge that data
> and insert it into the DB. It won't help though because of (3) below...
>
> > - What is the command to send the performance data to the MySQL database
>
> This is complex and beyond the scope of this list. It's also pretty much
> unrelated to Nagios. It won't help though because...
>
> > - PNP4Nagios: How do I tell PNP4Nagios to pull its performance data from
> >   the MySQL database
>
> PNP4Nagios doesn't appear to support storing or reading performance data
> from a database.
>
> --
> Marc
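The "dump all the RRD databases" step Marc describes can be sketched roughly as below. This is a minimal sketch under stated assumptions, not a tested migration: `rrdtool dump` is the real subcommand that writes an RRD file out as XML, but the function name `dump_rrds`, the `RRDTOOL` override variable, and the one-subdirectory-per-host layout are illustrative assumptions. It only produces XML; munging that into database rows is left out, and per the reply above it would not reduce load anyway.

```shell
#!/bin/sh
# Assumption: rrdtool is on PATH; override RRDTOOL to point elsewhere.
RRDTOOL="${RRDTOOL:-rrdtool}"

# dump_rrds SRC_DIR OUT_DIR  (hypothetical helper)
# Walks SRC_DIR for *.rrd files and writes an XML dump of each one under
# OUT_DIR, preserving the relative path (e.g. host1/disk.rrd -> host1/disk.xml).
dump_rrds() {
    src_dir=${1%/}
    out_dir=${2%/}
    find "$src_dir" -type f -name '*.rrd' | while IFS= read -r rrd; do
        rel=${rrd#"$src_dir"/}            # path relative to SRC_DIR
        xml="$out_dir/${rel%.rrd}.xml"    # same path, .xml suffix
        mkdir -p "$(dirname "$xml")" || continue
        "$RRDTOOL" dump "$rrd" > "$xml" || echo "dump failed: $rrd" >&2
    done
}
```

It would be called as something like `dump_rrds /usr/local/nagios/share/perfdata /tmp/rrd-xml`, where the perfdata path is a guess at a default source install and should be replaced with wherever PNP actually keeps its RRD files.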
Re: [Nagios-users] Help with graphing
I read through and followed Wolfgang Barth's Nagios 2nd Edition for my Nagios 3.x install. It's been extremely helpful and covers lots of options. For graphing, it covers multiple options with reasons for each, including Nagiosgraph, Perf2rrd, drraw, NagiosGrapher, and PNP.

After reading about them all, I decided on PNP (pnp4nagios) as it was the best mix of ease of installation and functionality. Most that were easy to install were also somewhat limited. Those that were unlimited in scope were also difficult to install. pnp4nagios struck a good balance between both.

There were some tweaks to make, though, so I recommend pnp4nagios but I also recommend Nagios 2nd Edition, as it has all the changes and caveats covered.

A. Davis
Email: ncc...@gmail.com

shadih rahman wrote:
> All,
>
> I am running Nagios with NDOUtils. NDO has been capturing data for about
> 6 months. Now I need to find a good graphing utility that can be
> integrated with Nagios. I am looking for graphing capability from a
> traditional database, not an RRD utility. Can someone suggest one or
> point me in the right direction? Thanks in advance.
>
> --
> Cordially,
> Shadhin Rahman
Re: [Nagios-users] Monitoring switches via SNMP --
You're more likely to get a good response by emailing the entire list than you are by emailing me directly...

A. Davis

Mike Saldivar wrote:
> Hi there,
>
> I have a new installation of Nagios, version 3.0.6, running on Ubuntu
> 8.10. I followed the Quick Start guide for Ubuntu, and everything went
> swimmingly, installing Nagios, the plug-ins, Apache, et al. Nagios started
> up and monitored itself, and following the quick-starts for Linux and
> Windows, I've been able to get it to monitor all my hosts.
>
> So I'm now trying to get it to monitor a Cisco ASA 5510 VPN. On the ASA, I
> enabled SNMP and pointed it to the Nagios machine for reporting. On the
> Nagios box, I copied up the switch.cfg, changed the IP address and
> hostname, etc., following the instructions:
>
>     http://nagios.sourceforge.net/docs/3_0/monitoring-routers.html
>
> After restarting, I see errors such as this. Posting the error, then the
> corresponding portion of the switch.cfg:
>
>     Inside Interface CRITICAL 05-07-2009 16:54:07 0d 2h 53m 31s 3/3
>     (Return code of 127 is out of bounds - plugin may be missing)
>
>     define service{
>         use                     generic-service ; Inherit values from a template
>         host_name               CCLogan-ASA5510-1
>         service_description     Inside Interface
>         check_command           check_snmp!-C public -o ifOperStatus.INSIDE -r 1 -m RFC1213-MIB
>         }
>
> I changed ifOperStatus.1 to ifOperStatus.INSIDE because an snmpwalk on
> the device said that's what this port name was, but I can change it back.
>
> Googling the error (Return code of 127 ...) seems to indicate I'm missing
> net-snmp or net-snmp-util, but I'm not sure how to install them. I can't
> do an rpm -qa on Ubuntu, and I don't know what the corresponding command
> is with apt-get. Searching the Ubuntu repositories, I can't find an
> equivalent to net-snmp-util, so I think that's the problem. Snmpwalk works
> though, so I figure I have something half-working...
>
> The other error I see is:
>
>     Port 1 Bandwidth UNKNOWN 05-07-2009 17:02:24 0d 2h 38m 14s 3/3
>     check_mrtgtraf: Unable to open MRTG log file
>
>     define service{
>         use                     generic-service ; Inherit values from a template
>         host_name               CCLogan-ASA5510-1
>         service_description     Port 1 Bandwidth
>         check_command           check_local_mrtgtraf!/var/lib/mrtg/10.5.1.2_1.log!AVG!100,100!500,500!10
>         }
>
> I looked, and /var/lib/mrtg/ didn't exist. Googling, I saw a tip to change
> it to /var/www/mrtg/, but that directory didn't exist either. I created
> /var/lib/mrtg/, chown'd it to nagios:nagios, and restarted Nagios, but
> still nothing. Is there an MRTG config file I need to edit?
>
> Should I look into check_cisco.pl, or does that work better for the huge
> switches than the itty-bitty 4-porters?
>
> Thanks a heap,
> -Mike
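On the "Return code of 127" error quoted above: 127 is the conventional shell exit status for "command not found" (126 means the file was found but is not executable), which is why Nagios suggests the plugin may be missing. Before chasing SNMP packages, it may be worth confirming that the check_snmp file itself exists. The helper below is a small sketch; the name `plugin_status` is hypothetical and the path in the usage note is only a guess at a default source install.

```shell
#!/bin/sh
# plugin_status PATH  (hypothetical helper)
# Reports whether a plugin file exists and is executable, returning the
# same exit codes a shell would produce when asked to run it.
plugin_status() {
    plugin=$1
    if [ ! -e "$plugin" ]; then
        echo "missing: $plugin"
        return 127    # shell convention: command not found
    elif [ ! -x "$plugin" ]; then
        echo "not executable: $plugin"
        return 126    # shell convention: found but cannot execute
    fi
    echo "present: $plugin"
}
```

For example, `plugin_status /usr/local/nagios/libexec/check_snmp`. If the file turns out to be missing, one possibility (an assumption worth verifying against the plugins' configure output) is that the plugins were compiled before the SNMP command-line tools were installed, in which case check_snmp may have been skipped at build time.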
[Nagios-users] check_disk and CD/DVD in Solaris
I have check_disk set up to monitor all local disks so that I don't have to make edits every time I add a new filesystem or mount point. Unfortunately, on Solaris, mounted CDs and DVDs show up as local volumes and as 100% full. The stock check_disk is also not a simple shell script that I can edit to throw in a grep -v cdrom line.

I'm curious how you guys are handling this. Do you define all your filesystems manually for Solaris hosts? Are you using an alternate test? Any suggestions would be much appreciated...

BTW: please don't remind me of the joys of the open source community and how I can just edit the code. I'm a sysadmin with lots of Perl and shell scripting experience but no C experience, so I'd be more inclined to manually define all the filesystems to be monitored than to figure out how to edit the code. Of course, if one of you knows how to make the edit and wants to send over your .c file for me to compile, I wouldn't complain... :)

--
A. Davis
Email: ncc...@gmail.com
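One approach that avoids both editing C and listing filesystems by hand is to exclude by filesystem type. As far as I know, the stock check_disk accepts `-X` (`--exclude-type`) and allows it to be repeated, but please confirm with `check_disk --help` on your plugin version. The sketch below assumes Solaris mounts CDs as `hsfs` and DVDs as `udfs` (worth verifying with `df -n` on an affected host); the function name and the `CHECK_DISK` override variable are hypothetical.

```shell
#!/bin/sh
# Assumption: default source-install plugin path; override CHECK_DISK if needed.
CHECK_DISK="${CHECK_DISK:-/usr/local/nagios/libexec/check_disk}"

# check_local_disks_skip_optical [WARN_PCT] [CRIT_PCT]  (hypothetical helper)
# Checks all local filesystems (-l) but ignores the filesystem types that
# mounted CDs/DVDs are assumed to use on Solaris.
check_local_disks_skip_optical() {
    warn_pct=${1:-10}    # warn when free space drops below this percent
    crit_pct=${2:-5}     # critical when free space drops below this percent
    "$CHECK_DISK" -l -w "${warn_pct}%" -c "${crit_pct}%" -X hsfs -X udfs
}
```

The same `-X hsfs -X udfs` arguments could instead be appended directly to the existing check_disk command line in nrpe.cfg or the command definition; the wrapper is only there to make the idea easy to try from a shell.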
[Nagios-users] Changing check_http.c
My checkcommands.cfg has a check_http entry that looks like this:

    # 'check_http' command definition
    define command{
        command_name    check_http
        command_line    $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
        }

My services.cfg has a corresponding entry that looks like this:

    ## HTTP - alternate URL suffix
    define service {
        host_name               server
        service_description     HTTP
        check_command           check_http!-p 9090 -u /proteus
        ~
        action_url              /nagios/pnp/index.php?host=$HOSTNAME$&srv=$SERVICEDESC$
        }

When I run this at the command line, I see:

    /usr/local/nagios/libexec/check_http -I server -p 9090 -u /proteus
    HTTP OK - HTTP/1.1 301 Moved Permanently - 0.002 second response time |time=0.002480s;;;0.00 size=577B;;;0

So far, so good, but what I really want to see is the URL in the output. Obviously, looking at the test, I can combine the server name, port, and URL suffix to figure out the URL. Unfortunately, while I'm good at shell scripting and Perl, I'm not a C guy, and the HTTP test (check_http) is written in C. Anyone know what I would change in check_http.c to have the output show something like this:

    /usr/local/nagios/libexec/check_http -I server -p 9090 -u /proteus
    HTTP OK - HTTP/1.1 301 Moved Permanently *http://server:9090/proteus* - 0.002 second response time |time=0.002480s;;;0.00 size=577B;;;0

It's one thing to be told that my website is down, but when I'm running multiple copies of Apache, Tomcat, and custom apps, and hosting perhaps 20 different URL and port combinations, it would be a LOT more convenient if Nagios showed me which URL was failing in the output of the test.

--
A. Davis
Email: ncc...@gmail.com
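Since the goal is only to add the URL to the output, one alternative to patching check_http.c is a thin shell wrapper around the unmodified plugin. The sketch below is exactly that, a sketch: the function name `check_http_with_url` and the `CHECK_HTTP` override variable are hypothetical, it assumes plain http (no `-S`), and it appends the URL just before the perfdata rather than mid-sentence as in the desired output above. The plugin's exit status is passed through so Nagios still sees OK/WARNING/CRITICAL correctly.

```shell
#!/bin/sh
# Assumption: default source-install plugin path; override CHECK_HTTP if needed.
CHECK_HTTP="${CHECK_HTTP:-/usr/local/nagios/libexec/check_http}"

# check_http_with_url HOST PORT PATH  (hypothetical wrapper)
# Runs the stock check_http and re-prints its output with the URL inserted
# in front of the performance data (the part after the first "|").
check_http_with_url() {
    host=$1
    port=$2
    path=$3
    output=$("$CHECK_HTTP" -I "$host" -p "$port" -u "$path")
    status=$?                              # keep the plugin's exit code
    url="http://${host}:${port}${path}"
    case $output in
        *'|'*) printf '%s[%s] |%s\n' "${output%%|*}" "$url" "${output#*|}" ;;
        *)     printf '%s [%s]\n' "$output" "$url" ;;
    esac
    return "$status"
}
```

Saved as a script that ends with `check_http_with_url "$@"`, this could be referenced from a separate command definition, leaving the original check_http command untouched for services that don't need the URL.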
[Nagios-users] Integrating host notes into Nagios html...
I'm thinking about using something like sysreport to generate info about our hosts. It (and other tools like it) can output to HTML. To me that makes it somewhat logical to add a per-host HTML link into Nagios (clicking the host name would take you to a page that shows the detailed info).

Have any of you tried this or something similar? Care to share your experience, what you edited, etc.?

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Nagios and Cacti
And just an FYI from my own experience... putting Nagios and Cacti on the same server has been somewhat problematic for us. We have over 400 network devices between switches, routers, WAPs, etc. We also have about 300 monitored servers.

Initially I had Nagios and Cacti both on one server, with Cacti running via cron every 5 minutes. About every 5 minutes, my shells would become unresponsive for roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved the issue. Running both seems to have hammered the server a bit (4 GB of RAM, 2 x dual-core 2.x GHz CPUs).

We don't integrate Cacti and Nagios, however. Nagios does both trending and alerts for all servers. Cacti does trending only, for all network devices/ports. Once I moved Cacti to its own server, all was fine as far as load/latency went.

A. Davis
Email: ncc...@gmail.com

Marco Tirado wrote:
> Hello:
>
> There are a couple of examples on the Nagios Exchange page of different
> approaches for integrating Nagios and Cacti. You should check those out.
>
> I believe the synchronization is going to cost you time and money. A
> better approach is to use nagios + pnp4nagios (this generates nice
> graphs) + check_snmp_int.pl (this for bandwidth tests). That way you have
> only one place to keep your configuration. There are tons of other SNMP
> plugins you can use for other tests (CPU, memory, etc.).
>
> //Marco
>
> On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney
> crist...@gmail.com wrote:
> > Hi all,
> >
> > I've been looking into making use of Cacti as an SNMP management tool
> > that runs alongside my Nagios instance. Ideally, what I would like to
> > do is have Cacti monitor various SNMP-exposed metrics on my hosts, and
> > then have a service check in Nagios which parses Cacti's results (which
> > I believe are RRD files) and sends alerts etc. Nagios itself will still
> > be used for running direct checks for services running, errors in log
> > files, etc.
> >
> > Does this approach make sense? One issue that I can think of is the
> > difficulty in keeping the config files of Nagios and Cacti
> > synchronised. I was planning on using Lilac Platform as my Nagios
> > config file management tool, but how that is kept in sync with Cacti is
> > a problem.
> >
> > Has anyone ever set up an arrangement like this before?
> >
> > Cheers,
> > Chris
Re: [Nagios-users] Nagios and Cacti
I agree. Initially I had Nagios doing all the trending. But with 400+ network devices, many of them with multiple 48-port blades, I found Cacti was easier to configure... it scaled a lot better. For a smaller network, you could easily do just Nagios. I've had no issues at all with Nagios + PNP for alerts and trending.

In fact, Nagios still watches my core network devices, but not all of their ports. For example, Nagios watches that switch1 is up and available and trends its CPU and memory usage; however, I use Cacti for trending the 6 blades, each with 48 ports, in switch1. This way, if switch1 fails or utilization is too high, Nagios tells me, but if a particular user is hogging all our bandwidth or having lots of packet loss, I find that via Cacti.

A. Davis
Email: ncc...@gmail.com

Daniel Emmanuel Feinsmith wrote:
> It depends on the intensity of your snmp usage. Cacti has a native daemon
> to do large scale snmp getting, and it does a great job of it. So if u
> have hundreds of devices, each with a lot of interfaces, u will probably
> like cacti. The user interface is also well done for graphing snmp data
> and thresholding on it using the threshold plugin.
>
> = Daniel Feinsmith =
> {sent from iPhone}
>
> On Apr 8, 2009, at 8:15 AM, Christopher McAtackney crist...@gmail.com
> wrote:
> > 2009/4/8 Andrew Davis ncc...@gmail.com:
> > > And just an FYI from my own experience... putting Nagios and Cacti on
> > > the same server has been somewhat problematic for us. We have over
> > > 400 network devices between switches, routers, WAPs, etc. We also
> > > have about 300 monitored servers. Initially I had Nagios and Cacti
> > > both on one server, with Cacti running via cron every 5 minutes.
> > > About every 5 minutes, my shells would become unresponsive for
> > > roughly 30 to 90 seconds. Turning off either Nagios or Cacti resolved
> > > the issue. Running both seems to have hammered the server a bit
> > > (4 GB of RAM, 2 x dual-core 2.x GHz CPUs).
> > >
> > > We don't integrate Cacti and Nagios, however. Nagios does both
> > > trending and alerts for all servers. Cacti does trending only, for
> > > all network devices/ports. Once I moved Cacti to its own server, all
> > > was fine as far as load/latency went.
> >
> > That's useful to know Andrew, thanks.
> >
> > Regarding the trending of network devices - is there any reason why
> > this can't be done by Nagios? I intend to install PNP4Nagios to take
> > care of graphing anyway, but I think it would be nice to have all my
> > monitored resources under the one system (for notifications and ease of
> > administration).
> >
> > Is there some major advantage that Cacti provides when it comes to SNMP
> > monitoring of network devices that cannot be achieved with Nagios and
> > the various SNMP plug-ins available for it (e.g. like these ones
> > http://nagios.manubulon.com)?
> >
> > Cheers,
> > Chris
Re: [Nagios-users] Help please - test works fine EXCEPT over NRPE???
I've checked /etc/sudoers, /etc/nagios/nrpe.cfg, /etc/xinetd.d/nrpe, and the script itself (/usr/local/nagios/libexec/check_logs.pl), and none of them have any tty strings in them for me to comment out.

A. Davis
Email: ncc...@gmail.com

Lei Chen wrote:
> Use the visudo command, comment out the following line:
>
>     Defaults    requiretty
>
> and try again.
>
> 2009/4/2 Andrew Davis ncc...@gmail.com:
> > It's running under xinetd, so 'ps aux|grep nrpe' isn't showing the true
> > user. However, the nrpe.cfg is set to use the nobody user. I've added
> > the nobody user to sudo, and even tried it as the nagios user. What I
> > see now from the server is:
> >
> >     /usr/local/nagios/libexec/check_nrpe -H atum -c check_logs
> >     CHECK_NRPE: Socket timeout after 10 seconds.
> >
> > And in the messages file on the client:
> >
> >     Apr 1 22:54:25 atum nrpe[25661]: Running command: /usr/bin/sudo /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> >     Apr 1 22:54:25 atum /usr/bin/sudo: nobody : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> >
> > A. Davis
> >
> > John Stile wrote:
> > > On the client, if you look at 'ps aux |grep nrpe', what user does
> > > nrpe run as? You might have to configure sudo to allow that user to
> > > run the plugin.
> > >
> > > On Wed, 2009-04-01 at 18:27 -0400, Andrew Davis wrote:
> > > > I set up the check_logs.pl
> > > > (http://www.nagiosexchange.org/cgi-bin/page.cgi?g=Detailed%2F1752.html;d=1)
> > > > test and its config file on some local Linux servers tested via
> > > > NRPE. All other NRPE tests work fine (including some custom ones).
> > > > The check_logs.pl works fine locally, but fails over NRPE. I've
> > > > enabled debugging in NRPE, but it's not telling me much more...
> > > >
> > > > Client local test:
> > > >
> > > >     atum:/etc/init.d # /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> > > >     faillog = OK; lastlog = OK; messages = OK; wtmp = OK;
> > > >
> > > > Server test to client via NRPE:
> > > >
> > > >     /usr/local/nagios/libexec/check_nrpe -H atum -c check_logs
> > > >     CHECK_NRPE: No output returned from daemon.
> > > >
> > > > Local log (/var/log/messages) on client when test is run from
> > > > server:
> > > >
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Added command[check_logs]=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> > > >     Apr 1 18:05:52 atum nrpe[1412]: INFO: SSL/TLS initialized. All network traffic will be encrypted.
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Handling the connection...
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Host is asking for command 'check_logs' to be run...
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Running command: /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Command completed with return code 0 and output:
> > > >     Apr 1 18:05:52 atum nrpe[1412]: Return Code: 0, Output:
> > > >
> > > > The response is immediate, so it's not a timeout issue. Other NRPE
> > > > tests work fine:
> > > >
> > > >     /usr/local/nagios/libexec/check_nrpe -H atum -c check_load
> > > >     OK - load average: 0.00, 0.00, 0.00| load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0;
> > > >
> > > >     /usr/local/nagios/libexec/check_nrpe -H atum -c check_memory
> > > >     CHECK_MEMORY OK - 1702M free | free=1785552896b;210236620.8:;105118310.4:
> > > >
> > > > And on the client:
> > > >
> > > >     Apr 1 18:09:25 atum nrpe[1799]: INFO: SSL/TLS initialized. All network traffic will be encrypted.
> > > >     Apr 1 18:09:25 atum nrpe[1799]: Handling the connection...
> > > >     Apr 1 18:09:25 atum nrpe[1799]: Host is asking for command 'check_load' to be run...
> > > >     Apr 1 18:09:25 atum nrpe[1799]: Running command: /usr/local/nagios/libexec/check_load -r -w 5.0 -c 10.0
> > > >     Apr 1 18:09:25 atum nrpe[1799]: Command completed with return code 0 and output: OK - load average: 0.00, 0.00, 0.00| load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0;
> > > >     Apr 1 18:09:25 atum nrpe[1799]: Return Code: 0, Output: OK - load average: 0.00, 0.00, 0.00|load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0;
> > > >     Apr 1 18:09:26 atum nrpe[1802]: INFO: SSL/TLS initialized. All network traffic will be encrypted.
> > > >     Apr 1 18:09:26 atum nrpe[1802]: Handling the connection...
> > > >     Apr 1 18:09:26 atum nrpe[1802]: Host is asking for command 'check_memory' to be run...
> > > >     Apr 1 18:09:26 atum nrpe[1802]: Running command: /usr/local/nagios/libexec/check_memory.pl -w 10% -c 5%
> > > >     Apr 1 18:09:26 atum nrpe[1802]: Command
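The requiretty suggestion in this thread can be double-checked without eyeballing the file. The helper below is a rough sketch: the name `has_requiretty` is hypothetical, it needs enough privilege to read the sudoers file, and it only looks at the one file it is given (it does not follow any include directives). It reports success when an uncommented Defaults line enables requiretty and ignores lines that negate it with `!requiretty`.

```shell
#!/bin/sh
# has_requiretty [SUDOERS_FILE]  (hypothetical helper)
# Exit status 0 if an active (uncommented, non-negated) "Defaults ... requiretty"
# line is present, non-zero otherwise.
has_requiretty() {
    sudoers=${1:-/etc/sudoers}
    # First grep: uncommented Defaults lines mentioning requiretty.
    # Second grep: succeed only if one of them is not the negated form.
    grep -E '^[[:space:]]*Defaults.*requiretty' "$sudoers" | grep -qv '!requiretty'
}
```

Typical use would be `has_requiretty && echo "requiretty is active"`. If it reports nothing, that matches the observation above that there is no tty setting to comment out, and the timeout is more likely to have another cause.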
Re: [Nagios-users] Help please - test works fine EXCEPT over NRPE???
I did that as a test and removed the entry later. I haven't created a nagios user for any of the NRPE clients. I'll try doing that...

A. Davis
Email: ncc...@gmail.com

John Stile wrote:
> Can you add a line to your /etc/xinetd.d/nrpe to run as a different user?
> Something like:
>
>     user = nagios-nrpe
>
> Next, create that user in /etc/passwd, /etc/shadow, and /etc/group. The
> user doesn't need a password set or a login shell.
>
> Next, add the user to the groups needed to see the logs, or set up sudo
> for that user to see the logs.
>
> I think giving 'nobody' sudo rights isn't the best route.
>
> On Thu, 2009-04-02 at 15:52 +0800, Lei Chen wrote:
> > Use the visudo command, comment out the following line:
> >
> >     Defaults    requiretty
> >
> > and try again.
> >
> > 2009/4/2 Andrew Davis ncc...@gmail.com:
> > > It's running under xinetd, so 'ps aux|grep nrpe' isn't showing the
> > > true user. However, the nrpe.cfg is set to use the nobody user. I've
> > > added the nobody user to sudo, and even tried it as the nagios user.
> > > What I see now from the server is:
> > >
> > >     /usr/local/nagios/libexec/check_nrpe -H atum -c check_logs
> > >     CHECK_NRPE: Socket timeout after 10 seconds.
> > >
> > > And in the messages file on the client:
> > >
> > >     Apr 1 22:54:25 atum nrpe[25661]: Running command: /usr/bin/sudo /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> > >     Apr 1 22:54:25 atum /usr/bin/sudo: nobody : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> > >
> > > A. Davis
> > >
> > > John Stile wrote:
> > > > On the client, if you look at 'ps aux |grep nrpe', what user does
> > > > nrpe run as? You might have to configure sudo to allow that user
> > > > to run the plugin.
Re: [Nagios-users] Additional Info in email?
Perhaps I need to ask a clarifying question... in my checkcommands.cfg, should I change $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$, or should I add the $LONGSERVICEOUTPUT$ macro to the notify-service-by-email command?

I'm thinking that some tests will write some data to $SERVICEOUTPUT$, but others can include additional info with $LONGSERVICEOUTPUT$, so rather than replace $SERVICEOUTPUT$ with $LONGSERVICEOUTPUT$, I should add it? Perhaps something like this:

Before:

    ~\n\nAdditional Info:\n\n$SERVICEOUTPUT$

After:

    ~\n\nInfo:\n\n$SERVICEOUTPUT$\n\nAdditional Info:\n\n$LONGSERVICEOUTPUT$

A. Davis
Email: ncc...@gmail.com

Marc Powell wrote:
> On Mar 31, 2009, at 4:19 PM, Andrew Davis wrote:
> > This seems to have worked for the WARNING or CRITICAL notifications,
> > but not for recovery emails. If I grep for Additional Info in my
> > checkcommands.cfg, it only matches the notify-service-by-email command
> > definition, yet in recovery emails, I still see the Additional Info
> > line at the bottom, but no content. Any ideas on how I would get the
> > $LONGSERVICEOUTPUT$ in a recovery email as well?
>
> It should just work. I'm not aware of, nor can I find, any difference in
> the code for handling macros differently based on the notification type.
> Maybe I'm overlooking it, but I haven't heard of the problem before...
>
> --
> Marc
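To see what the "After" layout would produce once Nagios has substituted both macros, the printf step can be tried by hand. This is only an illustration under stated assumptions: the function name `render_notification_body` is hypothetical, the two arguments stand in for the already-expanded $SERVICEOUTPUT$ and $LONGSERVICEOUTPUT$ values, and it assumes the notification command feeds its text through printf with `%b` as the sample notify-service-by-email definition does.

```shell
#!/bin/sh
# render_notification_body SHORT_OUTPUT LONG_OUTPUT  (hypothetical helper)
# Mimics the tail end of the notification body: the short plugin output under
# "Info:" and the long output under "Additional Info:". %b expands the \n
# escape sequences, including any that appear inside the arguments.
render_notification_body() {
    service_output=$1
    long_service_output=$2
    printf '%b\n' "Info:\n\n${service_output}\n\nAdditional Info:\n\n${long_service_output}"
}
```

Running `render_notification_body "DISK WARNING - /var 91% used" ""` shows what a notification looks like when a plugin returns only one line of output: the Additional Info heading is printed with nothing under it, which is the same symptom described for the recovery emails.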
[Nagios-users] Help please - test works fine EXCEPT over NRPE???
I setup the check_logs.pl (http://www.nagiosexchange.org/cgi-bin/page.cgi?g=Detailed%2F1752.html;d=1) test and its config file on some local Linux servers tested via NRPE. All other NRPE tests work fine (including some custom ones). The check_logs.pl works fine locally, but fails over NRPE. I've enabled debugging in NRPE, but its not telling me much more... Client local test: atum:/etc/init.d # /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg faillog = OK; lastlog = OK; messages = OK; wtmp = OK; Server test to client via NRPE: /usr/local/nagios/libexec/check_nrpe -H atum -c check_logs CHECK_NRPE: No output returned from daemon. Local log (/var/log/messages) on client when test is run from server: Apr 1 18:05:52 atum nrpe[1412]: Added command[check_logs]=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg Apr 1 18:05:52 atum nrpe[1412]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Apr 1 18:05:52 atum nrpe[1412]: Handling the connection... Apr 1 18:05:52 atum nrpe[1412]: Host is asking for command 'check_logs' to be run... Apr 1 18:05:52 atum nrpe[1412]: Running command: /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg Apr 1 18:05:52 atum nrpe[1412]: Command completed with return code 0 and output: Apr 1 18:05:52 atum nrpe[1412]: Return Code: 0, Output: The response is immediate, so its not a timeout issue. Other NRPE tests work fine: /usr/local/nagios/libexec/check_nrpe -H atum -c check_load OK - load average: 0.00, 0.00, 0.00|load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0; /usr/local/nagios/libexec/check_nrpe -H atum -c check_memory CHECK_MEMORY OK - 1702M free | free=1785552896b;210236620.8:;105118310.4: And on the client: Apr 1 18:09:25 atum nrpe[1799]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Apr 1 18:09:25 atum nrpe[1799]: Handling the connection... 
Apr 1 18:09:25 atum nrpe[1799]: Host is asking for command 'check_load' to be run... Apr 1 18:09:25 atum nrpe[1799]: Running command: /usr/local/nagios/libexec/check_load -r -w 5.0 -c 10.0 Apr 1 18:09:25 atum nrpe[1799]: Command completed with return code 0 and output: OK - load average: 0.00, 0.00, 0.00|load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0; Apr 1 18:09:25 atum nrpe[1799]: Return Code: 0, Output: OK - load average: 0.00, 0.00, 0.00|load1=0.000;5.000;10.000;0; load5=0.000;5.000;10.000;0; load15=0.000;5.000;10.000;0; Apr 1 18:09:26 atum nrpe[1802]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Apr 1 18:09:26 atum nrpe[1802]: Handling the connection... Apr 1 18:09:26 atum nrpe[1802]: Host is asking for command 'check_memory' to be run... Apr 1 18:09:26 atum nrpe[1802]: Running command: /usr/local/nagios/libexec/check_memory.pl -w 10% -c 5% Apr 1 18:09:26 atum nrpe[1802]: Command completed with return code 0 and output: CHECK_MEMORY OK - 1703M free | free=1786134528b;210236620.8:;105118310.4: Apr 1 18:09:26 atum nrpe[1802]: Return Code: 0, Output: CHECK_MEMORY OK - 1703M free | free=1786134528b;210236620.8:;105118310.4: Here's the local command in my /etc/nagios/nrpe.cfg: command[check_logs]=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg And on the server (when done in services.cfg, though its failing with manual tests too): define service { hostgroup_name linux-servers service_description LOGS check_command check_nrpe!check_logs max_check_attempts 3 normal_check_interval 15 retry_check_interval5 check_period24x7 notification_interval 120 notification_period 24x7 notification_optionsw, u, c, r, f, s contact_groups unixadmins } Considering it fails with a manual test (command line), I doubt its my services.cfg entry. It runs fine when called locally, so I'm thinking it could be an issue on the client in the nrpe.cfg, but if so I can't find it... I *do* see the obvious... 
namely, the other two tests that run over NRPE have something after "Output:" and check_logs.pl does not. Called at the command line, however, it does produce output, which is what stumps me. What would cause the test to run fine locally but return nothing when called via NRPE? (BTW: I'm running Nagios 3.x with the latest set of plugins and NRPE.) One more thing: I know *someone* is going to ask why I don't just use the built-in check_log test. The answer is that check_logs.pl allows for multiple files and pattern matches and a seek file to speed things
Re: [Nagios-users] Help please - test works fine EXCEPT over NRPE???
It's running under xinetd, so 'ps aux|grep nrpe' isn't showing the true user. However, nrpe.cfg is set to use the nobody user. I've added the nobody user to sudoers, and even tried it as the nagios user. What I see now from the server is:

/usr/local/nagios/libexec/check_nrpe -H atum -c check_logs
CHECK_NRPE: Socket timeout after 10 seconds.

And in the messages file on the client:

Apr 1 22:54:25 atum nrpe[25661]: Running command: /usr/bin/sudo /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg
Apr 1 22:54:25 atum /usr/bin/sudo: nobody : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg

A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

John Stile wrote:
On the client, if you look at 'ps aux |grep nrpe' what user does nrpe run as? You might have to configure sudo to allow that user to run the plugin.
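One way to narrow down a "works locally, empty over NRPE" problem is to run the plugin the way a daemon would: with a minimal environment, from /, and with stdout and stderr captured separately. The sketch below assumes the difference lies in the environment or the user rather than in NRPE itself. run_like_nrpe is a hypothetical helper, not part of NRPE, and the demo command is a stand-in for the real plugin.

```shell
#!/bin/sh
# run_like_nrpe: hypothetical helper (not part of NRPE).
# Runs a command with an empty environment from /, then reports the
# exit code, stdout and stderr separately.
run_like_nrpe() {
    err_file=$(mktemp) || return 1
    out=$(cd / && env -i PATH=/usr/bin:/bin "$@" 2>"$err_file")
    rc=$?
    printf 'rc=%s\nstdout=[%s]\nstderr=[%s]\n' "$rc" "$out" "$(cat "$err_file")"
    rm -f "$err_file"
}

# Demo with a stand-in command: a variable exported in the login shell
# is not visible once the environment is emptied.
DEMO_SETTING=1
export DEMO_SETTING
run_like_nrpe sh -c 'echo "setting=${DEMO_SETTING:-unset}"; echo "note on stderr" >&2'
```

With the real plugin, the call would be run_like_nrpe /usr/local/nagios/libexec/check_logs.pl -c /etc/nagios/check_logs_linux.cfg, ideally from a shell started as the account NRPE runs under (for example via sudo -u nobody as root). Anything that shows up only in the stderr field is output the daemon would not pass back as plugin output.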
[Nagios-users] Additional Info in email?
I'm using the check_multiprocs add-on available from NagiosExchange. With it configured for a host, if I drill in on the test in the web interface, I see:

Current Status: OK (for 0d 8h 51m 46s)
Status Information: PROCS OK
cron ok: 1
syslogd ok: 1
xntpd ok: 1
sshd ok: 7
sendmail ok: 2
rpcbind ok: 1
inetd ok: 1
lockd ok: 1
statd ok: 1
nfsd ok: 1
rpc.metad ok: 1
mountd ok: 2

However, last night one of the services was restarted (probably syslogd) due to cron. I received an alert via email about it, but all I saw was:

* Nagios *
Notification Type: PROBLEM
Service: PROCS
Host: server [truncated by me]
Address: IP address [truncated by me]
State: WARNING
Date/Time: Tue Mar 31 06:04:47 EDT 2009
Additional Info: PROCS WARNING

What I need is to see which processes are down in the Additional Info: field when it's emailed. Anyone know how I would go about this? Unfortunately, I'm not a Perl guy... I've included the script below...

# libexec cat check_multiprocs
#!/usr/bin/perl -w
# Check Multiprocs
use Getopt::Long;

sub fail($);

my ($opt_c, $opt_h);
my $PROGNAME = "PROCS";
my $STATE = "OK";    # Initially, we're fine
my $RETSTRING = "";

# SYSTEM COMMANDS
#my $PS = "/bin/ps ax";
my $PS = "/bin/ps -aef";
my $GREP = "/bin/egrep";

my %RETVALUE = ( 'OK' => 0, 'WARNING' => 1, 'CRITICAL' => 2 );

GetOptions ("c=s" => \$opt_c, "config=s" => \$opt_c, "h" => \$opt_h, "help" => \$opt_h);

if ($opt_h) { print_help(); exit 3; }
($opt_c) || fail("No config file specified");
my $inFile = $1 if ($opt_c =~ /([-.\/a-zA-Z0-9]+)/);
open (INFILE, $inFile) || fail("Could not open file: $inFile");

while (<INFILE>) {
    my ($wmin, $wmax, $cmin, $cmax);
    next if (/^#|^$/);    # No comments
    my @line = split(/;/);
    ($#line == 5) || fail("Error parsing $inFile:\...@line");    # We need 6 arguments per line
    my $host = $line[0];
    my $proc = $line[1];
    $wmin = ($line[2] =~ /[0-9]+/ ? $line[2] : 1);    # Get an actual number for the mins/maxes
    $wmax = ($line[3] =~ /[0-9]+/ ? $line[3] : 1000);
    $cmin = ($line[4] =~ /[0-9]+/ ? $line[4] : 1);
    $cmax = ($line[5] =~ /[0-9]+/ ? $line[5] : 1000);
    if ($host eq "localhost" || $host eq $ENV{'HOSTNAME'}) {    # Now, check each process
        my $procStr;
        if ($proc =~ /^.*$/) {    # How does our process string look?
            $procStr = $proc;    # For some reason this works, but the other way doesn't
        } else {
            $procStr = '$proc( |\$)+';
        }
        my @OUT = `$PS | $GREP -e $procStr | $GREP -v grep`;
        my $count = $#OUT + 1;
        if ($count < $cmin) {    # Be more specific for crit procs
            $STATE = "CRITICAL";
            $RETSTRING .= "\n$proc, too few processes: $count";
        } elsif ($count > $cmax) {
            $STATE = "CRITICAL";
            $RETSTRING .= "\n$proc, too many processes: $count";
        } elsif ($count < $wmin || $count > $wmax) {
            if ( $STATE ne "CRITICAL" ) { $STATE = "WARNING"; }
            $RETSTRING .= "\n$proc exceeded warn threshold: $count";
        } else {
            $RETSTRING .= "\n$proc ok: $count";
        }
    }
}

if ( $RETSTRING eq "" ) {
    $RETSTRING = "\nAll monitored processes OK" . $RETSTRING;
}

print "$PROGNAME $STATE $RETSTRING\n";
exit $RETVALUE{$STATE};

#
# SUBROUTINE DEFINITIONS
#
sub fail($) {
    print "Error: @_\n";
    print_usage();
    exit 3;
}

sub print_usage {
    print "Usage: $PROGNAME -c config\n";
    return;
}

sub print_help {
    print "$PROGNAME, Revision: 1.0\n";
    print "\n";
    print_usage();
    print "-c, --config=config file Configuration file. Processes should be in the format: process;min;max ";
    print "\n";
    return;
}

-- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
Re: [Nagios-users] Additional Info in email?
Worked perfectly, thanks! Only now I have to update the BUFFER lines in NRPE's include/common.h and recompile it for the clients...

A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Marc Powell wrote:
On Mar 31, 2009, at 12:52 PM, Andrew Davis wrote:
Additional Info: PROCS WARNING What I need is to see which processes are down in the Additional Info: field when it's emailed. Anyone know how I would go about this? Unfortunately, I'm not a Perl guy... I've included the script below...

I suspect you need to add the $LONGSERVICEOUTPUT$ macro after the $SERVICEOUTPUT$ macro in your notification command -- http://nagios.sourceforge.net/docs/3_0/macrolist.html#serviceoutput
-- Marc
[Nagios-users] Common messages file strings?
I'm playing with the check_log and check_logs.pl tools and am curious: do any of you have a list of strings or regular expressions that you commonly look for? Should I be looking for warn|WARN|error|ERROR, etc.? Perhaps someone has a list of common warnings I should be looking for? I'll be monitoring both Linux and Solaris, so /var/log/messages, /var/adm/messages, /var/log/secure, /var/log/wtmp, /var/log/utmp, etc...

-- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
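As a starting point only, a case-insensitive extended regex saves listing both warn and WARN. The sketch below counts matching lines in a sample file. The pattern, the count_matches helper, and the sample log lines are all illustrative assumptions, not a vetted list of what to alert on.

```shell
#!/bin/sh
# count_matches PATTERN FILE: prints how many lines of FILE match the
# extended regex PATTERN, ignoring case. Hypothetical helper name.
count_matches() {
    grep -E -i -c -- "$1" "$2"
}

# Made-up sample lines standing in for /var/log/messages.
sample_log=$(mktemp)
trap 'rm -f "$sample_log"' EXIT
cat >"$sample_log" <<'EOF'
Apr  1 18:05:52 atum nrpe[1412]: Handling the connection...
Apr  1 18:06:10 atum kernel: EXT3-fs error (device sda1): sample line
Apr  1 18:07:01 atum sshd[2001]: Failed password for invalid user test from 10.1.1.99
Apr  1 18:08:15 atum smartd[900]: Warning: sample line
EOF

# One case-insensitive pattern covers warn/WARN/Warning, error/ERROR, and so on.
count_matches 'warn|error|fail|crit' "$sample_log"
```

Note that wtmp and utmp are binary record files rather than text logs, so a pattern match like this only makes sense for the messages and secure files; the login records are normally read with tools such as last or who.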
Re: [Nagios-users] Additional Info in email?
This seems to have worked for the WARNING or CRITICAL notifications, but not for recovery emails. If I grep for Additional Info in my checkcommands.cfg, it only matches the notify-service-by-email command definition, yet in recovery emails I still see the Additional Info line at the bottom, but no content. Any ideas on how I would get the $LONGSERVICEOUTPUT$ in a recovery email as well?

A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Marc Powell wrote:
On Mar 31, 2009, at 12:52 PM, Andrew Davis wrote:
Additional Info: PROCS WARNING What I need is to see which processes are down in the Additional Info: field when it's emailed. Anyone know how I would go about this? Unfortunately, I'm not a Perl guy... I've included the script below...

I suspect you need to add the $LONGSERVICEOUTPUT$ macro after the $SERVICEOUTPUT$ macro in your notification command -- http://nagios.sourceforge.net/docs/3_0/macrolist.html#serviceoutput
-- Marc
[Nagios-users] Integrate top into output
I migrated from BB to Nagios. One of the things I used to do in BB was to show the output of top on the HTML page for that host. I'm not seeing how to do something similar in Nagios. Is this possible, and does anyone have any suggestions on how I would go about it?

-- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
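One possible approach, sketched under the assumption that Nagios 3 displays multi-line plugin output on the service detail page: a small wrapper plugin that prints a one-line status followed by the first lines of a snapshot command. check_top_snapshot is a hypothetical name, and the top flags in the usage note are those of the Linux procps top.

```shell
#!/bin/sh
# check_top_snapshot MAX_LINES COMMAND [ARGS...]
# Hypothetical wrapper: prints a one-line status, then up to MAX_LINES
# lines of COMMAND's output as the long (multi-line) plugin output.
check_top_snapshot() {
    max_lines=$1
    shift
    snapshot=$("$@" 2>/dev/null | head -n "$max_lines")
    if [ -z "$snapshot" ]; then
        echo "TOP UNKNOWN - snapshot command produced no output"
        return 3
    fi
    echo "TOP OK - snapshot captured"
    printf '%s\n' "$snapshot"
    return 0
}
```

On a Linux client this might be called as check_top_snapshot 15 top -b -n 1 (batch mode, one iteration). If the check runs through NRPE, the packet buffer size limits how much of the snapshot arrives, so a small line count is the safer choice.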
Re: [Nagios-users] check_ntp_peer unreliable on macs
I encountered this too. My solution was a script in /usr/local/scripts called by cron that stops the NTP daemon, runs ntpdate local_server twice, then restarts the NTP daemon. This runs in cron every 2 hours and seems to keep things in sync... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan Peter Doherty wrote: On Mar 13, 2009, at 5:25 PM, Keith Erekson wrote: I found this in my mailing list archives, while looking for information about check_ntp_peer. As far as I can tell, nobody ever answered you... I was just looking into this exact problem. If you check the verbose output, you will probably see something like this: 0 candiate peers available warning: no synchronization source found warning: LI_ALARM bit is set I do get valid output from ntpq -p hostname, however. Apparently, the problems with OS X's NTP are well-known and documented. For example, http://knol.google.com/k/dirk-h-schulz/time-synchronization-ntp-on-mac-os-x/2bcee0ik2900p/18# http://support.ntp.org/bin/view/Support/KnownOsIssues#Section_9.2.5 As a way around this, I thought I would just use check_ntp_time, to compare the xserve's clock against that of the nagios box. However, no luck there either: sending request to peer 0 response from peer 0: offset -0.9300264975 sending request to peer 0 response from peer 0: offset -0.9299369976 sending request to peer 0 response from peer 0: offset -0.9299154976 sending request to peer 0 response from peer 0: offset -0.9298709977 discarding peer 0: stratum=0 overall average offset: 0 NTP CRITICAL: Offset unknown| It seems that OS X is responding as a stratum 0 server, which is a no-no. Also, while fiddling with check_ntp_peer, I noticed that it doesn't seem to accept a port (-p or --port), as the help output suggests it should be able to. Am I crazy? 
-Keith

Yeah, after a little more diagnostic work I eventually concluded that OS X's implementation of NTP is just broken. It seems to be in sync for a while, then it just forgets it for a while, and eventually, maybe, it'll sync up again. Maybe they'll fix that for 10.6 this year.
--Peter
Re: [Nagios-users] NRPE vs. check_by_ssh
Charlie Reddington wrote:
On Mar 25, 2009, at 2:30 PM, RijilV wrote:
2009/3/24 Christopher McAtackney crist...@gmail.com:
Hi all, I was wondering if someone could give a brief overview of the pros / cons of using NRPE to monitor my remote hosts versus using the check_by_ssh command? I'm aware that check_by_ssh increases the CPU overhead, but I'm not clear on the level of impact here - does this increase the load on the monitoring machine in direct relation to the number of hosts being monitored? For example, if I was using check_by_ssh to monitor, say, 2000 services spread across 200 hosts, would I experience significant slowdown on my monitoring machine? Cheers for any info, Chris

SSH is going to slow it down on both sides of the communication. SSH does quite a bit more in terms of setting up the connection, which involves using asymmetric encryption to set up a shared secret for symmetric encryption and verifying keys for the asymmetric part, verifying access, and allocating a session. Whereas NRPE even with encryption just does a simple pre-shared secret for the symmetric encryption, much faster even if using the same encryption algorithm.

One thing you could do with SSH to speed it up (and I would argue make it faster than NRPE depending on the stability of your network) would be to use ControlMaster. ControlMaster is an SSH v2 feature, where you create a connection and can open up multiple sessions with that ControlMaster for other SSH processes. This saves you not only the key-exchange heavy lifting but also you're not opening up a new socket on the remote host. In order to really make it worth it you'd have to spawn a process that was continuously connected. I wrote an ugly check_by_ssh that would spawn a ControlMaster if one didn't exist and use it if it did. Reduced the load/latency quite a bit for SSH checks.
Though if I had to do it again I'd use 'ControlMaster auto' (man 5 ssh_config) and create a separate check that was responsible for maintaining the ControlMaster; then you could use the stock check_by_ssh without any modifications. That all being said, you might want to think about a distributed setup anyhow, if nothing more than for redundancy. 200 servers and 2,000 checks is a lot of responsibility for a singleton; you could break it 50/50 between two servers that could take over for the other one if it fails. .r'

+1 on the control master. We have about 1000 checks over 300 hosts and using control master made the box much more stable and quite frankly usable. Saved a lot of plugin timeouts as well. Think about 1000 checks every 5 or 10 minutes. That's 1000 encrypted tunnels that are going up and down. That's a lot of overhead for a quick check, let alone if your server is checking say 5 or 10 things back to back. http://www.torchbox.com/blog/ssh_tips_2.html Charlie

FWIW: I use both. I have about 400 internal servers that are considered to be trusted. I have another 50 or so that are outside our network (DMZ'd) and untrusted. To keep overhead low, I use NRPE on the internal hosts and check_by_ssh for the externals. Internally, using NRPE gives me greater flexibility in adjusting client thresholds (mounts to watch, varying memory ranges depending on how much is installed, etc).
check_by_ssh gives me a secured, authenticated way of checking systems externally (basic sshd_config setup to restrict ssh to the nagios user and specific IPs only). I'm unwilling to use NRPE on an external, untrusted server, but don't want the overhead of encryption for internal, trusted systems...

A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
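To put rough numbers on the connection overhead discussed in this thread, here is a back-of-the-envelope sketch. It assumes one fresh SSH session per check when no ControlMaster is used; handshakes_per_hour is a hypothetical helper, and the figures are the "1000 checks every 5 or 10 minutes" quoted earlier.

```shell
#!/bin/sh
# handshakes_per_hour CHECKS INTERVAL_MINUTES
# Hypothetical helper: new SSH sessions per hour if every check opens
# its own connection (integer arithmetic).
handshakes_per_hour() {
    echo $(( $1 * 60 / $2 ))
}

handshakes_per_hour 1000 5     # 1000 checks on a 5-minute interval
handshakes_per_hour 1000 10    # 1000 checks on a 10-minute interval
```

With a shared master connection per host, the number of full key exchanges drops to roughly one per monitored host each time a master is (re)established, instead of one per check. In ssh_config terms that means the ControlMaster and ControlPath options mentioned earlier in the thread.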
Re: [Nagios-users] Stuck on NRPE for OS X Server
My /etc/xinetd.d/nrpe is below: # /etc/xinetd.d/nrpe # description: NRPE # default: on service nrpe { flags = REUSE socket_type = stream port= 5666 wait= no #user= nobody user= daemon #group = nobody group = wheel server = /usr/local/sbin/nrpe server_args = -c /etc/nagios/nrpe.cfg --inetd log_on_failure += USERID disable = no only_from = 127.0.0.1 10.1.1.170 } Originally, it was set to nobody:nobody. As a test, I set it to daemon:wheel. In all cases, it gives the cannot run as root error. I guess I can try making a Nagios user group and testing with that. A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan Allan Clark wrote: Reply is bottom-posted. On Wed, Mar 18, 2009 at 16:57, Andrew Davis ncc...@gmail.com mailto:ncc...@gmail.com wrote: If I'm reading this correctly, the line about NRPE daemon cannot be run as user/group root! is directly from the source code of NRPE. Its not an xinetd thing. I've confirmed that xinetd is running and listening on port 5666. I tried changing the owner/group from nobody:nobody to another unprivileged user, but it didn't work. Same results. It appears that despite my configuring the /etc/nagios/nrpe.cfg and the /etc/xinetd.d/nrpe files to use a user other than root, it still tries to start it as the root user and thus when an incoming connection comes in, it gives the NRPE daemon cannot be run as user/group root! error. Any thoughts on how to rectify this? Since NRPE is working fine on Linux, is this just a Mac OS X thing? Any help would be immensely appreciated. AD Andrew Davis wrote: FYI: /var/log/system.log on the client shows: Mar 18 16:08:07 shu xinetd[29066]: START: nrpe pid=557 from=10.1.1.170 Mar 18 16:08:07 shu nrpe[557]: Error: NRPE daemon cannot be run as user/group root! whether I do the default test (with SSL) or use the -n flag to test w/o SSL. 
The odd thing is that the nrpe config in /etc/xinetd.d is set to run as nobody:nobody and /etc/nagios/nrpe.cfg is owned by nobody:nobody. Only /usr/local/sbin/nrpe is owned by root (as it should be), but is also set to 755 perms. I've compared to a Linux box I have with NRPE and xinetd working properly and the permissions are identical. I'm stumped...

Andrew Davis wrote:
I have two Mac OS X servers, one running 10.3, the other running 10.4. Neither can be upgraded to 10.5 due to third party s/w constraints. Both are PPC-based Xserves. Trying to compile nrpe with:

./configure --sysconfdir=/etc/nagios --enable-ssl

Initially, I got the cannot find ssl libraries error:

~
checking for SSL headers... SSL headers found in /usr/local/ssl
checking for SSL libraries... configure: error: Cannot find ssl libraries

I downloaded the latest openssl and built it with:

./config --prefix=/usr/local shared --openssldir=/usr/local/openssl
make
make test
make install

I then had to edit ~/src/nrpe/configure and change the reference from libssl.so to libssl.dylib. After that, nrpe compiled cleanly and I was able to move ~/src/nrpe/src/nrpe to /usr/local/sbin and start xinetd up. I've confirmed that port 5666 is open and xinetd is running:

/usr/local/src/nrpe-2.12/src root# ps waux|grep xinet|grep -v grep
root 29066 0.0 -0.027484308 ?? Ss 3:53PM 0:00.02 /usr/sbin/xinetd -pidfile /var/run/xinetd.pid -stayalive
/usr/local/src/nrpe-2.12/src root# netstat -an|grep 5666
tcp4 0 0 *.5666 *.* LISTEN

However, when connecting from the remote server, I get:

/usr/local/nagios/libexec/check_nrpe -H host.mydomain.org
CHECK_NRPE: Error - Could not complete SSL handshake.

The same test but w/o SSL yields:

[nag...@nephilim src]$ /usr/local/nagios/libexec/check_nrpe -n -H host.mydomain.org
CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages.

So two questions: 1) I'm a UNIX guy, but obviously Macs are A) different and B) a tad different being BSD-based. So what's the proper way to stop/restart the xinetd daemon? 2) Any thoughts on the SSL handshake error? I've googled it, but I'm not getting very far. Anyone have a step-by-step for compiling Nagios plugins and NRPE from source on OS X 10.x (specifically 10.3 and 10.4)? I'm using NRPE for all other internal hosts, so I prefer to use
Re: [Nagios-users] Stuck on NRPE for OS X Server
# if the daemon was configured with the --enable-command-args configure script # option. # # *** ENABLING THIS OPTION IS A SECURITY RISK! *** # Read the SECURITY file for information on some of the security implications # of enabling this variable. # # Values: 0=do not allow arguments, 1=allow command arguments dont_blame_nrpe=0 # COMMAND PREFIX # This option allows you to prefix all commands with a user-defined string. # A space is automatically added between the specified prefix string and the # command line from the command definition. # # *** THIS EXAMPLE MAY POSE A POTENTIAL SECURITY RISK, SO USE WITH CAUTION! *** # Usage scenario: # Execute restricted commmands using sudo. For this to work, you need to add # the nagios user to your /etc/sudoers. An example entry for alllowing # execution of the plugins from might be: # # nagios ALL=(ALL) NOPASSWD: /usr/lib/nagios/plugins/ # # This lets the nagios user run all commands in that directory (and only them) # without asking for a password. If you do this, make sure you don't give # random users write access to that directory or its contents! # command_prefix=/usr/bin/sudo # DEBUGGING OPTION # This option determines whether or not debugging messages are logged to the # syslog facility. # Values: 0=debugging off, 1=debugging on debug=0 # COMMAND TIMEOUT # This specifies the maximum number of seconds that the NRPE daemon will # allow plugins to finish executing before killing them off. command_timeout=60 # CONNECTION TIMEOUT # This specifies the maximum number of seconds that the NRPE daemon will # wait for a connection to be established before exiting. This is sometimes # seen where a network problem stops the SSL being established even though # all network sessions are connected. This causes the nrpe daemons to # accumulate, eating system resources. Do not set this too low. 
connection_timeout=300 # WEEK RANDOM SEED OPTION # This directive allows you to use SSL even if your system does not have # a /dev/random or /dev/urandom (on purpose or because the necessary patches # were not applied). The random number generator will be seeded from a file # which is either a file pointed to by the environment valiable $RANDFILE # or $HOME/.rnd. If neither exists, the pseudo random number generator will # be initialized and a warning will be issued. # Values: 0=only seed from /dev/[u]random, 1=also seed from weak randomness #allow_weak_random_seed=1 # INCLUDE CONFIG FILE # This directive allows you to include definitions from an external config file. #include=somefile.cfg # INCLUDE CONFIG DIRECTORY # This directive allows you to include definitions from config files (with a # .cfg extension) in one or more directories (with recursion). #include_dir=somedirectory #include_dir=someotherdirectory # COMMAND DEFINITIONS # Command definitions that this daemon will run. Definitions # are in the following format: # # command[command_name]=command_line # # When the daemon receives a request to return the results of command_name # it will execute the command specified by the command_line argument. # # Unlike Nagios, the command line cannot contain macros - it must be # typed exactly as it should be executed. # # Note: Any plugins that are used in the command lines must reside # on the machine that this daemon is running on! The examples below # assume that you have plugins installed in a /usr/local/nagios/libexec # directory. Also note that you will have to modify the definitions below # to match the argument format the plugins expect. Remember, these are # examples only! # The following examples use hardcoded command arguments... 
command[check_disks]=/usr/local/nagios/libexec/check_disk -m -e -w 10% -c 5%
command[check_load]=/usr/local/nagios/libexec/check_load -r -w 5.0 -c 10.0
command[check_memory]=/usr/local/nagios/libexec/check_memory.pl -w 10% -c 5%
command[check_swap]=/usr/local/nagios/libexec/check_swap -a -w 50% -c 20%
command[check_ntp]=/usr/local/nagios/libexec/check_ntp_time -H 10.1.1.14 -w 1.0 -c 1.5
command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200
command[check_users]=/usr/local/nagios/libexec/check_users -w 25 -c 50

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Andrew Davis wrote:

My /etc/xinetd.d/nrpe is below:

# /etc/xinetd.d/nrpe
# description: NRPE
# default: on
service nrpe
{
    flags = REUSE
    socket_type = stream
    port
[Nagios-users] Memory test for OS X
Anyone know of a native memory test (preferably with statistical info) for OS X? The few I've tried either look for free (part of procps which doesn't compile on OS X) or vmstat (not vm_stat) geared for Solaris. I'd prefer something perhaps in perl that calls vm_stat or greps from top. Anyone aware of any? -- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
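One possible direction for the question above, sketched in Python rather than perl: read the output of vm_stat and report the share of free pages using the usual Nagios plugin exit codes. This is only a sketch under stated assumptions. The function names, the default thresholds (borrowed from the 10%/5% style used elsewhere in these threads), and the choice to count free + active + inactive + wired pages as the total are all illustrative, not an existing plugin.

```python
import re
import subprocess

# Nagios plugin exit codes (standard plugin convention).
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Counters summed to approximate total memory (a simplifying assumption).
TOTAL_KEYS = ("Pages free", "Pages active", "Pages inactive", "Pages wired down")


def parse_vm_stat(text):
    """Return (page_size_bytes, {counter_name: pages}) parsed from vm_stat output."""
    size = re.search(r"page size of (\d+) bytes", text)
    page_size = int(size.group(1)) if size else 4096  # assumed fallback
    pages = {}
    for line in text.splitlines():
        # Matches lines such as "Pages free:      2000."
        m = re.match(r'^"?([^":]+)"?:\s+(\d+)\.?\s*$', line)
        if m:
            pages[m.group(1).strip()] = int(m.group(2))
    return page_size, pages


def check_free_memory(text, warn_pct=10.0, crit_pct=5.0):
    """Return (exit_code, message) for the given vm_stat output."""
    page_size, pages = parse_vm_stat(text)
    total = sum(pages.get(key, 0) for key in TOTAL_KEYS)
    if total == 0:
        return UNKNOWN, "MEMORY UNKNOWN - could not parse vm_stat output"
    free = pages.get("Pages free", 0)
    free_pct = 100.0 * free / total
    free_mb = free * page_size // (1024 * 1024)
    if free_pct <= crit_pct:
        code, label = CRITICAL, "CRITICAL"
    elif free_pct <= warn_pct:
        code, label = WARNING, "WARNING"
    else:
        code, label = OK, "OK"
    # Text after the pipe is performance data for graphing tools.
    message = "MEMORY %s - %.1f%% free (%d MB) | free_pct=%.1f%%" % (
        label, free_pct, free_mb, free_pct)
    return code, message


def run_check():
    """Run vm_stat on the local machine (OS X only) and evaluate it."""
    output = subprocess.run(["vm_stat"], capture_output=True, text=True,
                            check=True).stdout
    return check_free_memory(output)
```

A wrapper script would call run_check(), print the message, and exit with the returned code. Whether inactive pages should count as available is a policy choice left open here.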
Re: [Nagios-users] Stuck on NRPE for OS X Server
Thanks much.

FYI: I know for sure that xinetd was doing it, because if I set /etc/xinetd.d/nrpe to disable=yes, then did a kill -HUP xinetd_proc_id, then a netstat -an|grep 5666, it would no longer be listening. Further, /var/log/system.log showed one service removed for xinetd. When I reversed this process, it was again listening on 5666 and system.log showed one service added for xinetd.

That said, I gave up and just did check_by_ssh for these servers. The plugins all built cleanly. Only NRPE had issues.

A. Davis
Email: ncc...@gmail.com

Allan Clark wrote:

On Thu, Mar 19, 2009 at 10:57, Andrew Davis ncc...@gmail.com wrote:

One person suggested my openssl version might be too new (0.9.8). I just removed it and installed 0.9.7i, an old enough version to be safe and one that I know another user has in a working configuration. After compiling it, I then recompiled NRPE against it and copied the files in place. It still fails with the same error. /var/log/system.log shows:

Mar 19 10:45:17 seth xinetd[26057]: Started working: 1 available service
Mar 19 10:45:25 seth nrpe[26064]: Error: NRPE daemon cannot be run as user/group root!

I had it set to run as nobody:nobody, but that wasn’t working. I even tried setting it to run as daemon:wheel, but got the same results. Finally, I created a nagios user and configured /etc/xinetd.d/nrpe to run as nagios:nagios and updated /etc/nagios/nrpe.cfg to use the same. However, all remote tests still result in the following:

From the server:

[nag...@nagios ~]$ /usr/local/nagios/libexec/check_nrpe -H seth
CHECK_NRPE: Error - Could not complete SSL handshake.

From the client:

Mar 19 10:45:17 seth xinetd[26057]: Started working: 1 available service
Mar 19 10:45:25 seth nrpe[26064]: Error: NRPE daemon cannot be run as user/group root!

Scouring Google shows that the “cannot be run as ... root” error is in the nrpe.c code. What I can’t figure out is why it’s trying to run as root instead of the configured user...

Anyone running NRPE with xinetd for Macs? I’m frustrated enough that I almost just want to use check_by_ssh, but I’d prefer to get this working and keep things consistent (i.e. with NRPE). My /etc/nagios/nrpe.cfg and /etc/xinetd.d/nrpe are below:

seth:/etc/xinetd.d root# pwd
/etc/xinetd.d
seth:/etc/xinetd.d root# cat nrpe
# /etc/xinetd.d/nrpe
# description: NRPE
# default: on
service nrpe
{
    flags = REUSE
    socket_type = stream
    port = 5666
    wait = no
    user = nagios
    group = nagios
    server = /usr/local/sbin/nrpe
    server_args = -c /etc/nagios/nrpe.cfg --inetd
    log_on_failure += USERID
    disable = no
    only_from = 127.0.0.1 10.1.1.170
}

Hi Andrew;

I'm not convinced xinetd is running nrpe for you. As a simple test, try changing the port number from 5666 in /etc/xinetd.d/nrpe, but leave it as 5666 in nrpe.cfg, and see if you can connect on the old or new port, just to ensure that the port is serviced as a hand-off from xinetd. (5666 or 5556?)

Normally I'd confirm this with a sudo netstat -pant, but I don't know the equivalent on MacOSX, so I'm suggesting quick molestation for proof, even though I see the only 127.0.0.1 setting in nrpe.cfg.

You might want to run xinetd with the -d option for debugging spam; it also doesn't background the process, so run it on a different terminal. Looking for confirmation that xinetd is changing user after accept()/fork().

Allan
--
all...@chickenandporn.com
金鱼 http://linkedin.com/in/goldfish
please, no proprietary attachments (http://tinyurl.com/cbgq)
Sent from: New York NY United States.
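The port test suggested in this thread (move the port in /etc/xinetd.d/nrpe, leave nrpe.cfg alone, and see which port answers) can be run from the Nagios server with a small TCP probe. The sketch below is illustrative only: the function names are made up for this example, and a successful connection shows only that something accepted the connection on that port, not which process did or which user it runs as.

```python
import socket


def port_answers(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def probe_ports(host, ports, timeout=3.0):
    """Map each port to whether it accepted a connection."""
    return {port: port_answers(host, port, timeout) for port in ports}
```

For example, probe_ports("seth", [5666, 5667]) would show which of the two ports accepts connections after the xinetd change; 5667 is a hypothetical alternate port chosen for illustration, and "seth" is the client host named in the thread.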
[Nagios-users] Nagios plugins compile errors on OS X 10.3 Server
I'm having some issues compiling Nagios (natively, not through MacPorts, etc) on two Mac servers. One is running 10.3, the other 10.4. Both are PPC. Neither can be updated to 10.5 due to third party s/w constraints.

On the 10.3 server, after running ./configure it seems to end okay, but I see the following when running make all:

configure: creating ./config.status
config.status: creating gl/Makefile
config.status: creating Makefile
config.status: creating lib/Makefile
config.status: creating plugins/Makefile
config.status: creating lib/tests/Makefile
config.status: creating plugins-root/Makefile
config.status: creating plugins-scripts/Makefile
config.status: creating plugins-scripts/subst
config.status: creating plugins-scripts/utils.pm
config.status: creating plugins-scripts/utils.sh
config.status: creating perlmods/Makefile
config.status: creating command.cfg
config.status: creating test.pl
config.status: creating pkg/solaris/pkginfo
config.status: creating po/Makefile.in
config.status: creating config.h
config.status: config.h is unchanged
config.status: executing depfiles commands
config.status: executing po-directories commands
config.status: creating po/POTFILES
config.status: creating po/Makefile
--with-apt-get-command: /sw/bin/apt-get
--with-ping6-command: /sbin/ping6 -n -c %d %s
--with-ping-command: /sbin/ping -n -c %d %s
--with-ipv6: yes
--with-mysql: no
--with-openssl: yes
--with-gnutls: no
--enable-extra-opts: no
--with-perl: /usr/bin/perl
--enable-perl-modules: no
--with-cgiurl: /nagios/cgi-bin
--with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin

/usr/local/src/nagios-plugins-1.4.13 root# make all
cd . /bin/sh /usr/local/src/nagios-plugins-1.4.13/build-aux/missing --run autoheader
configure.in:3: error: Autoconf version 2.59 or higher is required
configure.in:3: the top level
autom4te: /usr/bin/gm4 failed with exit status: 1
autoheader: /usr/bin/autom4te failed with exit status: 1
make: *** [config.h.in] Error 1

Obviously it would seem that I might need a newer version of Autoconf? Any thoughts on a workaround or where I can get autoconf 2.59 or higher for OS X 10.3 Server?

--
A. Davis
Email: ncc...@gmail.com
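Before rebuilding, it can help to confirm which autoconf is first on the PATH and whether it meets the 2.59 minimum named in the error. The sketch below assumes the first line of `autoconf --version` contains a version of the form major.minor (for example "autoconf (GNU Autoconf) 2.59"); the function names and the sample strings in the usage note are illustrative assumptions.

```python
import re
import shutil
import subprocess


def parse_autoconf_version(version_output):
    """Extract (major, minor) from the first line of `autoconf --version`."""
    lines = version_output.splitlines()
    first_line = lines[0] if lines else ""
    m = re.search(r"(\d+)\.(\d+)", first_line)
    if not m:
        return None
    return int(m.group(1)), int(m.group(2))


def meets_minimum(version_output, minimum=(2, 59)):
    """True if the reported version is at least `minimum`."""
    version = parse_autoconf_version(version_output)
    return version is not None and version >= minimum


def installed_autoconf():
    """Return (path, version_output) for the autoconf found first on PATH."""
    path = shutil.which("autoconf")
    if path is None:
        return None, ""
    output = subprocess.run([path, "--version"], capture_output=True,
                            text=True).stdout
    return path, output
```

With a hypothetical older tool, meets_minimum("autoconf (GNU Autoconf) 2.57") is False, while a string reporting 2.61 passes. The path returned by installed_autoconf() also shows whether a copy under /usr/local/bin is being picked up ahead of the system one.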
Re: [Nagios-users] Nagios plugins compile errors on OS X 10.3 Server
Disregard. I pulled down m4 and automake from the GNU site, compiled them and put /usr/local/bin into my path and all is fine.

A. Davis
Email: ncc...@gmail.com
[Nagios-users] Stuck on NRPE for OS X Server
I have two Mac OS X servers, one running 10.3, the other running 10.4. Neither can be upgraded to 10.5 due to third party s/w constraints. Both are PPC-based Xserves. Trying to compile nrpe with:

./configure --sysconfdir=/etc/nagios --enable-ssl

Initially, I got the "cannot find ssl libraries" error:

checking for SSL headers... SSL headers found in /usr/local/ssl
checking for SSL libraries... configure: error: Cannot find ssl libraries

I downloaded the latest openssl and built it with:

./config --prefix=/usr/local shared --openssldir=/usr/local/openssl
make
make test
make install

I then had to edit ~/src/nrpe/configure and change the reference from libssl.so to libssl.dylib.

After that, nrpe compiled cleanly and I was able to move ~/src/nrpe/src/nrpe to /usr/local/sbin and start xinetd up. I've confirmed that port 5666 is open and xinetd is running:

/usr/local/src/nrpe-2.12/src root# ps waux|grep xinet|grep -v grep
root 29066 0.0 -0.027484308 ?? Ss 3:53PM 0:00.02 /usr/sbin/xinetd -pidfile /var/run/xinetd.pid -stayalive
/usr/local/src/nrpe-2.12/src root# netstat -an|grep 5666
tcp4 0 0 *.5666 *.* LISTEN

However, when connecting from the remote server, I get:

/usr/local/nagios/libexec/check_nrpe -H host.mydomain.org
CHECK_NRPE: Error - Could not complete SSL handshake.

The same test but w/o SSL yields:

[nag...@nephilim src]$ /usr/local/nagios/libexec/check_nrpe -n -H host.mydomain.org
CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages.

So two questions:

1) I'm a UNIX guy, but obviously Macs are A) different and B) a tad different being BSD-based. So what's the proper way to stop/restart the xinetd daemon?
2) Any thoughts on the SSL handshake error? I've googled it, but I'm not getting very far.

Anyone have a step-by-step for compiling nagios plugins and NRPE from source on OS X 10.x (specifically 10.3 and 10.4)? I'm using NRPE for all other internal hosts, so I prefer to use it for the Macs too. I know I could do it via check_by_ssh and get around this, but I prefer to use NRPE if I can.

--
A. Davis
Email: ncc...@gmail.com
Re: [Nagios-users] Stuck on NRPE for OS X Server
FYI: /var/log/system.log on the client shows:

Mar 18 16:08:07 shu xinetd[29066]: START: nrpe pid=557 from=10.1.1.170
Mar 18 16:08:07 shu nrpe[557]: Error: NRPE daemon cannot be run as user/group root!

whether I do the default test (with SSL) or use the -n flag to test w/o SSL. The odd thing is that the nrpe config in /etc/xinetd.d is set to run as nobody:nobody and /etc/nagios/nrpe.cfg is owned by nobody:nobody. Only /usr/local/sbin/nrpe is owned by root (as it should be), but is also set to 755 perms. I've compared to a Linux box I have with NRPE and xinetd working properly and the permissions are identical. I'm stumped...
Re: [Nagios-users] Stuck on NRPE for OS X Server
If I'm reading this correctly, the line about "NRPE daemon cannot be run as user/group root!" is directly from the source code of NRPE. It's not an xinetd thing. I've confirmed that xinetd is running and listening on port 5666. I tried changing the owner/group from nobody:nobody to another unprivileged user, but it didn't work. Same results.

It appears that despite my configuring the /etc/nagios/nrpe.cfg and the /etc/xinetd.d/nrpe files to use a user other than root, it still tries to start it as the root user, and thus when an incoming connection comes in, it gives the "NRPE daemon cannot be run as user/group root!" error.

Any thoughts on how to rectify this? Since NRPE is working fine on Linux, is this just a Mac OS X thing? Any help would be immensely appreciated.

AD
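One reading of the error text "cannot be run as user/group root!" is that the daemon refuses to continue when either its effective user ID or its effective group ID is 0. That is an assumption about nrpe.c, not something confirmed in this thread. Under that assumption, a non-root user paired with a gid-0 group would still be rejected, which would be consistent with the daemon:wheel attempt failing on systems where wheel is gid 0. The sketch below mirrors that kind of check; the function names are hypothetical.

```python
import os


def would_be_rejected(uid, gid):
    """Mirror of the assumed check: reject if EITHER id is 0 (root)."""
    return uid == 0 or gid == 0


def current_effective_ids():
    """Effective uid and gid of the calling process (Unix only)."""
    return os.geteuid(), os.getegid()
```

Running the same two calls from a tiny script launched by xinetd in place of nrpe (with the same user and group lines) would show which IDs the service actually starts with, which is the open question in the messages above.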
Re: [Nagios-users] Test SMS
Well, you could set up SMS notifications, then bring down a service you monitor. That's always a simple one. Add http to a server you don't need it on, monitor it, then shut down the httpd process and wait for alerts...

A. Davis
Email: ncc...@gmail.com

Kaushal Shriyan wrote:

Hi, is there a way to send a test message as SMS from the Nagios server?

Thanks and Regards
Kaushal
Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios
Like any other test... I do a simple http test to msn.com like you'd test any website with the standard plugin. I do a ping test to yahoo.com since they still allow pings (unlike many companies). And I do a dig to google.com. Those three are each defined as a host, then put in a hostgroup called sanity check. If one test fails, I assume there's an issue on their end and ignore it. If two fail, same thing. If all three fail, I know my ISP connection is down. The contactgroup for the hostgroup for these hosts goes to SMS (mine and the mgr at our ISP).

A. Davis
Email: ncc...@gmail.com

Mirza Dedic wrote:

Hi Andrew,

How are you configuring your sanity checks in Nagios?

Thanks.

*From:* Andrew Davis [mailto:ncc...@gmail.com]
*Sent:* March/11/2009 9:39 AM
*To:* Martyn
*Cc:* 'Onotsky, Steve x55328'; nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios

A combination of parents, hostgroups, and escalation. I'll post my configs later, but once you have confirmed that SMS is set up and working (make sure that for every email alert, you also get an SMS alert), you define your network topology and put the hosts into hostgroups. In my case, I have a hostgroup of network-core which includes my core routers, firewalls, Exchange server, Exchange storage, and the switch connecting the Exchange system with the firewall. I then have a contactgroup of the SMS addresses of three key people (myself included). If any of those devices fail, Nagios sends notification via SMS. If anything else fails, it goes via email. The cool thing is I'll often sleep through emails, but if I get an SMS, I know something serious failed and I force myself to get up...

Additionally, I tend to define a sanity check: a ping to yahoo, an http check to MSN, and a non-caching dns query of Google. If all sanity checks fail and are not resolved within 4 hours, then through escalations, an SMS is sent to the manager of the support team at our ISP, as we likely have a bandwidth down issue. If sanity checks AND a core network device fail, my ISP doesn't get notified as we know it's us...

A. Davis
Email: ncc...@gmail.com

Martyn wrote:

Beat me to it with the same question

Martyn

*From:* Onotsky, Steve x55328 [mailto:steve.onot...@broadridge.com]
*Sent:* 11 March 2009 15:55
*To:* ncc...@gmail.com; nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios

I've been messing around with this, off and on (as time permits). How did you set up your config to go to SMS only if email fails?

Thanks

Steve Onotsky
Server Support Technologist
Broadridge Investor Communication Solutions, Canada
5970 Chedworth Way
Mississauga ON L5R 4G5
Tel: (905) 507-5328
Fax: (905) 507-5312
Inet: steve.onot...@broadridge.com
/Quis custodiet ipsos custodes?/ - Juvenal, /Satires/, VI, 347

*From:* Andrew Davis [mailto:ncc...@gmail.com]
*Sent:* March 11, 2009 11:35
*To:* nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios

In Nagios 2nd Edition from Wolfgang Barth, he covers this in section 12.4.2. He covers smsclient, yasp, and smssend, but he points out what Jim pointed out... most are email to SMS conversion tools, so if email is down, you're still not notified. smsclient and a modem resolve this. I have it installed and working fine. Our normal notifications go via email, but if a network device or email server is down, then notifications go via SMS.

A. Davis
Email: ncc...@gmail.com

Jim Avery wrote:

2009/3/11 tsedendorj oyunbat t_oyun...@yahoo.com:

I'm new to Nagios. I work at a mobile cellular operator company, and I need to know whether there is any way to get notification alerts via SMS to my mobile phone when BSC or HLR equipment goes down or stops working. Is there any configuration for this in Nagios? And how do I connect the Nagios server to the SMS center?

Currently I use an external email-sms service similar to http://www.intellisms.co.uk so I can send emails from Nagios in the usual way
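The sanity-check rule described earlier in this thread can be written out as plain logic: the ISP link is presumed down only when every outside check fails, and the ISP is not notified when a core network device is down as well. The sketch below only illustrates that decision. In Nagios itself this is expressed with hostgroups, contactgroups, and escalations rather than code; the function and key names are hypothetical, and the 4-hour escalation delay is not modelled.

```python
def isp_link_presumed_down(sanity_results):
    """sanity_results maps a check name to True when that check passed.

    One or two failures are treated as a problem at the remote site;
    only a failure of every check points at the local ISP connection.
    """
    return bool(sanity_results) and not any(sanity_results.values())


def should_notify_isp(sanity_results, core_results):
    """Notify the ISP only if all sanity checks failed and the core is healthy."""
    core_failed = any(not ok for ok in core_results.values())
    return isp_link_presumed_down(sanity_results) and not core_failed
```

For example, with all three outside checks failing and every core device up, should_notify_isp returns True; if a core router is also down, it returns False because the fault is presumed local.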
Re: [Nagios-users] Mobile phone/non-WAP interface
My only desire is that it be able to log in to SSL-protected sites and handle any character in the password. Being able to cache the passwords would be a nice touch too.

A. Davis
Email: ncc...@gmail.com

D. Emmanuel Feinsmith wrote:

Hi Patrick,

I have submitted a native iPhone Nagios App to the iPhone App Store. They say it takes between 30 and 60 days to get on the store. It's written in Objective-C and gives very good user responsiveness, even with large installations. Way better than using Safari on the iPhone or WAP on other phones. Here are some screen shots of the beta version: http://www.shastasystems.com/nagiostouch.html

It works on any Nagios instance; no modifications are necessary to the server itself. I'll send a message to this list when it is up on the store.

Best Regards,
Daniel.

On Mar 10, 2009, at 2:46 PM, Patrick Morris wrote:

On Tue, 10 Mar 2009, Andrew Davis wrote:

I see there's a WAP version built-in to Nagios, but what about phones that don't do WAP (ie: the iPhone)? Is there a simplified interface for viewing from a mobile device? I found inagios on Google, but the code, documentation, and such looks sketchy. I'm looking for something a bit more solid. Ideas?

The WAP interface provided by nagios works fine on iPhones, as does the full standard web interface.
[Nagios-users] Error collecting CPU/Load with NSClient++
I added two Windows hosts today. On one system I'm getting trending graphs for disk, memory, and CPU. On the other, I'm only getting graphs for disk and memory... nothing for CPU. The local log on the client shows:

Date Time error:.\PDHCollector.cpp:264: Failed to get CPU value: Strange error buffer pointers are f*cked up.

Any thoughts?

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
Re: [Nagios-users] Error collecting CPU/Load with NSClient++
BTW: running it from the Nagios server shows:

nag...@nephilim libexec]$ ./check_nt -H 10.100.150.101 -p 12489 -v CPULOAD -l 5,85,95,15,80,95
CPU Load 2% (5 min average) 1% (15 min average) | '5 min avg Load'=2%;85;95;0;100 '15 min avg Load'=1%;80;95;0;100

So the daemon is running (obviously, or I wouldn't have other graphs) and I can poll it manually. The other system I mentioned is configured identically and is successfully graphing CPU trends...

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Andrew Davis wrote:
> I added two Windows hosts today and on one system am getting trending graphs for disks, memory, and CPU. However on the other, I'm only getting graphs for disk and memory... nothing for CPU. Looking at the local log on the client shows:
>
> Date Time error:.\PDHCollector.cpp:264: Failed to get CPU value: Strange error buffer pointers are f*cked up.
>
> Any thoughts?
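For anyone comparing the two hosts, one rough way to confirm that the part of the check_nt output after the pipe is well-formed performance data is to parse it the way a grapher would. The sketch below is only an illustration in Python, assuming the usual plugin convention of 'label'=value[UOM];warn;crit;min;max. The function name parse_perfdata is made up for this example and is not part of Nagios, PNP4Nagios, or NSClient++, and it does nothing to explain the PDH error on the client itself.

```python
import re

# Plugin output convention: human-readable text, then "|", then
# space-separated perfdata items: 'label'=value[UOM];warn;crit;min;max
PERF_ITEM = re.compile(
    r"(?:'(?P<quoted>[^']+)'|(?P<bare>[^\s=']+))="
    r"(?P<value>-?[\d.]+)(?P<uom>[^;\s]*)"
    r"(?P<rest>(?:;[^;\s]*)*)"
)


def parse_perfdata(plugin_output):
    """Return one dict per perfdata item found after the first pipe.

    Hypothetical helper for illustration only.
    """
    if "|" not in plugin_output:
        return []  # no perfdata at all: nothing for a grapher to store
    perf = plugin_output.split("|", 1)[1]
    items = []
    for m in PERF_ITEM.finditer(perf):
        # "rest" looks like ";85;95;0;100"; drop the empty piece before the first ";"
        fields = m.group("rest").split(";")[1:]
        fields += [""] * (4 - len(fields))  # pad missing thresholds
        items.append({
            "label": m.group("quoted") or m.group("bare"),
            "value": float(m.group("value")),
            "uom": m.group("uom"),
            "warn": fields[0],
            "crit": fields[1],
            "min": fields[2],
            "max": fields[3],
        })
    return items
```

Feeding it the output line from the message above should give two items, labelled "5 min avg Load" and "15 min avg Load". If the same call against the problem host gives an empty list, the check is returning no perfdata and the grapher has nothing to plot.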
Re: [Nagios-users] Mobile phone/non-WAP interface
I'm definitely looking forward to it... :)

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

D. Emmanuel Feinsmith wrote:
> Yes, my iPhone Nagios App parses the output of status.cgi. For the iPhone it's a particularly tricky problem due to the small amount of memory available to applications. It's a pretty sophisticated little multi-threaded application that does a lot of incremental concurrent loading and various memory management tricks to keep the footprint as small as possible and keep the user interface snappy even when on an Edge iPhone.
>
> Daniel.
>
> On Mar 10, 2009, at 6:16 PM, Mathieu Gagné wrote:
>> Hi,
>>
>> This sure sounds promising. How is status retrieved? Is it done by parsing the HTML output?
>>
>> Mathieu
>>
>> D. Emmanuel Feinsmith wrote:
>>> Hi Patrick,
>>>
>>> I have submitted a native iPhone Nagios App to the iPhone App Store. They say it takes between 30 and 60 days to get on the store. It's written in Objective-C and gives very good user responsiveness, even with large installations. Way better than using Safari on the iPhone or WAP on other phones.
>>>
>>> Here are some screen shots of the beta version: http://www.shastasystems.com/nagiostouch.html
>>>
>>> It works on any Nagios instance, no modifications are necessary to the server itself. I'll send a message to this list when it is up on the store.
>>>
>>> Best Regards,
>>> Daniel.
>>>
>>> On Mar 10, 2009, at 2:46 PM, Patrick Morris wrote:
>>>> On Tue, 10 Mar 2009, Andrew Davis wrote:
>>>>> I see there's a WAP version built-in to Nagios, but what about phones that don't do WAP (ie: the iPhone)? Is there a simplified interface for viewing from a mobile device? I found inagios on Google, but the code, documentation, and such looks sketchy. I'm looking for something a bit more solid. Ideas?
>>>>
>>>> The WAP interface provided by nagios works fine on iPhones, as does the full standard web interface.
Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios
In Nagios 2nd Edition from Wolfgang Barth, he covers this in section 12.4.2. He covers smsclient, yasp, and smssend, but he points out what Jim pointed out... most are email-to-SMS conversion tools, so if email is down, you're still not notified. smsclient and a modem resolve this. I have it installed and working fine. Our normal notifications go via email, but if a network device or email server is down, then notifications go via SMS.

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Jim Avery wrote:
> 2009/3/11 tsedendorj oyunbat t_oyun...@yahoo.com:
>> I'm new to Nagios. I work at a mobile cellular operator. Is there any way to get notification alerts via SMS to my mobile phone when BSC or HLR equipment goes down or stops working? Is there any configuration for this in Nagios? And how do I connect the Nagios server to an SMS center?
>
> Currently I use an external email-sms service similar to http://www.intellisms.co.uk so I can send emails from Nagios in the usual way and have them converted to SMS. It's extremely easy to set up, but has the disadvantage that if the email system goes down, we don't know about it!
>
> Alternatively, you could connect a mobile 'phone to your Nagios server and send SMS using gnokii http://www.gnokii.org/ or similar, or your mobile operator might well have an http interface you could use or can recommend other options.
>
> In Nagios, you specify the notification methods in command definitions which you can write yourself so it's extremely flexible. See http://nagios.sourceforge.net/docs/3_0/notifications.html under the heading Notification Methods.
>
> hth,
>
> Jim
Re: [Nagios-users] How to receive notification alert by sms tomobile phone using Nagios
A combination of parents, hostgroups, and escalations. I'll post my configs later, but once you have confirmed that SMS is set up and working (make sure that for every email alert, you also get an SMS alert), you define your network topology and put the devices into hostgroups. In my case, I have a hostgroup of network-core which includes my core routers, firewalls, Exchange server, Exchange storage, and the switch connecting the Exchange system with the firewall. I then have a contactgroup of the SMS addresses of three key people (myself included). If any of those devices fail, Nagios sends notification via SMS. If anything else fails, it goes via email. The kewl thing is I'll often sleep through emails, but if I get an SMS, I know something serious failed and I force myself to get up...

Additionally, I tend to define a sanity check: a ping to Yahoo, an HTTP check to MSN, and a non-caching DNS query of Google. If all sanity checks fail and are not resolved within 4 hours, then through escalations, an SMS is sent to the manager of the support team at our ISP, as we likely have a bandwidth-down issue. If sanity checks AND a core network device fail, my ISP doesn't get notified, as we know it's us...

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Martyn wrote:
> Beat me to it with the same question
>
> Martyn
>
> *From:* Onotsky, Steve x55328 [mailto:steve.onot...@broadridge.com]
> *Sent:* 11 March 2009 15:55
> *To:* ncc...@gmail.com; nagios-users@lists.sourceforge.net
> *Subject:* Re: [Nagios-users] How to receive notification alert by sms tomobile phone using Nagios
>
> I've been messing around with this, off and on (as time permits). How did you set up your config to go to SMS only if email fails?
>
> Thanks
>
> Steve Onotsky
> Server Support Technologist
> Broadridge Investor Communication Solutions, Canada
> 5970 Chedworth Way
> Mississauga ON L5R 4G5
> Tel: (905) 507-5328 Fax: (905) 507-5312
> Inet: steve.onot...@broadridge.com
> //Quis custodiet ipsos custodes?// - Juvenal, //Satires//, VI, 347
>
> *From:* Andrew Davis [mailto:ncc...@gmail.com]
> *Sent:* March 11, 2009 11:35
> *To:* nagios-users@lists.sourceforge.net
> *Subject:* Re: [Nagios-users] How to receive notification alert by sms to mobile phone using Nagios
>
> In Nagios 2nd Edition from Wolfgang Barth, he covers this in section 12.4.2. He covers smsclient, yasp, and smssend, but he points out what Jim pointed out... most are email-to-SMS conversion tools, so if email is down, you're still not notified. smsclient and a modem resolve this. I have it installed and working fine. Our normal notifications go via email, but if a network device or email server is down, then notifications go via SMS.
>
> Jim Avery wrote:
>> 2009/3/11 tsedendorj oyunbat t_oyun...@yahoo.com:
>>> I'm new to Nagios. I work at a mobile cellular operator. Is there any way to get notification alerts via SMS to my mobile phone when BSC or HLR equipment goes down or stops working? Is there any configuration for this in Nagios? And how do I connect the Nagios server to an SMS center?
>>
>> Currently I use an external email-sms service similar to http://www.intellisms.co.uk so I can send emails from Nagios in the usual way and have them converted to SMS. It's extremely easy to set up, but has the disadvantage that if the email system goes down, we don't know about it!
>>
>> Alternatively, you could connect a mobile 'phone to your Nagios server and send SMS using gnokii http://www.gnokii.org/ or similar, or your mobile operator might well have an http interface you could use or can recommend other options.
>>
>> In Nagios, you specify the notification methods in command definitions which you can write yourself so it's extremely flexible. See http://nagios.sourceforge.net/docs/3_0/notifications.html under the heading Notification Methods.
>>
>> hth,
>>
>> Jim
[Nagios-users] NSClient++ and .Net
If I recall, Windows 2000 Server didn't include .Net, but you could install it. Windows 2003 Server included .Net 1.x, but you could additionally add .Net 2.x. Now it seems that Server 2008 includes .Net 2.x by default and you can install .Net 3.x, though MS finally got smart and did one big rollup package so you don't have to have .Net 1.x and 2.x both installed, etc.

So my question is this: right now we're mostly Server 2003, but migrating to Server 2008. NSClient++ 4.x requires .Net 2.x, but will it work with .Net 3.x on Server 2008? Anyone know? Anyone tried?

Thanks,

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
Re: [Nagios-users] NSClient++ and .Net
You are correct. With BB it was much easier... there was just one client. With Nagios I got lost in the cloud of clients and transports. My apologies... :)

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Michael Medin wrote:
> Hello,
>
> I am guessing you are a bit confused. NSClient++ does not use .net (nor require it); perhaps you are thinking of NC_Net? (Also, NSClient++ versions are along the lines of 0.3.6 or some such. I don't want to bloat the version numbers, so I tend to increase them sparingly :)
>
> // Michael Medin
>
> Andrew Davis wrote:
>> If I recall, Windows 2000 Server didn't include .Net, but you could install it. Windows 2003 Server included .Net 1.x, but you could additionally add .Net 2.x. Now it seems that Server 2008 includes .Net 2.x by default and you can install .Net 3.x, though MS finally got smart and did one big rollup package so you don't have to have .Net 1.x and 2.x both installed, etc.
>>
>> So my question is this: right now we're mostly Server 2003, but migrating to Server 2008. NSClient++ 4.x requires .Net 2.x, but will it work with .Net 3.x on Server 2008? Anyone know? Anyone tried?
>>
>> Thanks,
Re: [Nagios-users] Any recommendations
http://www.amazon.com/Nagios-Network-Monitoring-Wolfgang-Barth/dp/1593271794/ref=pd_bbs_sr_1?ie=UTF8s=booksqid=1236649292sr=8-1

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Martyn wrote:
> Yep, I understand; which book to purchase is the question though.
>
> Building a Monitoring Infrastructure with Nagios
> http://www.amazon.com/Building-Monitoring-Infrastructure-Nagios-Josephsen/dp/0132236931/ref=pd_bxgy_b_img_b
>
> or
>
> Nagios 3 Enterprise Network Monitoring: Including Plug-Ins and Hardware Devices
> http://www.amazon.com/Nagios-Enterprise-Network-Monitoring-Including/dp/1597492671/ref=pd_bbs_sr_2?ie=UTF8s=booksqid=1222355174sr=8-2
>
> Am on the verge of purchasing Nagios 3 ent.
>
> Thanks
>
> *From:* Andrew Davis [mailto:ncc...@gmail.com]
> *Sent:* 09 March 2009 22:11
> *To:* nagios-users@lists.sourceforge.net
> *Subject:* Re: [Nagios-users] Any recommendations
>
> Per the Nagios Exchange site, NagiosWeb only supports up to Nagios 2.x. It also hasn't been updated since 2006. If you're going to do a new Nagios install, you should be on 3.x. If you're going to try one, you're probably best off with NagMin, but realistically, unless you're going to make changes in the web frontend, then crawl your server for what's changed and diff it, you aren't likely to learn it.
>
> I just migrated from BB to Nagios following Wolfgang Barth's Nagios 2nd Edition and it's been really helpful for understanding the file structures and command line, especially if you want to extend Nagios with trending, etc.
>
> So this is where I politely say suck it up and read the manual... or at least a good book about it. :) Of course, a few years of experience as a UNIX/Linux admin won't hurt either...
>
> Martyn wrote:
>> Hi Group,
>>
>> Has anybody used, or can anybody recommend, any of the below, or any others? I think if I can build a few via a GUI it will give me a better understanding.
>>
>> Nagmin
>> Nagat
>> Nagiosweb
>>
>> Thanks
>> Martyn
[Nagios-users] Best options for monitoring Windows servers
I'm a UNIX and network guy, but since Nagios is my task, I also have to add the Windows servers to Nagios. The network tests are fine, but I'm curious which is the best solution for monitoring local items on a Windows server (2000, 2003, and 2008). Here's what I think the options are:

Use check_nt with NSClient, NSClient++, OpMon Agent, or NC_Net
Use check_nrpe with NSClient++, OpMon Agent, or NRPE_NT
Use check_ncnet with NSClient, NSClient++, OpMon Agent, or NC_Net
Use NSCA-Daemon with NC_Net

Note: I'm interested in screwing with WMI...

NSClient appears to be old, but stable and well known. However, I see it has good support for NT4, 2000, and 2003 R1, but not 2003 R2, and I don't see anything about 2008 Server. NC_Net looks current and well developed. NSClient++ seems to have NSClient compatibility, but is more current and supports NT4 through 2008. It looks like I already have check_nt as part of a default 3.x installation. NRPE_NT also seems to have been around for a bit and to be stable. NSClient++ seems to have its own, native NRPE support.

What are your opinions of the best combination to A) get the best overall support and B) have the least impact/installation on a Windows server? I want something current and actively developed with 2008 support, but I also want stability and the least work possible.

Presently I'm using NRPE for my local Linux/UNIX servers, so I'm inclined to give NRPE a try, but I'd also like to avoid loading plugins on the Windows client if I can. I'm leaning towards NSClient++ with check_nt, but I'm not sure this is the best solution.

Thoughts, opinions, and ideas are requested...

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
[Nagios-users] Sub-groups of hostgroups
Is it possible to do a sub-group of a hostgroup? For example, right now I have the following hostgroups:

linux-servers
unix-servers
mac-servers
network-devices

I'm going to be adding windows-servers as a hostgroup, but within this group I want to further subgroup both for readability and notifications. For example, I'd like subgroups of:

directory-servers
exchange-servers
file-servers

as sub-groups of the parent windows-servers hostgroup. Is this possible? If so, anyone have a working config/example they can share?

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
[Nagios-users] Mobile phone/non-WAP interface
I see there's a WAP version built-in to Nagios, but what about phones that don't do WAP (ie: the iPhone)? Is there a simplified interface for viewing from a mobile device? I found inagios on Google, but the code, documentation, and such look sketchy. I'm looking for something a bit more solid. Ideas?

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
[Nagios-users] Monitoring Windows hosts...
I'm a UNIX and network guy, but since Nagios is my task, I also have to add the Windows servers to Nagios. The network tests are fine, but I'm curious which is the best solution for monitoring local items on a Windows server (2000, 2003, and 2008). Here's what I think the options are:

Use check_nt with NSClient, NSClient++, OpMon Agent, or NC_Net
Use check_nrpe with NSClient++, OpMon Agent, or NRPE_NT
Use check_ncnet with NSClient, NSClient++, OpMon Agent, or NC_Net
Use NSCA-Daemon with NC_Net

Note: I'm interested in screwing with WMI...

NSClient appears to be old, but stable and well known. However, I see it has good support for NT4, 2000, and 2003 R1, but not 2003 R2, and I don't see anything about 2008 Server. NC_Net looks current and well developed. NSClient++ seems to have NSClient compatibility, but is more current and supports NT4 through 2008. It looks like I already have check_nt as part of a default 3.x installation. NRPE_NT also seems to have been around for a bit and to be stable. NSClient++ seems to have its own, native NRPE support.

What are your opinions of the best combination to A) get the best overall support and B) have the least impact/installation on a Windows server? I want something current and actively developed with 2008 support, but I also want stability and the least work possible.

Presently I'm using NRPE for my local Linux/UNIX servers, so I'm inclined to give NRPE a try, but I'd also like to avoid loading plugins on the Windows client if I can. I'm leaning towards NSClient++ with check_nt, but I'm not sure this is the best solution.

Thoughts, opinions, and ideas are requested...

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
[Nagios-users] Nagios, memory tests, and Linux hosts...
So along with disk and CPU tests, I want to test for memory usage. I pulled a Perl memory script off Nagios Exchange and was quickly reminded of how Linux handles memory... how it keeps whatever has been used previously held as buffers and cache. Thus, a server with 4 GB of RAM may show that 3.95 GB is in use, but in reality more like 512 MB is in use and the swap usage is under 10%, if used at all, confirming the actual versus reserved issue.

So how are you guys testing for memory on Linux boxes that flag it as reserved when it's not? Are there any scripts on Nagios Exchange for memory testing that are better than others?

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
Re: [Nagios-users] Nagios, memory tests, and Linux hosts...
Looks good, but there's no performance data, which I'd like so I can trend memory usage and justify upgrades. Anyone familiar with a good memory test that properly handles memory reservations AND also includes performance trending data?

A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Justin Pasher wrote:
> Andrew Davis wrote:
>> So along with disk and CPU tests, I want to test for memory usage. I pulled a perl memory script off Nagios Exchange and was quickly reminded of how linux handles memory... how it leaves whatever has been used previously as reserved. Thus, a server with 4Gb of RAM may show that 3.95Gb is in use, but in reality, more like 512Mb is in use and the swap usage is under 10%, if used at all, confirming the actual versus reserved issue.
>>
>> So how are you guys testing for memory on Linux boxes that flag it as reserved when its not? Are there any scripts on Nagios Exchange for memory testing that are better than others?
>
> I use check_memory.pl. It properly takes buffers and cache into consideration (using the free command).
>
> http://www.nagiosexchange.org/cgi-bin/page.cgi?g=Detailed%2F1433.html;d=1
>
> --
> Justin Pasher
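To make the request concrete, here is a minimal sketch in Python of what such a check could look like: it treats buffers and cache as reclaimable, as described for check_memory.pl above, and appends performance data after the pipe so the result can be trended. This is not check_memory.pl or any existing Nagios Exchange plugin. The function names, the mem_used_pct perfdata label, and the default thresholds are all assumptions made for illustration, and it reads /proc/meminfo-style text rather than calling the free command.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer kB values."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts:
            info[key.strip()] = int(parts[0])
    return info


def check_memory(meminfo_text, warn_pct=80.0, crit_pct=90.0):
    """Return (exit_code, plugin_line), counting buffers/cache as free.

    Hypothetical helper: thresholds and the perfdata label are illustrative.
    """
    m = parse_meminfo(meminfo_text)
    total = m["MemTotal"]
    # "Really used" memory: total minus free, buffers, and page cache
    used = total - m["MemFree"] - m.get("Buffers", 0) - m.get("Cached", 0)
    pct = 100.0 * used / total
    if pct >= crit_pct:
        code, state = 2, "CRITICAL"
    elif pct >= warn_pct:
        code, state = 1, "WARNING"
    else:
        code, state = 0, "OK"
    # Text before the pipe is for humans; text after it is perfdata
    line = "MEMORY %s - %.1f%% used (%d of %d kB) | mem_used_pct=%.1f%%;%g;%g;0;100" % (
        state, pct, used, total, pct, warn_pct, crit_pct)
    return code, line
```

To use it as a plugin, a small wrapper would read /proc/meminfo, pass the text to check_memory, print the returned line, and exit with the returned code. With the 4 GB example from this thread (most of it held as cache), the reported usage comes out low rather than near 100%, which is the behaviour being asked for.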
Re: [Nagios-users] Any recommendations
Per the Nagios Exchange site, NagiosWeb only supports up to Nagios 2.x. It also hasn't been updated since 2006. If you're going to do a new Nagios install, you should be on 3.x. If you're going to try one, you're probably best off with NagMin. Realistically, though, you aren't likely to learn Nagios that way unless you make changes in the web frontend, then crawl your server for what has changed and diff it. I just migrated from BB to Nagios following Wolfgang Barth's Nagios 2nd Edition, and it's been really helpful for understanding the file structures and command line, especially if you want to extend Nagios with trending, etc. So this is where I politely say suck it up and read the manual... or at least a good book about it. :) Of course, a few years of experience as a UNIX/Linux admin won't hurt either...

A. Davis Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Martyn wrote:

Hi Group, has anybody used, or can anybody recommend, any of the below, or any others? I think if I can build a few via a GUI it will give me a better understanding.

Nagmin
Nagat
Nagiosweb

Thanks
Martyn
Re: [Nagios-users] Any Help
Someone already did: Nagios 2nd Edition by Wolfgang Barth. Buy it.

Andrew Davis Email: ncc...@gmail.com Phone: 772-834-0010
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan

Jai Ram wrote:

Hello, can anyone post their experience installing Nagios on a Solaris server? Thanks, jai.
[Nagios-users] check_http question
I've enabled check_http for quite a few hosts, including tests for custom ports, URLs, and sites needing authentication. So far, so good. However, one issue in the Nagios interface is bothering me. If I drill in on a host, then the HTTP link for the test, it only shows OK. It DOES NOT show the full URL that was tested. For example, for one of my hosts, I had to test for a non-standard port, SSL, a directory structure, and authentication (-I $HOSTNAME -p 8099 -u /dir/dir/dir/file.php -a user:password). All worked in the sense that the test returned an OK status, but I have no visual way of seeing the tested URL within Nagios. In contrast, a similar test in BB (which I'm migrating away from) shows the full URL that was tested. Am I simply missing something here? Is there a way of seeing the full URL/string that was tested with the check_http command?

-- Andrew Davis
Re: [Nagios-users] check_http question
Thanks much. I guess I may have to just live with it the way it is until a future revision...

Andrew Davis

jmose...@corp.xanadoo.com wrote:

Unfortunately, that only displays the commands as they are defined in the command object definitions in the configuration file. You could look at the service commands being run, though. Back to the question: Nagios displays what is returned by the plugin. The plugin, by default, doesn't show the port or URL being monitored in the plugin output. You could run the plugin with the -v option (for verbose). That generates a lot of output. However, I don't know if Nagios will be able to parse the verbose output properly.

James Moseley

Seth Simmons <ssimm...@cymfony.com> wrote on 02/27/2009 01:35 PM (To: ncc...@gmail.com, nagios-users@lists.sourceforge.net; Subject: Re: [Nagios-users] check_http question):

In Nagios, go to view config - commands. It will show the exact command line being used for the host or service check.

Original message: From: Andrew Davis [mailto:ncc...@gmail.com]; Sent: Friday, February 27, 2009 2:28 PM; To: nagios-users@lists.sourceforge.net; Subject: [Nagios-users] check_http question
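Since Nagios only displays whatever the plugin prints, one possible workaround (not something proposed in the thread) is a small wrapper that adds the tested URL to the plugin's text before Nagios sees it. The Python sketch below shows only the string handling such a wrapper would need. The function names, the host www.example.com, and the sample output line are all hypothetical, and the credentials are deliberately left out of the URL so they never appear in the web interface.

```python
def build_url(host, port=None, path="/", ssl=False):
    """Reconstruct the URL that a check with these options would request."""
    scheme = "https" if ssl else "http"
    default_port = 443 if ssl else 80
    if port in (None, default_port):
        netloc = host
    else:
        netloc = "%s:%d" % (host, port)
    if not path.startswith("/"):
        path = "/" + path
    return "%s://%s%s" % (scheme, netloc, path)


def annotate_output(plugin_output, url):
    """Add the URL to the text part of the first output line.

    Anything after the first '|' on that line is performance data and is
    passed through unchanged so that graphing keeps working.
    """
    first_line, newline, rest = plugin_output.partition("\n")
    text, pipe, perfdata = first_line.partition("|")
    annotated = "%s (URL: %s)" % (text.rstrip(), url)
    if pipe:
        annotated += " |" + perfdata
    return annotated + newline + rest


if __name__ == "__main__":
    # Illustrative output line only; real check_http output may differ.
    sample = ("HTTP OK: HTTP/1.1 200 OK - 1234 bytes in 0.012 second "
              "response time |time=0.012s;;;0.000000 size=1234B;;;0")
    url = build_url("www.example.com", 8099, "/dir/dir/dir/file.php", ssl=True)
    print(annotate_output(sample, url))
```

A complete wrapper would run the real check_http command, pass its output through annotate_output, and exit with the same return code; that part is omitted here because it depends on the local plugin path and arguments.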