Re: [Nagios-users] displaying multiple lines of output in the GUI
Does anyone know how to configure nagios to display multiple lines output in the CGI? We're using nagios 3.2.0 and apparently, having multiple lines in the cgi is not done automagically. Thank you. On Thu, Aug 12, 2010 at 5:18 PM, Trisha Hoang tri...@rockyou.com wrote: Hi, I've upgraded nrpe to 2.12 and got multiple lines output from NRPE, and on the nagios server, I can see multiple lines in the Service State Information dialog box when clicked on the service link. However, on the main page, only 1 long line shows up and the 2nd line is cut off. Is it possible to have multiple lines of output per service on the 'main' page? Please let me know if you need more info. Thanks. Trisha -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] displaying multiple lines of output in the GUI
Hi, I've upgraded nrpe to 2.12 and got multiple lines output from NRPE, and on the nagios server, I can see multiple lines in the Service State Information dialog box when clicked on the service link. However, on the main page, only 1 long line shows up and the 2nd line is cut off. Is it possible to have multiple lines of output per service on the 'main' page? Please let me know if you need more info. Thanks. Trisha attachment: Screenshot.png-- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Looking for an alternative user interface with more advanced features
Thank you for all your suggestions. Sorry, I got no guts for 'bleeding edge' technology, and would like to sleep peacefully at night, though I will follow up on Ninja's development for future upgrades. Patrick's suggestions are great, simple, tried and true, and will fit our requirements. Thanks again. On Tue, Jun 15, 2010 at 1:14 PM, Andreas Ericsson a...@op5.se wrote: On 06/15/2010 04:18 AM, Trisha Hoang wrote: Hi, There are times when I need to disable notifications or submit downtime for *random* hosts/services that don't belong to any particular hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind of feature. Would you recommend some tools out there that are stable, easy to install, easy to use, that have some of the more advanced features? That's a lot of easy for free tools with advanced features ;) Ninja has something along those lines if you're willing to run bleeding edge (I think). You can select multiple hosts, hostgroups, services or servicegroups and issue commands for them if you like. I think it's only in the bleeding edge versions though (meaning in our git repositories, which are readable for anyone that wants to clone them). It's been a few weeks since I worked on Ninja, so I can't say for sure. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Looking for an alternative user interface with more advanced features
There are times that we need to take couple of hosts from *multiple* hostgroups for upgrade/testing. It gets to be time consuming commiting downtime for 20+ hosts one by one. Nagios only has features for either hostgroups and/or servicegroups but not a listing of nodes where users can pick and choose which hosts and services to enable/disable/downtime. On Mon, Jun 14, 2010 at 8:39 PM, Matt Simmons standalone.sysad...@gmail.com wrote: Do you mean that you can't do it if you go to Services or Hosts, or you mean that you really do want to disable notifications and downtime for *truly* random hosts? Because I don't think there's a whole lot of use cases matching that. --Matt On Mon, Jun 14, 2010 at 10:18 PM, Trisha Hoang tri...@rockyou.com wrote: Hi, There are times when I need to disable notifications or submit downtime for *random* hosts/services that don't belong to any particular hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind of feature. Would you recommend some tools out there that are stable, easy to install, easy to use, that have some of the more advanced features? Thank you. Trisha -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Looking for an alternative user interface with more advanced features
Hi, There are times when I need to disable notifications or submit downtime for *random* hosts/services that don't belong to any particular hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind of feature. Would you recommend some tools out there that are stable, easy to install, easy to use, that have some of the more advanced features? Thank you. Trisha -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Strange fluctuation in load average
Hi all, When I first installed nagios-3.2.0 with embedded perl enabled, nagios experienced increasing latency, starting at 1 sec and climbed upto 300 within a few hours until restarting nagios. I read on one of the older post suggesting to recompile nagios *without* embedded perl, and that resolved the latency issue, with latency consistently at less than 1 sec. However, ever since, the system load average has fluctuated wildly from 1 to 12 and down to say ... 3 within a minute. This fluctuation happens 3-10 minutes each time and calms down for ... say an hour. There doesn't seem to be any cron jobs that can cause this kind of load, and cpu (1-quad core) is usually at least 50% idle , with plenty of free memory, no IO blocks, on Centos 5-2. What's strange is with nagios compiled with embedded perl, the load was consistently at 2-4. Could this be nagios related? Please let me know if you need more information. -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Strange fluctuation in load average
I'm using uptime to obtain the load average. Here's a snippet of the values. 09:17:34 up 5 days, 16:06, 3 users, load average: 2.07, 2.61, 3.45 09:19:34 up 5 days, 16:08, 3 users, load average: 9.09, 4.78, 4.13 09:21:34 up 5 days, 16:10, 3 users, load average: 10.05, 6.69, 4.91 09:23:34 up 5 days, 16:12, 3 users, load average: 8.83, 7.08, 5.24 09:25:34 up 5 days, 16:14, 3 users, load average: 9.42, 8.26, 5.91 09:27:34 up 5 days, 16:16, 3 users, load average: 4.43, 6.66, 5.60 09:29:34 up 5 days, 16:18, 3 users, load average: 13.06, 8.85, 6.51 09:31:34 up 5 days, 16:20, 3 users, load average: 7.35, 8.61, 6.73 09:33:34 up 5 days, 16:22, 3 users, load average: 7.87, 7.96, 6.69 09:35:34 up 5 days, 16:24, 3 users, load average: 4.25, 6.94, 6.49 09:37:34 up 5 days, 16:26, 3 users, load average: 2.50, 5.34, 5.95 09:39:34 up 5 days, 16:28, 3 users, load average: 7.53, 6.21, 6.19 09:41:34 up 5 days, 16:30, 3 users, load average: 5.71, 6.11, 6.15 09:43:34 up 5 days, 16:32, 3 users, load average: 1.56, 4.39, 5.51 -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds
We used nagios-3.2.1 and Centos5.2 and experienced somewhat the same problem as Michael described, but I don't see the same problem after moving to nagios-3.2.0. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Full Throttle Nagios
I spent couple weeks playing with 3.2.1 and found that it performs very well with active checks (6500+ in 5 min at 1-2 sec latency max) but could not pass 5000 passive checks on the master server. When switched to 3.2.0, it processes 7300-7500 passive checks out of 8055 at 0.2 sec latency using directives use_large_installation_tweaks=1 and child_processes_fork_twice=0. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Problems with distributed monitoring
Hi Sergio, Some of the directives I found helpful for our MASTER server are listed below. Since status.dat and nagios.cmd are disk bound, put them on ramdisk will be faster. status_file=/mnt/ramdisk/status.dat command_file=/mnt/ramdisk/nagios.cmd I don't think aggressive_host_checking is needed as nagios checks for host when a service is in error anyway. use_aggressive_host_checking=0 check_host_freshness=0 Service freshness is important as the MASTER tends to process passive checks much slower so the services may go stale. However, since our checks are 5 min interval, having the MASTER wait for the next round of check is fine. check_service_freshness=1 service_freshness_check_interval=420 We use nagios-3.2.1 and I think these directives are still experimental but they seem to help. You will see defunct nagios processes that come and go. I think it's caused by child forked once instead of twice so one gets killed (my theory), but again, it seems to be running ok. use_large_installation_tweaks=0 child_processes_fork_twice=0 Our MASTER receives ~7000 passive checks from the SLAVE but it could only process max ~5000 passive checks per 5 min. The latency is about 10 secs. For the rest, the MASTER actively checks them. If you or someone knows a way to improve passive check processing, that will be great. Also, in our setup, we don't use NSCA. The slaves have ocsp_command=send_service_check where this command inserts the checks into a file that gets sent every 5 sec to the master. On the master, there's a script that opens this file and inserts the lines directly into the nagios.cmd pipe every 5 sec. Trisha -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] high host latency on nagios master
Hi, The nagios *master *got really high host latency and I'm not sure how to tweak it. I ran the check_ping plugin on a handful of hosts and the rta averaged at 0.2 second so it's not the network. *Environment:* - 565 hosts - 6790 passive checks from the slaves - not using event broker - master server *actively* executes the hosts checks every 5 minutes and *passively *processes checks every 1 minute - not doing performance data *Nagiostats* Nagios Stats 3.2.1 Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org) Last Modified: 03-09-2010 License: GPL CURRENT STATUS DATA -- Status File:/var/log/nagios/status.dat Status File Age:0d 0h 0m 23s Status File Version:3.2.1 Program Running Time: 0d 1h 32m 19s Nagios PID: 28282 Used/High/Total Command Buffers:1316 / 3066 / 4096 Total Services: 7745 Services Checked: 7745 Services Scheduled: 1381 Services Actively Checked: 955 Services Passively Checked: 6790 Total Service State Change: 0.000 / 9.740 / 0.007 % Active Service Latency: 18.948 / 205.144 / 165.751 sec Active Service Execution Time: 0.007 / 9.051 / 0.055 sec Active Service State Change:0.000 / 5.460 / 0.006 % Active Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Passive Service Latency:34.359 / 190.247 / 76.739 sec Passive Service State Change: 0.000 / 9.740 / 0.008 % Passive Services Last 1/5/15/60 min:0 / 3054 / 6774 / 6784 Services Ok/Warn/Unk/Crit: 7720 / 1 / 0 / 24 Services Flapping: 27 Services In Downtime: 0 Total Hosts:566 Hosts Checked: 566 Hosts Scheduled:566 Hosts Actively Checked: 566 Host Passively Checked: 0 Total Host State Change:0.000 / 0.000 / 0.000 % Active Host Latency:0.000 / 3410.087 / 2413.051 sec Active Host Execution Time: 0.007 / 10.010 / 0.063 sec Active Host State Change: 0.000 / 0.000 / 0.000 % Active Hosts Last 1/5/15/60 min:0 / 8 / 10 / 565 Passive Host Latency: 0.000 / 0.000 / 0.000 sec Passive Host State Change: 0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach: 563 / 3 / 0 Hosts Flapping: 1 Hosts In Downtime: 0 Active Host Checks Last 1/5/15 min: 5 / 32 / 75 Scheduled: 0 / 0 / 0 On-demand: 5 / 32 / 75 Parallel:1 / 11 / 23 Serial: 0 / 0 / 0 Cached: 4 / 21 / 52 Passive Host Checks Last 1/5/15 min:0 / 0 / 0 Active Service Checks Last 1/5/15 min: 0 / 0 / 0 Scheduled: 0 / 0 / 0 On-demand: 0 / 0 / 0 Cached: 0 / 0 / 0 Passive Service Checks Last 1/5/15 min: 2 / 1455 / 1455 External Commands Last 1/5/15 min: 1302 / 6063 / 20253 *Nagios.cfg* # EXTERNAL COMMAND CHECK INTERVAL # This is the interval at which Nagios should check for external commands. # This value works of the interval_length you specify later. If you leave # that at its default value of 60 (seconds), a value of 1 here will cause # Nagios to check for external commands every minute. If you specify a # number followed by an s (i.e. 15s), this will be interpreted to mean # actual seconds rather than a multiple of the interval_length variable. # Note: In addition to reading the external command file at regularly # scheduled intervals, Nagios will also check for external commands after # event handlers are executed. # NOTE: Setting this value to -1 causes Nagios to check the external # command file as often as possible. #command_check_interval=15s command_check_interval=-1 # SERVICE INTER-CHECK DELAY METHOD # This is the method that Nagios should use when initially # spreading out service checks when it starts monitoring. The # default is to use smart delay calculation, which will try to # space all service checks out evenly to minimize CPU load. # Using the dumb setting will cause all checks to be scheduled # at the same time (with no delay between them)! This is not a # good thing for production, but is useful when testing the # parallelization functionality. # n = None - don't use any delay between checks # d = Use a dumb delay of 1 second between checks # s = Use smart inter-check delay calculation # x.xx= Use an inter-check delay of x.xx seconds service_inter_check_delay_method=s #
[Nagios-users] pnp4nagios
In our environment, the slaves send check results in bulk to the master, which means that all the nagios macros associated with the results are no longer available once reached the master. What I want to do is to install pnp4nagios on the master so that we'll have graphs for all the results. The problem now is how do I manually create the right syntax for the service-perfdata file to be processed by the pnp4nagios script process_perfdata.pl, when all the macros are not available? Perhaps if you have a sample service-perfdata file, that would be helpful. Trisha -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] pnp4nagios
This is the syntax for the service-perfdata file. Looks like the slave can send 1) the regular format to nagios.cmd, and 2) the below format to pnp4nagios. I think it's do-able :-). DATATYPE::SERVICEPERFDATA TIMET::1270684107 HOSTNAME::www157SERVICEDESC::PUPPET SERVICEPERFDATA:: SERVICECHECKCOMMAND::check_nrpe!check_file_age!259200!10800!/var/lib/puppet/state/state.yaml HOSTSTATE::UP HOSTSTATETYPE::HARD SERVICESTATE::OK SERVICESTATETYPE::HARD Thank you. Trisha -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3.2.1 - browser refresh
I'm using 3.2.1 on Centos 5.2 and it doesn't work was well. On Tue, Mar 30, 2010 at 12:14 PM, patrick.mor...@hp.com wrote: On Tue, 30 Mar 2010, Joseph L. Casale wrote: See this recent thread for a couple of suggestions -- http://thread.gmane.org/gmane.network.nagios.user/66432/focus=66435 As I posted a couple of days ago, none of those options worked for me on my CentOS5 box, where as they did for 3.2.0. What changes have you tried, and where did you apply them? This issue is caused completely by the way Apache refreshes dynamic content (particularly PHP) by default. I can guarantee you that the suggestion I made previously in that thread works regardless of Nagios version. Caveat: I haven't looked at the index.php file under 3.2.1, but I'm 100% sure my suggestion will change the refresh behavior. Whether it'll break anything else added to that page since 3.2.0 is a different matter. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3.2.1 - browser refresh
I have tried both and restarted apache. 1) rename file $prefix/share/index.php to index.html, and 2) placing this line ?php header(Cache-Control: max-age=7200, public); ? on the first line of index.php. On Tue, Mar 30, 2010 at 2:14 PM, Joseph L. Casale jcas...@activenetwerx.com wrote: What changes have you tried, and where did you apply them? All the possibilities in that thread, renaming from php to html or adding the single line of code. Thanks, jlc -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null