Re: [Nagios-users] [Nagios-devel] RFC/RFP Service sets
Hi,

At first glance, I liked the nested config example, but after thinking about it for a little while, I'm starting to lean toward specifying the service_sets per service. In cases where multiple groups contribute to the Nagios configuration of an entity, doing this will allow each group to subscribe their services, and file permissions on the individual cfg files can be maintained, whereas storing things in the nested service_set object requires everyone to be able to write to that (plus it adds some additional complexity).

For what it's worth, I'm one of those people who use host groups to assign services to machines, because I like being able to minimize my work and maximize my results, and that's the shortest cut I've seen so far. This replaces that functionality nicely, I think.

--Matt

On Tue, May 17, 2011 at 8:27 AM, Andreas Ericsson a...@op5.se wrote:

> Ahoy (again).
>
> One of the ideas that surfaced at the Nagios developer meeting in Bolzano was a concept dubbed "service sets". Consider them basically partial host service profiles and you'll have roughly the right idea.
>
> The benefit of adding service sets is that users can share config settings for various types of hosts rather than some particular check, and also that the question "does Nagios support monitoring X?" is quite easily answerable on a higher level than "no, but you can add checks for this and that, and this too, so it sort of does anyway", which tends to leave people who have no idea of how Nagios works quite baffled.
>
> There are two implementation suggestions so far, perhaps best explained in sample configuration:
>
> --%--%--%--%--%--
> # compound-in-compound style (aka, extended template style):
> define service_set {
>     name windows-services
>     use windows-service-template
>     contact_groups windows-admins
>     parents NSClient
>     define service {
>         description NSClient ; parent of all the others
>         ...
>     }
>     define service {
>         description Disk usage C
>         check_command check_nsclient!C!80!90
>     }
> }
> define service_set {
>     use database-service-template
>     name psql-services
>     contact_groups db-admins
>     parents PSQL Listener
>     define service {
>         description PSQL Listener; parent of the other ones
>     }
>     define service {
>         description Cache hit ratio
>         ...
>     }
>     define service {
>         description Slow queries
>         ...
>     }
> }
> define host {
>     host_name win-psql1
>     service_sets windows-services,psql-services
> }
> --%--%--%--%--%--
>
> Pros:
> * Less typing.
> * Config is more normalized with less redundant information.
> * Service sets can also double as templates for the services they contain.
> * A service-set is obviously self-contained and quite easy to share under whatever name the recipient wishes to set for it.
> * Rules can be set so that the 'parents' directive inside a service_set has to refer to a service inside the service_set, for which the parents directive is then ignored.
> * The service set object will always be created when we're adding services to it, so we needn't stash them separately for adding later (ie, much easier to parse).
>
> Cons:
> * The config style used means current config parsers have to be modified to grok multi-level compounds in order to understand service-sets.
>
> --%--%--%--%--%--
> # regular object-by-object style
> define service {
>     use windows-service-template
>     description Disk usage C
>     service_sets windows-services
>     parents NSClient
>     ...
> }
> define service {
>     use windows-service-template
>     description NSClient
>     service_sets windows-services
>     ...
> }
> define service {
>     use database-service-template
>     description PSQL Listener
>     service_sets psql-services
>     ...
> }
> define service {
>     use database-service-template
>     description Cache hit ratio
>     parents PSQL Listener
>     service_sets psql-services
>     ...
> }
> define service {
>     use database-service-template
>     parents PSQL Listener
>     description Slow queries
>     service_sets psql-services
>     ...
> }
> define host {
>     host_name win-psql1
>     service_sets windows-services,psql-services
> }
> --%--%--%--%--%--
>
> Pros:
> * Can be used very nearly seamlessly with the current configuration parser.
> * Current config parsers need very little modification to work.
> * A single service can belong to many service sets without requiring duplication.
>
> Cons:
> * Harder to separate and isolate service sets for sharing.
> * Services get overloaded so they have to belong to either a host, a hostgroup or a service_set, which leads to more
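As a rough illustration of the object-by-object style discussed in this thread, the following Python sketch joins hosts and services on their service_sets names. It is only a sketch of the proposal: service sets were an RFC at this point, and the function name `expand_service_sets` and the dictionary layout are hypothetical, chosen for this example rather than taken from any Nagios parser.

```python
from collections import defaultdict

# Hypothetical in-memory form of the object-by-object sample configuration.
services = [
    {"description": "NSClient", "service_sets": ["windows-services"]},
    {"description": "Disk usage C", "service_sets": ["windows-services"], "parents": "NSClient"},
    {"description": "PSQL Listener", "service_sets": ["psql-services"]},
    {"description": "Cache hit ratio", "service_sets": ["psql-services"], "parents": "PSQL Listener"},
    {"description": "Slow queries", "service_sets": ["psql-services"], "parents": "PSQL Listener"},
]
hosts = [{"host_name": "win-psql1", "service_sets": ["windows-services", "psql-services"]}]


def expand_service_sets(hosts, services):
    """Return {host_name: [service descriptions]} by joining on set names."""
    # Index services by every set they subscribe to.
    members = defaultdict(list)
    for svc in services:
        for set_name in svc.get("service_sets", []):
            members[set_name].append(svc["description"])

    expanded = {}
    for host in hosts:
        attached = []
        for set_name in host.get("service_sets", []):
            for description in members[set_name]:
                # A service that belongs to two sets is attached only once.
                if description not in attached:
                    attached.append(description)
        expanded[host["host_name"]] = attached
    return expanded


result = expand_service_sets(hosts, services)
```

Because each service names its own sets, separate teams could keep their service definitions in separate files and still have them land on the same host, which is the file-permission argument made in the reply above.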
Re: [Nagios-users] exchange.nagios.org down?
Must have been my bad luck. It seems to be up now. Disregard complaining. Acquire plugins.

--Matt

On Sun, Dec 19, 2010 at 8:31 PM, Matt Simmons standalone.sysad...@gmail.com wrote:

> I'm having issues getting to the exchange, and according to the all-knowing, all-seeing powers that be (http://www.downforeveryoneorjustme.com/exchange.nagios.org), I'm not alone. Any knowledge of the issue or ETR?
>
> --Matt

--
LITTLE GIRL: But which cookie will you eat FIRST?
COOKIE MONSTER: Me think you have misconception of cookie-eating process.

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] exchange.nagios.org down?
I'm having issues getting to the exchange, and according to the all-knowing, all-seeing powers that be (http://www.downforeveryoneorjustme.com/exchange.nagios.org), I'm not alone. Any knowledge of the issue or ETR?

--Matt
Re: [Nagios-users] Nagios Community Down?
Thanks Tony!

On Mon, Aug 2, 2010 at 2:08 PM, Tony Yarusso tyaru...@nagios.com wrote:

> FYI, http://community.nagios.org/ is back up today.
>
> --
> Tony Yarusso
> Technical Team
> ___
> Nagios Enterprises, LLC
> Email: tyaru...@nagios.com
> Web: www.nagios.com
[Nagios-users] Nagios Community Down?
I'm getting a 302 whenever I try to connect to the Nagios community site (http://community.nagios.org/) as linked to by the main site. Is this related to the wiki issue or something else? (Or did I not get the memo?)

--Matt
Re: [Nagios-users] Nagios Community Down?
Do the people who run the website look at this list, or does anyone have the contact for them?

On Tue, Jul 27, 2010 at 1:20 PM, Michael Friedrich michael.friedr...@univie.ac.at wrote:

> On 2010-07-27 17:35, Matt Simmons wrote:
>> I'm getting a 302 whenever I try to connect to the Nagios community site (http://community.nagios.org/) as linked to by the main site.
>
> This has been redirecting to the main page for a while now. Probably because the Nagios community site has been hacked, if this blog post tells the truth:
> http://blog.sucuri.net/2010/07/nagios-community-site-hacked.html
>
> "It looks like a .htaccess redirection, but it might be something else. They are using an old version of WordPress, which may explain the compromise. We recommend people stay away from the site until it gets fixed."
>
>> Is this related to the wiki issue or something else? (Or did I not get the memo?)
>
> The wiki could have suffered from that too, but if no one affiliated with those servers says anything, we can only guess.
>
> Kind regards,
> Michael
>
> --
> DI (FH) Michael Friedrich
> Vienna University Computer Center
> Universitaetsstrasse 7
> A-1010 Vienna, Austria
> email: michael.friedr...@univie.ac.at
> phone: +43 1 4277 14359
> fax: +43 1 4277 14279
> web: http://www.univie.ac.at/zid
> Icinga Core IDOUtils Developer
> http://www.icinga.org
Re: [Nagios-users] wiki down?
If only there were some kind of software available to let us know when websites were down...

On Sat, Jun 26, 2010 at 1:07 PM, Roy Sigurd Karlsbakk r...@karlsbakk.net wrote:

>> The IP of the server points to a US-located server. They may not have woken up yet, or they are having a HW issue.
>
> Well, it's still down.
>
> Best regards
> roy
> --
> Roy Sigurd Karlsbakk
> (+47) 97542685
> r...@karlsbakk.net
> http://blogg.karlsbakk.net/
> --
> In all pedagogy it is essential that the curriculum be presented intelligibly. It is an elementary imperative for all pedagogues to avoid excessive use of idioms of foreign origin. In most cases, adequate and relevant synonyms exist in Norwegian.
Re: [Nagios-users] wiki down?
Bah! If you don't have an event handler that fences the misbehaving machine at the first sign of trouble, you're not trying hard enough ;-)

On Mon, Jun 28, 2010 at 8:34 AM, Max perld...@webwizarddesign.com wrote:

> On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons standalone.sysad...@gmail.com wrote:
>> If only there were some kind of software available to let us know when websites were down...
>
> Or people to respond to alerts from the software :)
Re: [Nagios-users] check_disk and Volumes 10 TB
If the underlying OSes are 32-bit, then it needs to be 32-bit as well (indeed, if it were 64-bit, it probably wouldn't run). Does the issue show itself if you change the units that check_disk reports in (for example: --units MB)?

--Matt

On Sat, Jun 19, 2010 at 9:35 AM, Dirk H. Schulz dirk.sch...@kinzesberg.de wrote:

> On 12.06.10 13:16, Matt Simmons wrote:
>> I've seen people run into problems like this when they're checking a machine that's 64 bit, and the monitoring host is 32. That's not, by chance, the case now, is it?
>
> Well, the problem also arises if check_disk is run locally on the monitored host. Could it be that it is compiled for 32 bit and has to be compiled for 64 bit?
>
> Dirk
>
>> --Matt
>>
>> On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz dirk.sch...@kinzesberg.de wrote:
>>> Hi folks,
>>>
>>> I have run into a problem with check_disk. I have a volume with 14 TB that is 91 % free:
>>>
>>> /dev/disk0s3 14Ti 1.2Ti 13Ti 9% /Volumes/EonStor1
>>>
>>> check_disk states it is 0 % free:
>>>
>>> check_disk -u GB /Volumes/EonStor1
>>> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| /Volumes/EonStor1=1276GB;;;0;14665
>>>
>>> Is there a known limitation concerning the size of the volumes? With a volume 2 TB I do not have this problem on the same machine. Is there something I can do to get around this?
>>>
>>> Any hint or help is appreciated.
>>>
>>> Dirk
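One way a 32-bit build could misreport a very large volume is integer wraparound in a block counter. The Python sketch below is an assumption about that failure mode, not a diagnosis of check_disk itself: it assumes 512-byte blocks and a counter truncated to 32 bits, and the helpers `wrap_u32`, `blocks_for` and `fits_in_u32` are invented for the illustration.

```python
BLOCK_SIZE = 512   # assumed block size in bytes
TIB = 2 ** 40      # one tebibyte in bytes


def wrap_u32(value):
    """Truncate an integer to 32 bits, as an unsigned 32-bit counter would."""
    return value & 0xFFFFFFFF


def blocks_for(size_bytes, block_size=BLOCK_SIZE):
    """Number of whole blocks needed to describe size_bytes."""
    return size_bytes // block_size


def fits_in_u32(size_bytes, block_size=BLOCK_SIZE):
    """True if the block count survives truncation to 32 bits unchanged."""
    blocks = blocks_for(size_bytes, block_size)
    return wrap_u32(blocks) == blocks


for size_tib in (1, 2, 13, 14):
    blocks = blocks_for(size_tib * TIB)
    print(size_tib, "TiB:", blocks, "blocks, wrapped:", wrap_u32(blocks))
```

Under these assumptions, only volumes needing fewer than 2^32 blocks (below 2 TiB) are counted correctly, which would fit the observation that smaller volumes on the same machine behave normally. Whether this is what actually happens in the plugin would need to be confirmed against its source.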
Re: [Nagios-users] Looking for an alternative user interface with more advanced features
Do you mean that you can't do it if you go to Services or Hosts, or do you mean that you really do want to disable notifications and downtime for *truly* random hosts? Because I don't think there are a whole lot of use cases matching that.

--Matt

On Mon, Jun 14, 2010 at 10:18 PM, Trisha Hoang tri...@rockyou.com wrote:

> Hi,
>
> There are times when I need to disable notifications or submit downtime for *random* hosts/services that don't belong to any particular hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind of feature. Would you recommend some tools out there that are stable, easy to install, easy to use, that have some of the more advanced features?
>
> Thank you.
> Trisha
Re: [Nagios-users] check_disk and Volumes 10 TB
I've seen people run into problems like this when they're checking a machine that's 64 bit, and the monitoring host is 32. That's not, by chance, the case now, is it?

--Matt

On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz dirk.sch...@kinzesberg.de wrote:

> Hi folks,
>
> I have run into a problem with check_disk. I have a volume with 14 TB that is 91 % free:
>
> /dev/disk0s3 14Ti 1.2Ti 13Ti 9% /Volumes/EonStor1
>
> check_disk states it is 0 % free:
>
> check_disk -u GB /Volumes/EonStor1
> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| /Volumes/EonStor1=1276GB;;;0;14665
>
> Is there a known limitation concerning the size of the volumes? With a volume 2 TB I do not have this problem on the same machine. Is there something I can do to get around this?
>
> Any hint or help is appreciated.
>
> Dirk
Re: [Nagios-users] Strange fluctuation in load average
When you say load average, do you mean the 1 minute moving average? And what are you using to display the load average?

--Matt

On Thu, Jun 10, 2010 at 3:48 PM, Trisha Hoang tri...@rockyou.com wrote:

> Hi all,
>
> When I first installed nagios-3.2.0 with embedded perl enabled, nagios experienced increasing latency, starting at 1 sec and climbing up to 300 within a few hours until restarting nagios. I read one of the older posts suggesting to recompile nagios *without* embedded perl, and that resolved the latency issue, with latency consistently at less than 1 sec.
>
> However, ever since, the system load average has fluctuated wildly from 1 to 12 and down to say ... 3 within a minute. This fluctuation happens 3-10 minutes each time and calms down for ... say an hour. There don't seem to be any cron jobs that can cause this kind of load, and cpu (1-quad core) is usually at least 50% idle, with plenty of free memory, no IO blocks, on Centos 5-2. What's strange is with nagios compiled with embedded perl, the load was consistently at 2-4.
>
> Could this be nagios related? Please let me know if you need more information.
>
> --
> Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha
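Since the question hinges on which of the three averages is being quoted, here is a small Python sketch that splits a Linux /proc/loadavg-style line into its 1-, 5- and 15-minute values. The sample line is made up for the example, and `parse_loadavg` and `LoadAvg` are names chosen here, not part of any Nagios tool.

```python
from typing import NamedTuple


class LoadAvg(NamedTuple):
    one_min: float
    five_min: float
    fifteen_min: float


def parse_loadavg(line: str) -> LoadAvg:
    """Parse the first three fields of a /proc/loadavg-style line."""
    fields = line.split()
    return LoadAvg(float(fields[0]), float(fields[1]), float(fields[2]))


# Made-up sample: a short burst shows up in the 1-minute figure first.
sample = "12.04 5.31 3.10 3/412 28711"
load = parse_loadavg(sample)
print(f"1 min: {load.one_min}, 5 min: {load.five_min}, 15 min: {load.fifteen_min}")
```

A 1-minute figure that jumps while the 15-minute figure stays low suggests short bursts rather than sustained load. On a live Unix system, Python's `os.getloadavg()` returns the same three numbers directly.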
Re: [Nagios-users] Large Installation
Make sure to read these pages:

http://nagios.sourceforge.net/docs/3_0/tuning.html
http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html

Also, if you're monitoring 800 machines across WANs, you might look into distributed monitoring:

http://nagios.sourceforge.net/docs/3_0/distributed.html

Let us know how it goes!

--Matt

BTW, what are you using for your config maintenance?

On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward 13.sward...@gmail.com wrote:

> We are looking to do a large installation of Nagios. Is it possible to monitor over 800 machines and over 14000 services? Has anyone tried doing anything like this? If you have, how successful was it and how did you configure it?
>
> ~Rultax
Re: [Nagios-users] Large Installation
I can't say that I've solved the scalability problem, but I don't have it, just because I've implemented a policy such that I never check any server over a WAN link, with the exception of another Nagios server (plus both ends of all of the WAN links themselves).

This does require one Nagios server per site, but to me, that's an appealing idea anyway, because I don't have a single point of failure. Any of my Nagios installations could die completely, and I'd be alerted by the others, just like any one internet connection could die, and I'd still get alerts about it. In the event of a weird failure, I can pretty much construct the network diagram based on which links are reporting up, and from where.

It does require a certain amount of configuration overhead, but most of that is done with templating anyway. I don't have my system laid out exactly like I want, but I'm implementing version control (Subversion, in my case) and I have a different Nagios repository for each site. If I had more templates (or more shared configuration files), I would probably have a 'nagios-shared' repository, so I wouldn't have to replicate everything manually.

As for the arrangement of my configs, it mostly follows this howto that I did a year ago:
http://www.standalone-sysadmin.com/blog/2009/07/nagios-config/

Hope it can help someone.

--Matt

On Thu, Jun 10, 2010 at 3:55 PM, Kevin Keane subscript...@kkeane.com wrote:

> Nagios does have some scalability issues, but for the most part you won't run into them until you get to truly huge installations. I can see three main scalability issues: config file maintenance, the need for one central server, and firewall issues.
>
> Config file maintenance can be improved to some extent with careful design of the config files, as well as tools. It is an issue that I am running into with a relatively small installation with 80+ hosts and 400+ services. My installation is highly heterogeneous and very dynamic, which makes config file maintenance a nightmare. Having to restart Nagios after a configuration change doesn't help either. On the other hand, a network with 2000 identical machines is probably going to be much easier to manage than my type of network.
>
> The central server is an obvious bottleneck. No matter how powerful the machine and the network connection, there are only so many check results it can handle. Fortunately, Nagios doesn't require much horsepower. Distributed monitoring helps with this issue because the most expensive part of Nagios is running active checks. With distributed monitoring, the active checks can run on multiple smaller boxes, which then send the check results back as passive checks. Of course distributed monitoring compounds the config file maintenance issue, because you have to configure each check multiple times.
>
> The third issue is not directly a scalability issue. Nagios is built with the assumption of a local and mostly trusted network. It's non-trivial to securely get checks to work on remote machines without poking pretty gaping holes into firewalls, and/or frequently establishing and tearing down encrypted connections with the attendant processing load. There are some third-party solutions for this issue, though.
>
> From: Scott Ward [mailto:13.sward...@gmail.com]
> Sent: Thursday, June 10, 2010 12:34 PM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Large Installation
>
>> Make sure to read these pages:
>> http://nagios.sourceforge.net/docs/3_0/tuning.html
>> http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html
>> Also, if you're monitoring 800 machines across WANs, you might look into distributed monitoring:
>> http://nagios.sourceforge.net/docs/3_0/distributed.html
>> Let us know how it goes!
>
> Thanks for the links. So the distributed monitoring described in the Nagios docs can handle what we're trying to do? I have read in a few places that Nagios has scalability issues.
>
>> --Matt
>> BTW, what are you using for your config maintenance?
>
> We haven't decided yet. Do you have any recommendations?
>
> ~S
>
> On Thu, Jun 10, 2010 at 2:23 PM, Matt Simmons standalone.sysad...@gmail.com wrote:
>> Make sure to read these pages:
>> http://nagios.sourceforge.net/docs/3_0/tuning.html
>> http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html
>> Also, if you're monitoring 800 machines across WANs, you might look into distributed monitoring:
>> http://nagios.sourceforge.net/docs/3_0/distributed.html
>> Let us know how it goes!
>>
>> --Matt
>>
>> BTW, what are you using for your config maintenance?
>>
>> On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward 13.sward...@gmail.com wrote:
>>> We are looking to do a large installation of Nagios. Is it possible to monitor over 800 machines and over 14000 services? Has anyone tried doing anything like this? If you have, how successful was it and how did you configure it?
>>>
>>> ~Rultax
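For the distributed setup discussed in this thread, where satellite servers run the active checks and hand results to a central server as passive checks, the sketch below formats a result as a Nagios `PROCESS_SERVICE_CHECK_RESULT` external command line. The host name `web01`, the service name, the plugin output and the function name `passive_service_result` are all made up for illustration, and how the line reaches the central server (its external command file, NSCA, or something else) is left out.

```python
import time


def passive_service_result(host, service, return_code, output, timestamp=None):
    """Format a PROCESS_SERVICE_CHECK_RESULT external command line."""
    # Plugin return codes: 0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN.
    if return_code not in (0, 1, 2, 3):
        raise ValueError("return_code must be 0, 1, 2 or 3")
    if timestamp is None:
        timestamp = int(time.time())
    return (
        f"[{timestamp}] PROCESS_SERVICE_CHECK_RESULT;"
        f"{host};{service};{return_code};{output}"
    )


# Hypothetical result from a satellite server; the timestamp is fixed for the example.
line = passive_service_result(
    "web01", "HTTP", 0, "HTTP OK - 0.012s response time", timestamp=1276200000
)
print(line)
```

The central server still needs a matching service definition with passive checks enabled for each result it receives, which is the "configure each check multiple times" overhead mentioned above.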
Re: [Nagios-users] **SPAM** Re: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum)
If you're in the habit of compiling things from source on a distro that uses package management, it's not too hard to set up your own internal repository. With CentOS, it's farcically easy. The hard part is making an RPM, and even that just takes a little time to figure out.

IBM's guide to packaging RPMs
http://www.ibm.com/developerworks/library/l-rpm1/

Example spec file
http://kmymoney2.sourceforge.net/phb/rpm-example.html

Creating a local YUM repository
http://www.g-loaded.eu/2005/12/11/local-yum-repository/

--Matt

On Sat, Jun 5, 2010 at 7:24 AM, Robert Wolfe robert.wo...@robertwolfe.org wrote:

> On Fri, 4 Jun 2010 01:17:21 +, Joseph L. Casale jcas...@activenetwerx.com wrote:
>>> I put the EPEL repo on to the system then ran a yum search nagios. Rpmforge showed up but EPEL didn't. I guess I should try a yum info for fun.
>>
>> Keep in mind anytime you add a 3rd party repo, you should use a yum plugin like priorities to keep Base protected.
>
> This is why for anything like this, I always upgrade from source.
Re: [Nagios-users] check_multi advice to check Internet available
Have you considered adding your upstream router as a host, and making it the parent of your remote hosts? Also, remember to take the 'u' (unreachable) flag off the notification options of the uplink's children, otherwise you'll still be buried under a pile of notifications.

--Matt

On Mon, May 24, 2010 at 5:37 AM, Marc Haber mh+nagios-us...@zugschlus.de wrote:

Hi,

I am checking a number of servers which are on the Internet, and are thus likely to become unreachable for my Nagios installations in case of an outage out there on the Internet. I would like to have a host which I could use as parent host for all those servers so that I don't get buried under an avalanche of HOST DOWN alarms should my Internet go partly or fully down.

I have investigated using check_multi to help me with that, and have come up with a solution which can help me with this task. I would like to solicit your comments on that solution.

I have defined a number of hosts which are checked by Nagios proper, just as normal hosts are. Their individual notifications may be switched off, but I haven't done so since I like to see what's going on, at least in the start phase. Their hostnames all start with mh-internet, so I can use a single check_multi config line to pull them all into a single check_multi check. check_multi is then used as service and host check on a virtual host called internet, which goes CRITICAL when more than one mh-internet-foo host is CRITICAL.
Here is my detailed configuration:

/etc/nagios3/conf.d/internet.cmd:

    statusdat [ internet ] = /^mh-internet-.*$/:my-defaultping
    state[WARNING] = COUNT(WARNING) > 1
    state[CRITICAL] = COUNT(CRITICAL) > 1
    state[UNKNOWN] = COUNT(UNKNOWN) > 1

Nagios configuration:

    define host {
        use            mh-host
        host_name      mh-internet
        alias          internet
        address        127.0.0.1
        parents        localhost
        check_command  check_multi!-f /etc/nagios3/conf.d/internet.cmd
    }

    define service {
        name                 my-internet
        use                  my-service
        service_description  my-internet
        check_command        check_multi!-f /etc/nagios3/conf.d/internet.cmd
        host_name            mh-internet
    }

    define command {
        command_name  check_multi
        command_line  /usr/lib/nagios/plugins/check_multi $ARG1$ $ARG2$ $ARG3$ $ARG4$
    }

    define host {
        use         mh-host
        host_name   mh-internet-torres
        hostgroups  mh, ping
        alias       torres.zugschlus.de
        address     85.214.68.41
        parents     mh-internet
    }

    define host {
        use         mh-host
        host_name   mh-internet-heise
        hostgroups  mh, ping
        alias       www.heise.de
        address     www.heise.de
        parents     mh-internet
    }

    define host {
        use         mh-host
        host_name   mh-internet-google-de
        hostgroups  mh, ping
        alias       www.google.de
        address     www.google.de
        parents     mh-internet
    }

I am wondering whether this is a valid approach, or whether there are better solutions for this challenge. If somebody is familiar with check_multi configuration, I would like to know whether there are any possible issues hiding in my use of check_multi. Any comments will be appreciated.

Greetings
Marc

--
Marc Haber         | I don't trust Computers. They  | Mail address in header
Mannheim, Germany  | lose things.   Winona Ryder    | Fon: *49 621 72739834
Nordisch by Nature | How to make an American Quilt  | Fax: *49 3221 2323190

--
LITTLE GIRL: But which cookie will you eat FIRST?
COOKIE MONSTER: Me think you have misconception of cookie-eating process.
COOKIE MONSTER: Boy, I wish I were a sysadmin so I could go to the NJ-PICC Sysadmin Conference! http://www.picconf.org
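The reply's suggestion (uplink as parent, no 'u' in the children's notification options) could look roughly like the sketch below. The template mh-host and the parent mh-internet come from the thread; the emit_child_host function, the example host name and the 192.0.2.10 address are hypothetical, and the snippet only prints a definition rather than touching any Nagios files.

```shell
#!/bin/sh
# Sketch: print a Nagios host definition that hangs off the uplink host and
# notifies only on DOWN and RECOVERY, not on UNREACHABLE.

emit_child_host() {
    host_name=$1
    address=$2
    parent=$3
    cat <<EOF
define host {
    use                     mh-host
    host_name               $host_name
    address                 $address
    parents                 $parent
    notification_options    d,r    ; down and recovery only, no "u"
}
EOF
}

# Hypothetical example host behind the mh-internet parent from the thread
emit_child_host mh-internet-example 192.0.2.10 mh-internet
```

With this in place, an outage of the parent should mark the children UNREACHABLE instead of DOWN, and the missing 'u' flag keeps those state changes from generating mail.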
Re: [Nagios-users] $SERVICEOUTPUT$ missing from emails.. Nagios 3.2.1
Have you verified that your MTA is operating correctly? Try sending mail from the command line (echo "test 1" | mail -s "test 1" em...@address.com) and see if that goes through, before you suspect Nagios.

--Matt

On Tue, Apr 20, 2010 at 7:34 AM, Marc Powell li...@xodus.org wrote:
On Apr 19, 2010, at 10:34 PM, Peter Kranz wrote:

It's the default that comes with the .tar:

    # 'notify-by-email' command definition
    define command{
        command_name notify-by-email
        command_line /usr/bin/printf "%b" "* Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$
    }

Make sure the contact is using 'notify-by-email' as their service notification command. There have been no reports of this as a bug so I'd greatly lean toward it being a configuration issue.

--
Marc
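To separate "Nagios is not filling in $SERVICEOUTPUT$" from "the mail pipeline is mangling the body", it can help to run the printf half of the command by hand. The sketch below is only an approximation of the stock definition: the render_body function and its positional arguments are hypothetical stand-ins for the Nagios macros.

```shell
#!/bin/sh
# Sketch: build a notification body with printf '%b', the same mechanism the
# notify-by-email command line relies on. Arguments replace the macros.

render_body() {
    notification_type=$1
    service_desc=$2
    host_alias=$3
    service_state=$4
    service_output=$5
    # %b expands the \n escapes embedded in the argument
    printf '%b' "Notification Type: $notification_type\n\nService: $service_desc\nHost: $host_alias\nState: $service_state\n\nAdditional Info:\n\n$service_output\n"
}

# Manual end-to-end test (needs a working local MTA and a real address):
#   render_body PROBLEM HTTP web1 CRITICAL "Connection refused" \
#       | mail -s "test notification" you@example.com
```

If the hand-built message arrives with its last section intact, the MTA is fine and the command definition or the contact's notification command is the next thing to check.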
Re: [Nagios-users] Monitor Thin Clients with Nagios
Like anything else, the thin clients have to be able to report, somehow. Given the fact that there is a Wyse MIB ( http://www.oidview.com/mibs/714/WYSE-MIB.html ), I'd wager they support SNMP, and thus could probably be set up like any other network-accessible host.

Also, there's a chance (though I'm unfamiliar with the equipment, so I don't know how good a chance) that they could be set up to send SNMP traps, in which case you can probably be alerted to hardware faults.

Good luck. Get it done, then write some documentation! ;-)

--Matt

On Wed, Mar 31, 2010 at 4:06 PM, Victor Lanza vicja...@gmail.com wrote:

Hi,

Has anyone tried to monitor thin clients with Nagios, as far as hardware health or anything other than just ping? Specifically with Wyse thin clients and Nagios 3.0.6. I'm curious to see what can be monitored.

Thanks,
Victor
Re: [Nagios-users] Unable to Open Nagios WebConsole-'PageCannotbeDisplayed'
Try this from a command prompt:

    telnet localhost 80

then if it connects, type

    GET / HTTP/1.0

and hit enter twice. Paste the output. For instance, mine looks like this:

    Trying 127.0.0.1...
    Connected to newcastle.int.ia.
    Escape character is '^]'.
    GET / HTTP/1.0

    HTTP/1.1 200 OK
    Date: Tue, 30 Mar 2010 13:40:31 GMT
    Server: Apache/2.2.8 (Ubuntu) PHP/5.2.4-2ubuntu5.10 with Suhosin-Patch mod_perl/2.0.3 Perl/v5.8.8
    Last-Modified: Wed, 21 Jan 2009 14:22:15 GMT
    ETag: "3eadb-2d-460fee1e9d3c0"
    Accept-Ranges: bytes
    Content-Length: 45
    Connection: close
    Content-Type: text/html

    <html><body><h1>It works!</h1></body></html>

Reply with the output. I doubt that you're connecting to the web server, but this should determine whether it can be connected to, or if something else is broken.

--Matt

PS - You don't know if your web browser is set up to talk to a proxy, do you?

On Mon, Mar 29, 2010 at 9:03 PM, ravishankar.gundlap...@wipro.com wrote:

Hi,

Please see the output mentioned in the attachment. Could someone please help me with this, as I am not able to understand what the error is.

Thanks and Regards,

To quickly summarize: You mentioned you are entering "http://localhost" and getting a Page Cannot Be Displayed error. This may be a dumb question, but where are you entering "http://localhost"? I would assume it's a web browser, but is it a web browser being run from the same machine you've installed Apache on?

Yes, I am running the web browser from the same machine where I have installed Apache.

Also, it would help to know *specifically* what the error is. Is it a timeout? A DNS lookup error? Something else? There should be more details, and it would help if you could pass those on.

I am getting "Page Cannot be displayed".
Off the top of my head, given that you have Apache running and bound to port 80, the following seem the most likely:

a) Your browser and your webserver are on two different machines, so using "http://localhost" is not going to work, since it points to the machine your browser is on,
b) localhost is somehow not defined on the machine you're using, or
c) your Apache configuration is incorrect.

Let me throw in a little tip on asking questions on mailing lists while I'm here. Details are good. Knowing what your Apache configuration looks like, what's showing up in the logs, *exactly* what you're doing to test and what the *complete* results of those tests are, and any other information you have that we don't would really help someone else troubleshoot your issue. You've given us very little, and as a result what you now have is a bunch of people throwing out wild guesses since we've got no real data from you to work with.
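The telnet test in the reply can also be scripted, which makes it easier to paste exact results back to the list. The sketch below only extracts the status code from a raw response; the http_status_from_response name is hypothetical, and the commented live example assumes a netcat binary called nc is installed.

```shell
#!/bin/sh
# Sketch: read a raw HTTP response on stdin and print the three-digit status
# code from its first line (prints nothing if the first line is not HTTP).

http_status_from_response() {
    sed -n '1s/^HTTP\/[0-9.]* \([0-9][0-9][0-9]\).*/\1/p'
}

# Against a live server, the request typed into telnet would be sent as:
#   printf 'GET / HTTP/1.0\r\n\r\n' | nc localhost 80 | http_status_from_response
```

No output at all suggests the connection itself failed, which points at the web server or at name resolution for localhost rather than at Nagios.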
Re: [Nagios-users] HI ALL
We use email and SMS to deliver notifications. We do it the cheap way, which is by using our cell provider's email-to-SMS gateway. We're in the States, so we can get away with that. I have heard that cell providers elsewhere don't always offer a gateway service.

--Matt

On Fri, Mar 12, 2010 at 9:43 AM, wale Ajayi werty...@gmail.com wrote:

Please, I need info on web SMS + Nagios integration. Thanks for your response.
[Nagios-users] Keeping the Nagios Configuration Sane
Hi All,

I'm attending the 2010 Professional IT Community Conference (http://www.picconf.org) being held in New Brunswick, NJ, and I'm giving a talk about staying sane while working with the Nagios configuration. The talk will be 45 minutes long, and will primarily be an offshoot of this article that I wrote on my blog: http://www.standalone-sysadmin.com/blog/2009/07/nagios-config/

I could talk about that and some other things that I've been figuring out, but I was wondering if anyone had any tricks or tips for dealing with the Nagios config? Is there anything special that you do to keep things straight?

I'm going to be putting my slides and any additional material online following the conference, so hopefully someone else can get some use from it.

By the way, if anyone on this list is in the north east of the US, you should come visit the conference. Without training, it's only $275 for 2 days. With a full day and a half of training, it's still only $400 for the whole shebang.

Anyway, this isn't a sales email. I'm looking forward to any tips you would want to share. Thanks in advance!

--Matt
Re: [Nagios-users] check_by_ssh from a different username
Yes. And you can simplify this by running ssh-copy-id as the Nagios user:

--
nag...@monserv:~$ ssh-copy-id remu...@remhost
remu...@remhost's password:
Now try logging into the machine, with ssh 'r...@webdb1.int.ia', and check in:

  .ssh/authorized_keys

to make sure we haven't added extra keys that you weren't expecting.
--

Most Linux systems that I've seen provide this script. Other Unices may not have it. If it's there, it's much faster than manually copying and pasting public keys.

--Matt

On Tue, Mar 9, 2010 at 8:23 AM, Marc Powell m...@ena.com wrote:
On Mar 9, 2010, at 12:46 AM, Morris, Patrick wrote:

How can I use check_by_ssh to monitor remote servers, when the username on the remote box is different from the Nagios user name?

Install the private keys of the remote users on the Nagios machine under the nagios user. You've already determined they work, so now copy them over.

Actually, it's the opposite. He needs to add the nagios user's public key to the authorized_keys file for each user on the remote machines. Moving private keys around is a security no-no.

--
Marc
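Once the nagios user's public key is in the remote account's authorized_keys, the remote login name is passed to check_by_ssh with -l. The sketch below wraps that call; the check_remote_as name, the DRY_RUN switch, the 30 second timeout and the plugin path are assumptions for illustration (the path matches the one used elsewhere on this list for Debian-style installs).

```shell
#!/bin/sh
# Sketch: run a command on a remote host as a different user via check_by_ssh.
# With DRY_RUN set, the command line is printed instead of executed.

PLUGIN=${PLUGIN:-/usr/lib/nagios/plugins/check_by_ssh}

check_remote_as() {
    remote_user=$1
    host=$2
    remote_cmd=$3
    # -H: remote host, -l: remote login name, -t: timeout in seconds,
    # -C: command to execute on the remote side
    ${DRY_RUN:+echo} "$PLUGIN" -H "$host" -l "$remote_user" -t 30 -C "$remote_cmd"
}

# Run as the nagios user so that its key pair is the one being offered:
#   check_remote_as remuser remhost "/usr/lib/nagios/plugins/check_disk -w 10% -c 5% -p /"
```

Running it once by hand as the nagios user also takes care of accepting the remote host key, which otherwise tends to make the first scheduled check fail.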
Re: [Nagios-users] Monitoring a process with its pid
If you have SNMP enabled on the hosts, you can pull that information out:

    ~$ snmpwalk -v 2c -c COMMUNITY myhost.mydomain.com HOST-RESOURCES-MIB::hrSWRunName.3309
    HOST-RESOURCES-MIB::hrSWRunName.3309 = STRING: crond

On Mon, Mar 8, 2010 at 8:10 AM, Jatin Davey jasho...@cisco.com wrote:

Let me rephrase it a little. Basically, what I want is to give the process ID as the input and make sure that the process with the given ID is running on the target machine. Once it stops, there will no longer be a process running with that ID, and I need to raise an email notification alert.

Thanks
Jatin

On 3/8/2010 5:41 PM, Jatin Davey wrote:

Hi All,

Do we have a plugin that can monitor a process ID (PID) on a target machine? The target machine could be anything, such as Windows, Linux, Solaris, etc. It should alert with an email notification when the process ID changes.

Appreciate your responses.

Thanks
Jatin
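The snmpwalk lookup above can be turned into a small plugin. This is a sketch under a few assumptions: the Net-SNMP snmpget tool is installed on the Nagios server, the agent answers a query for a missing PID with a "No Such Instance" or "No Such Object" message, and the function names are made up for this example.

```shell
#!/bin/sh
# Sketch of a Nagios plugin: is a process with the given PID still listed in
# the remote host's hrSWRunTable?

classify_reply() {
    # $1: text printed by snmpget (empty on error), $2: PID being checked
    case $1 in
        ''|*'No Such Instance'*|*'No Such Object'*)
            echo "CRITICAL - no process with PID $2"
            return 2 ;;
        *)
            echo "OK - PID $2 is running: $1"
            return 0 ;;
    esac
}

check_pid_snmp() {
    host=$1
    community=$2
    pid=$3
    # -Oqv prints only the value, e.g. "crond"
    reply=$(snmpget -v 2c -c "$community" -Oqv "$host" \
        "HOST-RESOURCES-MIB::hrSWRunName.$pid" 2>/dev/null)
    classify_reply "$reply" "$pid"
}

# Example: check_pid_snmp myhost.mydomain.com COMMUNITY 3309
```

Keep in mind that PIDs are reused, so an OK result only says that some process currently holds that number; comparing the returned name against the expected one would tighten the check.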
[Nagios-users] Failure Prediction: Are we there yet?
Alright, so going through all of the things in my generic-services definition that I never ever look at (event_handler_enabled? who would set that to 0?), I came across failure_prediction_enabled 1.

I couldn't recall memorizing that particular line from the object definitions. I checked, and sure enough, that directive isn't listed (http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service).

I did some more research. I found out that in July of 2002, Ethan hoped it would be out in 2003 or 2004 (http://support.nagios.com/knowledgebase/faqs/index.php?option=com_content&view=article&id=52&catid=35&faq_id=25&expand=false&showdesc=true). It's 6 years later, and for some reason, it's still in my config file.

I'll admit...my config file isn't the most...ahem...chaste of configs. It's been around the block, and is no spring chicken.

I guess what I am asking is: since it isn't in the object definitions, and no official noise has been made about it in nearly a decade, can I safely assume that it no longer matters? Anyone have a recent copy of the source code lying around that they could grep? I just want to make sure that it isn't set to More Magic (http://catb.org/jargon/html/magic-story.html)

--Matt
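For anyone with an unpacked source tree at hand, the grep being asked for is a one-liner. The sketch below wraps it in a hypothetical find_directive helper; the source directory in the example comment is an assumption, and nothing is claimed here about what the search will turn up.

```shell
#!/bin/sh
# Sketch: list every file and line in a source tree that mentions a directive.

find_directive() {
    directive=$1
    src_dir=$2
    # -r: recurse into subdirectories, -n: show line numbers
    grep -rn -- "$directive" "$src_dir"
}

# Example, run from the directory that holds the unpacked tarball:
#   find_directive failure_prediction_enabled ./nagios-3.2.1
```

Hits that only parse and store the value, with no code that later reads it, would support the guess that the directive is a leftover.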
Re: [Nagios-users] VMware Appliance
It strikes me that if there isn't one available and relatively up to date, it would be a good endeavor for the Nagios community to work on. I don't have a lot of time, but I'd be willing to work with people to get an image prepared. Anyone want to help, or have suggestions?

--Matt

On Wed, Mar 3, 2010 at 8:30 AM, Antonio Pereira apere...@rha.ca wrote:

Hello All,

I was looking around for a VMware Nagios appliance and I see a couple, but they are old and I can't find any to download. Are there any more of these around?

Thanks
Re: [Nagios-users] multiple check_interval with single service check
You could write a wrapper around the plugin to check the time of day and execute it as needed.

On Mon, Mar 1, 2010 at 2:47 PM, shadih rahman shadhi...@gmail.com wrote:

All,

Is it possible to use multiple check_interval values based on the hour of the day for a single check? For example, during the daytime the service should be executed every 15 minutes, and during the night it should be executed every 2 hours. Can someone please show me how it's done?

Thanks
--
Cordially,
Shadhin Rahman
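One way such a wrapper could work is sketched below: the service is scheduled at the short daytime interval, and at night the wrapper replays a cached result until two hours have passed. The day boundaries, the cache file layout and all function names are assumptions, and date +%s is a widely available extension rather than strict POSIX.

```shell
#!/bin/sh
# Sketch: run the real plugin on every call during the day, but at most once
# per NIGHT_INTERVAL seconds at night, replaying the cached result otherwise.

DAY_START=${DAY_START:-8}             # first daytime hour (0-23)
DAY_END=${DAY_END:-20}                # first nighttime hour (0-23)
NIGHT_INTERVAL=${NIGHT_INTERVAL:-7200}

is_daytime() {
    # $1: hour of day without leading zero
    [ "$1" -ge "$DAY_START" ] && [ "$1" -lt "$DAY_END" ]
}

should_run() {
    # $1: hour of day, $2: seconds since the plugin last really ran
    is_daytime "$1" || [ "$2" -ge "$NIGHT_INTERVAL" ]
}

run_wrapped() {
    # $1: cache file; remaining arguments: the real plugin command line
    cache=$1; shift
    now=$(date +%s)
    hour=$(date +%H); hour=${hour#0}
    last=0
    [ -f "$cache" ] && last=$(sed -n '1p' "$cache")
    last=${last:-0}
    if should_run "$hour" $((now - last)); then
        output=$("$@"); status=$?
        # Cache layout: line 1 timestamp, line 2 exit status, rest plugin output
        printf '%s\n%s\n%s\n' "$now" "$status" "$output" > "$cache"
    else
        status=$(sed -n '2p' "$cache")
        output=$(sed -n '3,$p' "$cache")
    fi
    printf '%s\n' "$output"
    return "${status:-3}"
}

# Example command_line (hypothetical paths):
#   run_wrapped /var/tmp/check_foo.cache /usr/lib/nagios/plugins/check_foo -w 5 -c 10
```

The trade-off is that Nagios records a fresh check time for replayed results, so the web interface will not show that a nighttime result may be up to two hours old.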
Re: [Nagios-users] customized init script
Do you want to monitor whether they ran correctly, or whether the daemons they started are still running correctly?

--Matt

On Fri, Feb 26, 2010 at 2:43 AM, Kaushal Shriyan kaushalshri...@gmail.com wrote:

Hi,

Is there any recommended method to monitor customized init scripts? I know check_tcp will only monitor ports. Please suggest.

Thanks,
Kaushal
Re: [Nagios-users] Acknowledgement text in emails...
As Patrick Morris suggested, it's possible to include the acknowledgement text using the $SERVICEACKCOMMENT$ macro. Just insert it into the proper place in your notify-service-by-email command definition (for host acknowledgements, the counterpart is $HOSTACKCOMMENT$ in notify-host-by-email).

--Matt

On Fri, Feb 12, 2010 at 10:09 AM, Andrew Davis ncc...@gmail.com wrote:

For months I've been acknowledging alerts and including a reason why, etc., but I just realized that the acknowledgment emails don't include the reasons that I take the time to write, and thus my boss sees me ack'ing them but doesn't know if it's a non-critical issue, etc.

Is there an option I can add to include the acknowledgment reasons in the acknowledgment emails that go out?

--
A. Davis
Email: ncc...@gmail.com
There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan
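If the notification is sent by a script instead of an inline command line, the acknowledgement text can be added only when it applies. The sketch below assumes the script receives $NOTIFICATIONTYPE$, $SERVICEACKAUTHOR$ and $SERVICEACKCOMMENT$ as arguments; the ack_section name and the output wording are hypothetical.

```shell
#!/bin/sh
# Sketch: print an acknowledgement section for the mail body, but only for
# notifications of type ACKNOWLEDGEMENT.

ack_section() {
    # $1: $NOTIFICATIONTYPE$, $2: $SERVICEACKAUTHOR$, $3: $SERVICEACKCOMMENT$
    [ "$1" = "ACKNOWLEDGEMENT" ] || return 0
    printf 'Acknowledged by: %s\nComment: %s\n' "$2" "$3"
}

# In a notification script the result would be appended to the usual body:
#   { print_usual_body; ack_section "$1" "$2" "$3"; } | mail -s "..." "$contact"
# (print_usual_body and $contact are placeholders for your own script)
```

For problem and recovery notifications the function prints nothing, so those mails stay unchanged.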
Re: [Nagios-users] Problem with check_by_ssh
My first guess would be that some environment variable is set when you log in that isn't set when you aren't logged in (i.e., when Nagios triggers the check). Are you using keys that have passphrases, maybe? Can you give us a look at the contents of check_by_ssh?

--Matt

On Fri, Feb 5, 2010 at 10:12 AM, mirko stefanelli mirko.stefane...@gmail.com wrote:

Hi all,

We run Nagios 3.2.0 on an Ubuntu server with 2 GB of RAM, kernel version 2.6.24-23-server. We have configured all our servers (Linux REL) to execute our custom plugin via check_by_ssh.

The problem is that for some servers this check fails and the output is:

CRITICAL - Plugin timed out while executing system call

If we run the same command that check_by_ssh executes, as user nagios on the CLI, it works fine!

Any idea?

Regards,
Mirko.
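To make the environment question above more concrete, here is a sketch of a command definition that does not depend on anything from a login session: it names the key file explicitly with -i and sets the plugin timeout with -t. The command name, key path, remote user, host name, plugin path and the generic-service template are all hypothetical; -H, -l, -i, -t and -C are regular check_by_ssh options. The key is assumed to have no passphrase, since Nagios cannot answer a prompt.

```
# Sketch only: command name, key path, remote user and timeout are assumptions.
define command {
    command_name    check_by_ssh_custom
    command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -l nagios -i /home/nagios/.ssh/id_rsa -t 30 -C "$ARG1$"
}

# Hypothetical service using the command above; $ARG1$ is the remote plugin path.
define service {
    use                     generic-service
    host_name               rhel-server1
    service_description     Custom check
    check_command           check_by_ssh_custom!/usr/local/nagios/libexec/check_custom
}
```

If the check still times out only on some servers, comparing how long the remote plugin takes on those machines against the -t value would be my next step.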
Re: [Nagios-users] How to accomplish this task
What manufacturer / model of blades do you have? There are sometimes add-ons for ESX(i) (such as Dell's OpenManage) that provide interfaces for monitoring hardware.

On Wed, Jan 27, 2010 at 6:33 PM, ReynierPM rper...@uci.cu wrote:

Hi everyone,

I have 6 blade servers running ESX vSphere, and on top of this structure I have 78 more virtualized servers. Is there any way to check the blade servers' processor temperature, RAM status, and other hardware-related information?

--
Cheers and thx in advance
ReynierPM
Re: [Nagios-users] How to run a perl script instead of sending emails?
Would you be willing to share this script? I asked a question a while back on serverfault (http://serverfault.com/questions/21121/nagios-alerts-by-telephone) and this could be useful to a lot of people (assuming I'm not missing a general solution that's out there and that I just don't know about).

--Matt

On Sun, Jan 24, 2010 at 5:48 PM, Mr Gabriel gabr...@impactteachers.com wrote:

Dear All,

I have a Perl script that connects to my Asterisk instance and calls whoever is on shift so they can log in and take care of whatever broke. I would like to run this script whenever a critical alert has gone off more than once. How can I best achieve this?

I would like it to run instead of an email when out of hours, and when a critical alert has been left unattended for some time.
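One way the request above could be wired up, as a sketch rather than a tested setup: wrap the script in a notification command, attach it to a dedicated contact, and use a service escalation so that contact is only notified from the second notification onward. The script path and its arguments, the contact name, the host and service names, and the out-of-hours timeperiod are all hypothetical; the timeperiod is assumed to be defined elsewhere.

```
# Sketch only: script path, arguments and all object names are assumptions.
define command {
    command_name    notify-by-phone
    command_line    /usr/local/bin/call_oncall.pl "$HOSTNAME$" "$SERVICEDESC$" "$SERVICESTATE$"
}

# Contact that "receives" the phone call instead of an email.
define contact {
    contact_name                    oncall-phone
    alias                           On-call phone
    host_notifications_enabled      0
    service_notifications_enabled   1
    host_notification_period        out-of-hours
    service_notification_period     out-of-hours
    host_notification_options       n
    service_notification_options    c
    host_notification_commands      notify-by-phone
    service_notification_commands   notify-by-phone
}

# From the 2nd notification onward, CRITICAL alerts go to the phone contact
# while the out-of-hours timeperiod is active.
define serviceescalation {
    host_name               web1
    service_description     HTTP
    first_notification      2
    last_notification       0
    notification_interval   30
    escalation_period       out-of-hours
    escalation_options      c
    contacts                oncall-phone
}
```

While an escalation is in effect, only the contacts listed in it are notified, which is what gives the "instead of an email" behaviour; add the regular contacts to the escalation if they should keep getting mail as well.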
Re: [Nagios-users] Problem with check_by_ssh
Are you able to ssh to the remote machine as the remote user? I've been bitten by not having the remote host in my nagios user's known_hosts file: my scripts get prompted, and for some reason that stops me from obtaining statistics ;-)

--Matt

On Fri, Jan 22, 2010 at 7:35 AM, Mario Rimann ma...@rimann.org wrote:

Hi there,

We're experiencing an issue on a new setup where check_by_ssh should run a script on a foreign host with two parameters and handle the response. Executing the script on the remote server works, but the parameters don't get transmitted to the remote server.

After fiddling around with the service and the command definition, I started running the command on the shell to see what happens.

If I run it locally as root, everything works:

[r...@monitoring ~]# /path/to/libexec/check_by_ssh -l root -H xxx.xxx.xxx.xxx -v -C /root/bin/checkSomeThing.sh 300 500

Running the same thing as the nagios user fails (the script gets called, but doesn't get the parameters):

[nag...@monitoring ~]$ /path/to/libexec/check_by_ssh -l root -H xxx.xxx.xxx.xxx -v -C /root/bin/checkSomeThing.sh 300 500

I already checked /etc/passwd, but apart from the user names, user IDs, group IDs and home directory paths, everything looks the same: both have /bin/bash as their login shell.

Any hint?

Cheers,
Mario
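Separately from the known_hosts question, it may be worth checking how the remote command is quoted: check_by_ssh takes the whole remote command line as the single value of -C. The sketch below keeps the script and its two thresholds inside one quoted string. The command and service names, the host name and the generic-service template are hypothetical, and this is only a guess; it may not explain why root and nagios behave differently.

```
# Sketch only: object names are assumptions. The remote script and both
# thresholds sit inside one quoted -C value so they travel together.
define command {
    command_name    check_something_by_ssh
    command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -l root -C "/root/bin/checkSomeThing.sh $ARG1$ $ARG2$"
}

define service {
    use                     generic-service
    host_name               remote-host1
    service_description     Some thing
    check_command           check_something_by_ssh!300!500
}
```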