[Nagios-users] How many hosts and services are you monitoring with Nagios?
Hi, I like it when people periodically post numbers and architecture summaries, I am guessing with the distributed frameworks out now for Nagios this thread might be seeing bigger numbers than past threads have. With our custom-built distributed Nagios-based monitoring system, we are currently monitoring 18000+ hosts every 5 minutes and 100k+ active services (plenty of passive services in addition to the actives) every 5 mins as well. We collect performance data from every check as well and pass that on to a highly distributed and scalabe time-series data warehouse another team in our organization has built (which is why we have the 5 min interval requirement) We also do trap ingest using SNMPTT with a few custom mods, but not going to include those numbers as they never have required the optimizations the polling has required. This isn't a monolithic instance, we have 6 projects using instances of our distributed Nagios-based software, called Racon (soon my manager will give our team to package it as open source - so I hear at least). We built it on core Nagios with a custom database layer based on a very very early version of Merlin's database abstraction layer (thank you Andreas!) - we have a custom client/server network-based notification framework in use (we will release that as well) along with a custom NEB/perl based client-server framework (also releasable, just need time scheduled) for sending and processing performance data - the performance and notification framework are both horizontally scalabe and network fault tolerant. What kinds of numbers of hosts and services are you all monitoring? Which add-ons / distributed frameworks are you using? -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?
On Thu, May 17, 2012 at 11:57 AM, C. Bensend be...@bennyvision.com wrote: Yes, I know, VM bad. :) Just not bad enough to spend real dollars on more physical hosts. Nothing bad about using a VM, they just fall over horribly (generally speaking) when you try to push the virtual machine's virtual CPU cores and disk hard :p - kudos to you for making that work and pretty interesting setup! Thanks for sharing. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?
On Thu, May 17, 2012 at 12:19 PM, Mike Guthrie mguth...@nagios.com wrote: Had to reply to this one since I actually studied Max's blog articles for ideas when I was researching performance tuning. Thanks Max for not only pioneering some of these ideas, but actually documenting what you did. ; ) http://www.semintelligent.com/blog/articles/38/nagios-performance-tuning-early-lessons-learned-lessons-shared-part-4-scalable-performance-data-graphing Wow! Forgot about that, glad the ideas in it helped you - Nothing pioneering (though thank you for saying that), I culled work from others and my teammate and I then took that and built on it. I am hoping we can release our network performance data sending framework first, which was initially based on the PNP modpnpsender module and has now been re-written from scratch by several of us based on Andreas's test NEB module that showed how to use the NEB module to write to a pipe with a process listening to the pipe in order to minimize time spent in Nagios and therefore latency added by the NEB-based add in. Nagios XI Benchmarking Test Box - Running on one of our old Desktops: 1 Dual Core CPU, 4GB of RAM, CentOS 5.7 - Ndoutils/MySQL offloaded to a second server - 1800 Hosts, 8200 services, 10k checks running every 5mn - Processing perfdata for all checks - rrdcached in use with PNP - RAM Disk being used for status.dat, objects.cache, host-perfdata, service-perfdata, all spool and tmp directories Nice! -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_load -w $ARG1$ -c $ARG2$
Load average is a relative measure of how much work over time a system is doing. Read these to start: http://www.linuxjournal.com/article/9001 http://www.teamquest.com/pdfs/whitepaper/ldavg1.pdf http://www.teamquest.com/pdfs/whitepaper/ldavg2.pdf -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Centralized Dashboard
+1 for Thruk + multi-site - well done project, easy to set up and easy to replicate a set up across multiple hosts. I haven't tried it with a large Nagios installation. Query capabilities are very cool as well. Max On 2/8/12, Joerg Linge pitchf...@ederdrom.de wrote: Am 08.02.2012 um 14:40 schrieb trm asn: Dear Friend, I have 7 Nagios core server running at 7 different location. I have installed MNTOS to get the consolidated tactical views. Is there any other way to do so. Yes! mathias-kettner.de/checkmk_multisite.html http://thruk.org/ Joerg -- Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Optimization on CentOS
I have written a number of blog posts about optimizing and tuning Nagios performance as well - you might find some of them useful: http://www.semintelligent.com/blog/ - Max -- Write once. Port to many. Get the SDK and tools to simplify cross-platform app development. Create new or port existing apps to sell to consumers worldwide. Explore the Intel AppUpSM program developer opportunity. appdeveloper.intel.com/join http://p.sf.net/sfu/intel-appdev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Feeding status information to SCOM?
There is a commercial Nagios to SCOM connector, we use it at my place of work - avoid it and work on a custom solution (which is the path you are on) - the commercial connector is awful * Only allows for one Nagios system to send to SCOM using SNMP traps * Does not translate statuses properly * Does not allow for event chaining - Max -- Write once. Port to many. Get the SDK and tools to simplify cross-platform app development. Create new or port existing apps to sell to consumers worldwide. Explore the Intel AppUpSM program developer opportunity. appdeveloper.intel.com/join http://p.sf.net/sfu/intel-appdev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Average Check latency and execution time growth - 3.2.3
What minor RHEL rev are you running? We had one poller that was running RHEL 5.3 that had constantly increasing latency - a Compaw / AMD based host. None of the optimizations / configuration changes we made to the other pollers we ran at the time seemed to help this one - we updated the poller in-box from 5.3 to 5.4 and voila - issue gone. As Joerge mentioned, probably was a memory leak / bug in a library the parent Nagios poller process was using, we never did determine which one and we haven't hit that same issue since then with any 5.4 or 5.5 pollers. Even with stable software we end up bouncing our pollers every 2-3 days - 1) because we have an active customer base who make config changes often and 2) because we take the metrics from the checks and put them in a time series data warehouse that is sensitive to interval skew...any poller that hits 10 seconds latency has to be bounced. We are at 12 pollers or so right now and we will be up to almost 20 by next year at this time. Max On 10/2/11, Stuart Browne stuart.bro...@ausregistry.com.au wrote: Hi, I know this topic has been covered many times, but I've tried those tweaks and I have the remaining issue. After a few days, the latency on checks explodes. It goes along quite happily with small values, then after (about) 3 days, the values rise quite sharply. I've recently been graphing performance statistics (nagiostats, mrtg) and as you can see by the two attachments (day, week), it's rather surprising. We restart Nagios every few days (for other reasons) so thankfully the issue never gets completely out of control, but as you can see, it gets a bit crazy. I can't think of any combination of settings that would cause such growth after such a long period of time. Does anybody have any knowledge as to why it would suddenly increase after running for days without issue? Basic Nagios system stats: 2 x dual-core Xeon 5160 (3Ghz) 6GB Memory 4 x SAS, RAID1 (hardware, BBU, LVM over RAID1) RHEL5, fully patched Load average between 0.5 and 3.2 'nagios -s /etc/nagios/nagios.cfg' output (trimmed): HOST SCHEDULING INFORMATION --- Total hosts: 252 Total scheduled hosts: 252 Host inter-check delay method: SMART Average host check interval: 300.00 sec Host inter-check delay: 1.19 sec Max host check spread: 30 min First scheduled check: Mon Oct 3 14:31:17 2011 Last scheduled check:Mon Oct 3 14:36:15 2011 SERVICE SCHEDULING INFORMATION --- Total services: 1575 Total scheduled services: 1386 Service inter-check delay method: SMART Average service check interval: 878.40 sec Inter-check delay: 0.63 sec Interleave factor method: SMART Average services per host: 6.25 Service interleave factor: 6 Max service check spread: 30 min First scheduled check: Mon Oct 3 14:33:43 2011 Last scheduled check: Mon Oct 3 14:48:21 2011 CHECK PROCESSING INFORMATION Check result reaper interval: 5 sec Max concurrent service checks: Unlimited PERFORMANCE SUGGESTIONS --- I have no suggestions - things look okay. Stuart J. Browne Senior Linux Administrator -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2dcopy2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Observing latency
RAM disk for the NPCD queue + rrdcached is a necessity for scaling a PNP backend - PNP in asynchonous mode is a must as well, preferrably on a server other than the primary pollers - for example, use merlin to distribute events and then designate one server or more to purely processing PNP's RRD-based graphs. - Max -- Special Offer -- Download ArcSight Logger for FREE! Finally, a world-class log management solution at an even better price-free! And you'll get a free Love Thy Logs t-shirt when you download Logger. Secure your free ArcSight Logger TODAY! http://p.sf.net/sfu/arcsisghtdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Look soku89
04y10q5o5qqq. http://ianscott.biz/wp-content/uploads/img/oafv.html 79o9xjv 4q6rlp n1nprps0, 6a8svdq kh7uwj. fpiwe quqcnwb2a 6t5e5e. -- Thanks, Max Alex Voip Developer -- uberSVN's rich system and user administration capabilities and model configuration take the hassle out of deploying and managing Subversion and the tools developers use with it. Learn more about uberSVN and get a free download at: http://p.sf.net/sfu/wandisco-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Help required on Nagios Setup
On Tue, Jun 14, 2011 at 10:28 PM, sunilang...@rediffmail.com wrote: Hi All Let me introduce myself. My name is Sunil Angane and I am working with Softenger (I) Pvt. Ltd., an IT Infrastructure Management Services company. We are doing Nagios setup at one of our prestigious client in Mumbai to cover OS and DB monitoring. As we are doing this for the first time, we need guidance from you all on this. Under OS part we are covering AIX, LINUX, SOLARIS and WINDOWS, under DB part ORACLE, SQL, SYBASE and DB2. Kindly share the installation / configuration documents related to above listed OS and DB flavors. Kindly let me know if anyone from negios user is residing in Mumbai or India so that it will be easier for me to contact. Do you really expect the community to execute your contract for you? Interesting! Start here! http://wiki.nagios.org/index.php/Main_Page If you have directed questions that show you have done some reading and educated yourself you will get useful answers from the group. - Max -- EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Is the nagios log file keeps growing over time
On Fri, Jun 10, 2011 at 2:59 PM, Manish Kumar manikuma...@gmail.com wrote: Hello Frnds, I have got a concern, Will the nagios log file (nagios.log) in /usr/local/nagios/var/nagios.log will keep increasing in size or there is some auto-rotation facility. If it is there how it can be done. http://nagios.sourceforge.net/docs/3_0/configmain.html#log_rotation_method - Max -- EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] [Nagios-devel] Q: Service Escalation Recovery Notifications.
Hi, On Thu, May 19, 2011 at 10:10 AM, Andreas Ericsson a...@op5.se wrote: On 05/19/2011 03:32 PM, Paul M. Dubuc wrote: OK, but wouldn't it be nice if all contacts who got an error notification were able to get the recovery message instead of just the one last notified? Is there any way to do that? Setting up an explicit serviceescalation for recovery notifications doesn't seem to work. Max Schubert is working on a patch that does something similar to that. If he doesn't complete it, I might take a look at adding it myself. I will send out my partial patch to the list sometime today along with an explanation of my thinking / approach for it - feel free to use it or discard it as you see fit :)! - Max -- What Every C/C++ and Fortran developer Should Know! Read this article and learn how Intel has extended the reach of its next-generation tools to help Windows* and Linux* C/C++ and Fortran developers boost performance applications - including clusters. http://p.sf.net/sfu/intel-dev2devmay ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification configuration (Was [Nagios-devel] RFC/RFP: Service parents)
Andreas, On Tue, May 17, 2011 at 7:57 AM, Andreas Ericsson a...@op5.se wrote: Any plans to detatch notification attributes from service / host definitions in 4.x and make them their own top-level configuration class like escalations to make it easier to scale notification definitions for large projects? Not really. What would such an object look like? How would it add additional benefit compared to using templates for hosts and services? I think if I could just see some sort of example definition of it I'd get an inkling of why some seem to think it's such a great idea. Right now, I see no additional benefit to it. It would look just like an escalation. What doesn't work well for large configurations with notification policies being stuck into host and service objects is this scenario (which is the one we are in at work by design): * Multiple configuration editors who own various parts of the Nagios configuration tree - in our case this used to be one big tree, now we have set up separate trees for separate projects - we have about 20-30 people who can edit their project-specific configurations. * A set of services that are global in nature - service - hostgroup - host - baseline monitoring required by all projects using standards established by multiple organizations in our company - for our example, base host monitoring with an SNMP agent (6 services across every host) - we have other global services as well and a core team who develop, maintain an augment both our distributed Nagios software and these global services and configurations * A set of services that are specific to each project using our distributed variant of Nagios - managed by subject matter experts on each team. With this scenario, how do we let each group that is responsible for hosts that have these global services on them create individually tailored notification policies since there is one notification policy per service? * We configure our base service and host to 'notify' on every state change using the command name do_nothing * We created a custom patch so that when the string 'do_nothing' is seen in the command name this state change only increments the notification count - it does not trigger any external command to run * We created a patch (partial - no serialization to disk) for escalation logic that tracks in memory when a fault escalation was sent so that OK escalations are only sent in response to something that was in a fault state. We are working on completing this patch so that across restarts the state is saved. * We have all groups use escalations to define their notification policies - the service and host notification commands then trigger our distributed pollers to send escalation requests to a network-based notification service we have that then lets the notification requests trigger email, SMS, SNMP traps, etc without having to re-configure Nagios for every notification transport /. method change. Yeah, it is very ugly, and why? Because 1 notification policy per service, that doesn't scale well when taking advantage of service - hostgroup - host mappings, which is a critical pattern to use when scaling a configuration. We have over 9000 hosts being monitored by our distributed framework (and growing) with around 30 configuration editors and 120+ users. Our distributed framework was centralized and a ''one project for all but now is a cluster of distributed set ups, one distributed set up per project, which is scaling nicely. Our largest distributed installations have 3900 and 5100 hosts in them respectively - we have 4 other distributed instances that are just getting ramped up and only have a few dozen hosts apiece at this point. So while this is ugly, it works! All editors can define escalation objects that take into account both their individual needs for global service notifications as well as any project-specific notifications - and by putting project-specific hosts in project-specific host groups, for most groups, two escalation policy definitions are all that are needed per project - one for hosts, one for services. If all notifications were just done through an escalation like configuration object, life for a big project would be much easier. 1) Having notifications clearly separated as their own configuration template in the Nagios DSL makes it much less confusing for people new to Nagios to understand 'where to configure notifications' 2) The configuration flexibility of the escalation template makes it very easy to work with for a large configuration. Our global and project specific scenario and all the notification changes we made is also serving us very well as we grow. Notifications as separate objects would let us back out a number of patches and would reallly simplify our configuraiton and let our pollers run hotter . - Max -- What Every C/C++ and Fortran developer Should Know! Read
Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents
Andreas, How does this differ functionality from the current ability to describe dependencies with servicedependency? Is it just easier to define (I agree that would be a win) -- Achieve unprecedented app performance and reliability What every C/C++ and Fortran developer should know. Learn how Intel has extended the reach of its next-generation tools to help boost performance applications - inlcuding clusters. http://p.sf.net/sfu/intel-dev2devmay ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents
On Tue, May 17, 2011 at 5:48 AM, Andreas Ericsson a...@op5.se wrote: On 05/17/2011 11:34 AM, Max Schubert wrote: Andreas, How does this differ functionality from the current ability to describe dependencies with servicedependency? Is it just easier to define (I agree that would be a win) It's just easier to describe, and it ties in nicely with different additions to the configuration syntax being planned. It does have some limitations, but in 99.5% of all dependencies I've seen, it matches the usecase people put dependencies to so it should be quite a nice win anyway. I agree - the current syntax is very flexible but understanding how to implement it properly for some users results in a very painful learning curve. Any plans to detatch notification attributes from service / host definitions in 4.x and make them their own top-level configuration class like escalations to make it easier to scale notification definitions for large projects? - Max -- Achieve unprecedented app performance and reliability What every C/C++ and Fortran developer should know. Learn how Intel has extended the reach of its next-generation tools to help boost performance applications - inlcuding clusters. http://p.sf.net/sfu/intel-dev2devmay ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Opsview 3.9 -- Configuring the Check OpenManage Plugin for dell servers
On 03/30/2011 02:00 PM, m...@mnnwahl.com wrote: Has anyone had success is getting this to work? I have never gotten this work and would love to monitor the Dell openmanage parts in opsview. I'd suggest contacting the Opsview community forum, as this really isn't a Nagios issue. Regards, Max -- Create and publish websites with WebMatrix Use the most popular FREE web apps or write code yourself; WebMatrix provides all the features you need to develop and publish your website. http://p.sf.net/sfu/ms-webmatrix-sf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation
Please respond to the list with the version of Nagios you are using and an example contact and/or serviceescalation definition that you are using. - Max -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation
Have you tried removing the trailing commas from the contact attribute 'members' for the two contactgroup defintions? - Max On Mon, Feb 28, 2011 at 4:47 PM, Chad Rhyner crhy...@box.net wrote: On Mon, Feb 28, 2011 at 12:27 PM, Max Schubert m...@webwizarddesign.com wrote: Please respond to the list with the version of Nagios you are using and an example contact and/or serviceescalation definition that you are using. - Max Certainly. The version I am using is 3.0.6. Here is a copy of the output: Nagios 3.0.6 Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org) Last Modified: 12-01-2008 License: GPL This is being run on a 64-bit Linux machine. Here is an example of the serviceescalation definition: define serviceescalation{ hostgroup_name sms service_description HTTP first_notification 1 last_notification 2 notification_interval 10 contact_groups oncall,email } Here are the contactgroups related to that serviceescalation: define contactgroup{ contactgroup_name oncall alias Oncall members user1,user2, } define contactgroup{ contactgroup_name email alias Email members user1_email,user2_email,user3_email,...usern_email, } For both contactgroups oncall and email, there is a contact defined for each user that is named. I have changed the usernames for privacy reasons. ~Chad -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Which GUI to configure Nagios 3 ?
We do have 60+ sysadmins using a shared instance of Nagios without a configuration UI - all Nagios configuration files are in SVN, current we monitor around 5300 hosts. We found with a little training the sysadmins were happy to take the hit on learning in order to have the flexibility of the full power of the config language. We avoided implementing or integrating a web-based UI because all of them take away various capabilities of the language - we did not want to treat our audience like dummies, so instead we went the path of providing training and mentoring to them and empowered them to use the language themselves. Was this a lot of work for us, especially when we started? YES. Was it an approach that respected our user base and gave them the ability to do what they needed to do with minimal intervention from us? YES. - Max On 2/24/11, Breandan Dezendorf brean...@dezendorf.com wrote: On Thu, Feb 24, 2011 at 1:54 PM, Ennis Ibarra en...@nmcourts.gov wrote: My two cents: Once you lock up your configuration to a given tool, then you stick with the tool capabilities to configure stuff. The real power is understanding the Nagios structure and files, leave the GUIs for presentation only. I agree - but when you're trying to serve a large community of sysadmins, a GUI tool is a useful tool for getting the job done. I can't teach 60+ admins how to write clean nagios configs, but I can teach them to use web-based tools. -- Breandan Dezendorf brean...@dezendorf.com bwdez...@gmail.com -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_hpasm and check_openmanage over nrpe in windows
you can configure the snmp agent to send snmp v3 traps with encryption. On 2/25/11, Assaf Flatto nag...@flatto.net wrote: the main issue is that SNMP is a clear text protocol and no way to encrypt the date , while you can encrypt the NRPE traffic with SSL . Assaf Alexandre wrote: hello, I think using nrpe is not a good idea. I use snmp Trap of dell openmanage for alert in Nagios. Alex On 25/02/2011 12:38, Assaf Flatto wrote: Hello All I have encountered an issue with wanting to monitor HP and Dell servers running windows OS . The main issue is that due to security issue we can not use the NSClient++ internal functionality , but use the NRPE module option . I have not used the nrpe on windows boxes extensively before and wanted to know if anyone has deployed the check_hpasm and check_openmanage on windows boxes where the nagios server can only access the NPRE port and not have SNMP access direct to the server ? Thanks Assaf -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Multi-site frontend options
Definitely check out Merlin - through configuration you can have Nagios instances communicate events with each other using a few types of relationships and you can then serialize the data from an instance to a database for either using their custom UI or making one of your own. http://www.op5.org/community/plugin-inventory/op5-projects/merlin When we used it back in early 2010 it was not quite ready with regards to event distribution but I would think it is more mature by now - definitely worth spending some proof of concept time on to explore as an option. We have been using the database serialization portion of Merlin for the last 1.5 years, we have made mods for it to meet our purposes (hope to share some of that code soon on our github account - one cool thing we added was threaded MySQL connections from Merlind to the database which increased throughput tremendously). - Max -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NAGIOS does not register Critical | Warning status.
There is a debug level you can set in nagios.cfg that will show you all commands being executed - an easier way would be to make a shell wrapper for check_int that logs all args and then execs them with the real check_int. Something like 1. Copy check_int to check_int.bin 2. Make check_int a shell script like !#/bin/bash echo $* /path/tlo/logfile.log exec /path/to/check_int.bin $* Now whenever check_int is called, the full command line and args will be appended to the log each run for you to inspect - without cranking up debug or restarting nagios. Max On 2/22/11, socomm soc...@crowbar.cc wrote: On Mon, 21 Feb 2011 20:23:54 -0500, Allan Clark all...@chickenandporn.com wrote: Can you confirm that %% is needed for a single % token? I don't remember that being needed, but I don't use % in my commands. I could see $ needing doubling up due to token replacement. Apparently check_nt will returns same code whether I pass it the '%' sign or not, however the double '%' was a typo which I've corrected. Also, see about increasing logging, see if you can log what the actual command is running as, and what it responds as a result code. Maybe proving that the check_nt script is being called in the way you think it is can help bisect the problem. I am looking for a way to log commands, but best I've found is the debug option which will give me option to log commands to a debug file. However, the output I get is the following: - OUTPUT - Raw Command Input: /usr/lib/nagios/plugins/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ Expanded Command Output: /usr/lib/nagios/plugins/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ Which I'm not sure why variables are not being expanded to the actual arguments being passed. Anyone more familiar as what I may need to do to view the expanded variables being passed? -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Free Software Download: Index, Search Analyze Logs and other IT data in Real-Time with Splunk. Collect, index and harness all the fast moving IT data generated by your applications, servers and devices whether physical, virtual or in the cloud. Deliver compliance at lower cost and gain new business insights. http://p.sf.net/sfu/splunk-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios::Plugin and Nagios Embedded perl interpretor
Marc, On Tue, Feb 8, 2011 at 6:16 AM, Marc-André Doll m...@b-care.net wrote: Hi list, I'm developing some Nagios plugin with the Nagios::Plugin perl module and it's saving me a great amount of time. However, I encountered a little problem when I tried to activate the embedded perl interpretor in Nagios: all checks returned an UNKNOWN state with an error from the ePN saying that an error occurred in the Getopt::Long perl module about the use of an uninitialized value line 459. I suppose the problem come from my usage of the add_arg() function but I don't know how to correct it. Does somebody have already encountered that kind of behaviour or have an idea on how to debug a perl script when it's launched by the ePN? Check out your command lines, especially if you use custom attribute substitution in them - look for attributes that are empty (no spaces) - that gets translated into a NULL or some other non-printable character that Getopt::Long chokes on http://www.semintelligent.com/blog/articles/50/epn-why-do-i-get-an-unitialized-value-error-message-from-my-getopt-longpm-when-nagios-runs-my-perl-based-plugin-under-epn - Max -- The ultimate all-in-one performance toolkit: Intel(R) Parallel Studio XE: Pinpoint memory and threading errors before they happen. Find and fix more than 250 security defects in the development cycle. Locate bottlenecks in serial and parallel code that limit performance. http://p.sf.net/sfu/intel-dev2devfeb ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Backing up Nagios on Ubuntu 10.04
On 01/05/2011 05:39 PM, Max Schubert wrote: While a live back up is definitely a good idea, if you start storing your configuration tree in a version control system off server - CVS, GIT, SVN, any other one you choose - then the back up issue on the live server only becomes one of backing up retention.dat, which has changes you / your users have made to the states of notifications, flapping, etc from the CGIs or via the named pipe. That's what we do at my employer, that is use SVN for the Nagios configuration files. I still backup all data off-site for backup purposes, though. Where SVN comes into play for us is when every change has been made for Nagios, there is a comment on the commit. This allows us to easily navigate back knowing what changes were made, by whom, and what the date was when the change occurred to the configs. I'd still opt for backup of all the system files off-site, however, and depending on your company, this may be policy for disaster recovery anyways. But SVN can also serve as backup too. You can never have too many, right. :) Regards, Max -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to Monitor VMware ESX Server and Storage Units
On 01/05/2011 10:28 AM, Marc Haber wrote: Are there other possibilities of connecting Nagios to ESX and MSA devices? I'd appreciate your input. Have a look at the check_esx.pl plugin. You can do quite a bit with it. http://www.op5.org/community/plugin-inventory/op5-projects/check-esx-plugin It just requires the VMware Perl SDK kit to be installed on the machine you want to connect from, in this instance, Nagios. Regards, Max -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Backing up Nagios on Ubuntu 10.04
While a live back up is definitely a good idea, if you start storing your configuration tree in a version control system off server - CVS, GIT, SVN, any other one you choose - then the back up issue on the live server only becomes one of backing up retention.dat, which has changes you / your users have made to the states of notifications, flapping, etc from the CGIs or via the named pipe. - Max -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URL monitoring
On 01/03/2011 10:01 PM, anish kumar wrote: Hi folks, I just want to know how to monitor urls using nagios.i.e running web applications under tomcat. can any one pleasAe provide me links or docs that narrates,how to configure nagios in a way to monitor our web applications. we already configured nagios for monitoring remote servers load,login details, disk space etc.. But we could not find any documents for configuring nagios in order to monitor webapplications. Your valuable suggestions please. Look around Monitoring Exchange. Here's two URL checks after a quick search on check_url. https://www.monitoringexchange.org/search?query=check_url Regards, Max -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Alert host down with passive check
Read up on freshness checking: http://nagios.sourceforge.net/docs/3_0/freshness.html max On 12/10/10, Rikard Dahlberg ej_...@hotmail.com wrote: Heya guys! Is there any possible way to configure nagios to report host as DOWN, if nagios havn't got any passive checkresult within like 5 minutes? Can I change that somehow? For now, when a host dies in my current configuration, it actually doesn't report is as DOWN, since im using passive checks. But i want to set nagios to if i havn't got a new passive.-check result in 5 minutes, i want nagios to automaticly set it as DOWN... Any idea please? :) Regards Rikard -- Sent from my mobile device -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] low-cost snmp-enabled temperature sensor?
On 12/09/2010 01:16 PM, Furnish, Trever G wrote: Can anyone recommend a low-cost external temperature sensor that doesn't require the buyer to break out a soldering iron? Preferably it would be SNMP-enabled so I can poll it from anything. Power-over-ethernet would be great too. The least expensive snmp-enabled sensor I've found so far is this one at 195 USD: http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm I've also noted probes like this one at 15 USD: http://www.ibuttonlink.com/t-sense.aspx ...but the systems in the site where I'd hook this up are all Windows, so I'm not sure how I'd get from there into Nagios. I use these guys here. http://store.enviromon.net/cart.php?target=productproduct_id=255category_id=78 They are a little more expensive, but are ethernet connected, and have no issues using SNMP to incorporate into Nagios. We have two sensors hooked up to it, one is temp/humidity and the other is a water sensor. Regards, Max -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] high latency
I find it interesting that a number of users get performance improvements with embedded perl off - we lose 20-40% polling capacity perl poller with it off. - Max -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] high latency
Latency increases much more quickly for us without epn as execution times are noticably longer per check. We use rhel 5.x, so the perl is 5.8.8. We have semi dailoy updates to our pollers and with epn that means cold restarts - memory leaks have not been noticable given that scenrio, but on test hosts or hosts where we are doing burn ins it is negligable enough that we can go for 2-3 days with no memory issues - we always hit service latency thresholds first. 7 seconds is in general where we have to force a restart of our pollers to prevent metric collection and snmp delta calculation issues. Max On 12/3/10, Andreas Ericsson a...@op5.se wrote: On 12/03/2010 12:46 PM, Max Schubert wrote: I find it interesting that a number of users get performance improvements with embedded perl off - we lose 20-40% polling capacity perl poller with it off. How do you mean that you're losing capacity? Does latency start to creep upwards or is load increasing? Out of interest; How much memory does epn leak nowadays, and which perl version is it compiled against? -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Check size of large directory
On 12/02/2010 10:31 AM, Sebastian Ries wrote: Does anyone know about a plugin that can do this? Check out the check_file plugins from the nagios-of-plugins. http://www.openfusion.com.au/labs/nagios/ I'm pretty sure it will work on a directory the same as a file. Should be something like: check_file -s -30MB -f /path/to/dir Regards, Max -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How can i show remote nagios results in one interface?
How about Thruk? http://www.thruk.org/ - Max -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
Thank you, Andreas - I actually really enjoy writing tests and understand their value - will see if I can put a few in place using tap over the next week or so and update the patch with them. On 11/29/10, Andreas Ericsson a...@op5.se wrote: On 11/29/2010 09:20 AM, Andreas Ericsson wrote: This looks nice. It's in my compile and test queue right now, so assuming it works out ok it'll be committed before the week is out. And now it's out there. More testing would be much appreciated though, but considering the patch is fairly well written I have few qualms about it. Cheers, and thanks again. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
Hi, Long overdue and anxiously waited on by at least 3 people :), here is the patch, diffed against the CVS HEAD (as of tonight) revision of Nagios, with a documentation patch for configmain.html as well (as requested by Andreas): The option is now named allow_empty_hostgroup_assignment Thanks to Dan Whittenberg and Michael Friedrich for their input on the option name and the patch itself - thanks to my managers at Comcast (a great place to work as a developer) Eric Scholz and Mike Fischer for allowing the team I am on to contribute code back to the community, and thanks to the team I am on for their hard work and dedication with our distributed Nagios set up and the many other projects we are involved with. Andreas, please let me know if you'd like any further modifications or documentation for the patch beyond what is provided. https://github.com/netoisstools/Nagios-3-Patches/blob/master/CVS-HEAD-201011292220/201011281330-allow_empty_hostgroup_assignment.patch - Max -- Increase Visibility of Your 3D Game App Earn a Chance To Win $500! Tap into the largest installed PC base get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] More than one command defined
We went through a couple of rounds of changes with this: * We first hacked our copy of process_perfdata.pl and added logging to it, then built a relay to our long term trending data warehouse for relaying Nagios perfdata to it after allowing PNP to process it for short term views of the data Nagios - PNP NEB - network - performance data server - NPCD queue - process_perfdata.pl - log - relay - network - data warehouse * Now we have a POE-based solution in place that processes the perfdata for PNP and our relay as two data driven events Nagios - PNP NEB - network - performance data server - POE server POE server - RRD files (replaces npcd and process_perfdata.pl) POE server - network - data warehouse network listener For both methods we built off of the (at the time very young) modpnpsender.c NEB module source included with PNP. ; we then migrated to a NEB modules who's code was based on some NEB code Andreas wrote that showed how to write to the input of a second external program over a pipe + an external perl script that relayed the data to the destination servers over the network (or queued if the network was down). Code will be available open source this year for people who are interested. - Max -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today http://p.sf.net/sfu/msIE9-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
Will try to get this done this weekend - sorry all, it is an irony to be working for a company that allows us to release code from work to open source and be so busy at the job that I rarely have time to do so :). My apologies for the delay - and thanks for your input, help, and feedback on the patch, Dan. Glad you find it useful. - Max On Fri, Nov 19, 2010 at 10:40 AM, Daniel Wittenberg daniel.wittenberg.r...@statefarm.com wrote: I'll check with the original author of the patch...I re-rolled it already against head and tested great, been a lifesaver. I sent the updates to him so he could post and move forward with so if he's not had time I'll see about submitting directly. -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today http://p.sf.net/sfu/msIE9-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notifications on passive service checks
Escalations can resolve this - if you set an escalation to trigger at notification 1 and set a re-notify interval in the escalation you will be re-notified at the set interval while the service remains in the configured alarm state. - Max On Tue, Nov 16, 2010 at 4:41 PM, Alberto Menichetti alb.meniche...@tai.it wrote: Hi all, I noticed the same strange behavior, but I don't think it's the right behavior. Operating in this way, a linkDown trap will be notified only once (in fact the sender device will generate a single trap in response to state change). Is it possibile to modify this behavior? Hall, JC wrote: After some testing, it looks like it will only re-notify after receiving another passive check result. It won't simply re-notify because it's still in a non-ok state after the notification_interval has expired. So to combat this I just used the check freshness attribute to re-execute my external script and feed the passive check result into nagios and thus re-sending a non-ok notification at what would have been the interval for notifications. So technically my external scripts are running at every interval to check the freshnes, not only when it's called for by my event_handler from another active service check... which I'm ok with. -Original Message- From: Andreas Ericsson [mailto:a...@op5.se] Sent: Friday, November 12, 2010 5:00 AM To: Nagios Users List Cc: Hall, JC Subject: Re: [Nagios-users] Notifications on passive service checks On 11/11/2010 11:27 PM, Hall, JC wrote: Is it accurate that Nagios will only send 1 notification for a passive service check? IE, the notification_interval definition for a passively checked service won't instruct Nagios to re-send a notification such as with actively checked services? To be honest, I haven't got the faintest idea. An educated guess is that it will re-send the notification if it receives another passive check-result and enough time has passed though, or that it simply re-sends the notification when enough time has passed. If you try and find out, let me know either way and I'll amend the docs. -- TAI S.r.l. Alberto Menichetti Area Mercato - Ingegneria dei Sistemi System Engineer 50141 Firenze - Via Pazzagli, 2 Voice: +39 055 42661 - Fax +39 055 4266356 56125 Pisa - Viale Gramsci, 12 Voice: +39 050 220221 - Fax: +39 050 24421 e-mail: alb.meniche...@tai.it http://www.tai.it --- COMUNICAZIONE AI SENSI LEGGE 196/03 Il presente messaggio di posta elettronica viene inviato al Vostro indirizzo email, che abbiamo acquisito da Vostre Visite, da incontri commerciali, elenchi di pubblico dominio, Vostre precedenti comunicazioni. Il Vostro dato in questione e' in possesso di TAI S.r.l., che lo ha immagazzinato in formato elettronico. Tali informazioni non saranno divulgate a terzi. Se desiderate verificare, cancellare o modificare i dati in nostro possesso, inviate fax al numero 0554266356. -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today http://p.sf.net/sfu/msIE9-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today http://p.sf.net/sfu/msIE9-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
Will do my best to do this this week - if I don't get to it this week I will forget about it for sure, all the patches our team has done to make Nagios (some very very specific to our env) work for us will be available on our team github account; will do my best to get that updated this week as well and post that url along with the patch / docs to this list or I will forget to do that for another few months too. Max On 11/3/10, Andreas Ericsson a...@op5.se wrote: On 11/02/2010 04:54 PM, Max Schubert wrote: allow_empty_hostgroups = 1 I'll take this, but rename the option to something more mnemonic, as empty hostgroups are already allowed and it's a bit ambiguous what it means. Any chance you could re-roll the patch against latest CVS head and include a brief documentation update? Name the var allow_empty_hostgroups in the docs so I can find it when I sed it in. If you want any special attributions (company or person thanks, beyond the addition of your name to the THANKS file and your name and email in the commit log message), let me know. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Achieve Improved Network Security with IP and DNS Reputation. Defend against bad network traffic, including botnets, malware, phishing sites, and compromised hosts - saving your company time, money, and embarrassment. Learn More! http://p.sf.net/sfu/hpdev2dev-nov ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
On Tue, Nov 2, 2010 at 10:52 AM, Fredrik Barrud fred...@barrud.se wrote: Hi, I would also like to be able to create empty hostgroups with services assigned. The reason for this is that I've setup a nagios installation for a customer where a host is only member to one hostgroup to inherit services from other hostgroups, like all-windows-citrix-server and for a few special servertypes I would like to prepare the chain with hostgroup and services so that when the servers are ready to be monitored everything is prepared and the servers only needs to be added to the apropriate hostgroup. We have a local patch to turn the error into a warning, patched against 3.2.0 - we needed this because we have customers who can use host - hostgroup - N related services mappings but we do not control their configs, we only define the global service sets - so we could not have this be fatal and fail everyone's changes just because one customer stopped using the mapping. We have been using this patch in production for a year without issues. To apply: 1) Put patch in top level source directory 2) patch -p0 name-of-patch - Max 20101026171548-allow_empty_hostgroups.patch Description: Binary data -- Nokia and ATT present the 2010 Calling All Innovators-North America contest Create new apps games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroup with no members - Enhancement
To activate the downgrade from error to warning, put allow_empty_hostgroups = 1 in your nagios.cfg after compiling Nagios with the patch in place. - Max -- Nokia and ATT present the 2010 Calling All Innovators-North America contest Create new apps games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Detecting Windows Backups?
On 10/12/2010 08:16 AM, Kevin Keane wrote: I’m wondering if there is a way to detect if a Windows backup is currently running. Every night during the backup window, my checks for disk and CPU activity go haywire. That is expected, of course. What I’m looking for is a way to get Nagios to either automatically schedule a downtime when the backup starts, or to use a service dependency or something like that to suppress these false alarms. Does your backup run at the same time every night? Max -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today. http://p.sf.net/sfu/beautyoftheweb ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Detecting Windows Backups?
On 10/12/2010 08:16 AM, Kevin Keane wrote: I’m wondering if there is a way to detect if a Windows backup is currently running. Every night during the backup window, my checks for disk and CPU activity go haywire. That is expected, of course. What I’m looking for is a way to get Nagios to either automatically schedule a downtime when the backup starts, or to use a service dependency or something like that to suppress these false alarms. I had to dig up the link for the script I use for scheduling downtime remotely. Anyways, if you know your backup times are always the same, or always the same window, I'd just use a cron job to run this nagios_downtime script written in Perl. http://nagios.larsmichelsen.com/nagios-downtime/ Regards, Max -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today. http://p.sf.net/sfu/beautyoftheweb ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Disabling Nagios in the event of network card failure.
On 10/12/2010 07:56 AM, Kristan Webb wrote: Hi All, Hopefully an easy query. We run Nagios 3 on an Ubuntu box and I'm looking for a way to stop Nagios falsely reporting services/hosts as down/unknown if, for example, the network cable was pulled out of the server / the card failed. This has happened recently for over an hour and now all hosts/services have a false hours downtime. I'm not too bothered, but I like to try and keep things as neat as possible. Does anyone know of a way of preventing this? All I can think is some way of detecting when the server has lost network and then automatically quitting / disabling Nagios? This is pretty much the entire point of Nagios, that is to report when something has failed. If you're intentionally taking down the network on the host, then I would suggest scheduling downtime for that host if you or others know you're going to take it down for a period of time, otherwise, Nagios is doing it's job. Regards, Max -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today. http://p.sf.net/sfu/beautyoftheweb ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Deploying Nagios on Windows box
On 10/12/2010 08:20 PM, Yu Watanabe wrote: Hello all Has anyone has a experience for deploying a Nagios 3.x + apache on a windows box using Cygwin? Perhaps, Windows Server 2003 (32bit,64bit) or later? I don't think you're going to be able to do so, at least not easily, but I did run across this on Nagios Exchange. It looks very experimental, so beware, but maybe a possibility for you. http://exchange.nagios.org/directory/Distributions/Nagios-Core-32bit-Windows-Installer/details Regards, Max -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today. http://p.sf.net/sfu/beautyoftheweb ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Alleviating Nagios i/o contention problem
I like the suggestions Matthias makes; those suggestions have worked well for us. RRD updates are very expensive - I am pretty sure without knowing anything more about your system that the RRD writes are causing most of the I/O load. Our current largest Nagios-based system has around 7500 hosts and around 40k active services spread across 4 pollers - the four pollers send perfdata to two report servers that do nothing but host the trap databases for traps from SNMPTT from the pollers, RRD files / PNP web UI, and the server side of our client/server notification system. The snmptt dbs and notification server dbs are replicated master master between the two hosts. Even with rrdcached and raid 10 these hosts regularly have 3 - 10 pct I/O wait. We hope to lower that number a bit by moving the DBs onto separate dedicated DB hosts. - Max On 9/25/10, Matthias Flacke matthias.fla...@gmx.de wrote: On 9/25/10 2:30 PM, Frost, Mark {PBC} wrote: Greetings, listers, We've got an on-going issue with i/o contention. There's the obvious problem that we've got a whole lot of things all writing to the same partition. In this case, there's just one big chunk of RAID 5 disk on a single controller so I don't believe that making more partitions is going to help. On this same partition we have: 1) Nagios 3.2.1 running as the central/reporting server for a couple of other Nagios nodes that are sending check results via NSCA. Approximately 6-7K checks. 2) pnp4nagios 0.6.2 (with rrd 1.4.2) writing graph data. There's a 2nd server configured identically to the first that's acting as a hot spare so it also receives check data from the 2 distributed nodes and writes its own copy of the graph data locally as well. At the moment I'm concerned about the graphdata, but because I can only see i/o utilization as an aggregate, I can't tell what is the worst component on that filesystem -- status.dat updates? graph data? writes to the var/spool directory? We also look at continued growth so this is only going to get worse. These systems are quite lightly loaded from a CPU (2 dual-core CPUs) and memory (4GB) perspective, but the i/o to the nagios filesystem is queuing now. We're about to order new hardware for these servers and I want to make a reasonable choice. I'd like to make some reasonable changes without requiring too exotic of a setup. I believe these servers are currently Dell 2950s and they're all running Suse Linux 10.3 SP2. My first thought was to potentially move the graphs to a NAS share which would shift that i/o to the network. I don't know how that would work though and it would ultimately be an experiment. What experiences do people out there have handling this kind of i/o and what have you done to ease it? You didn't say how many of your checks create perfdata - but I assume that most of your disk I/O is related to RRD updates. RRD cached (see http://docs.pnp4nagios.org/pnp-0.6/rrdcached for PNP integration) is a good means to collect multiple RRD updates and burst write the RRD files. status.dat and the checkresults directory are always good candidates to be stored on a ramdisk, especially since they're volatile data. As a side note: status.dat on ramdisk is a pure boost for the CGIs :). I know people which also store nagios.log on a ramdisk and regularily save them via rsync onto a hard disk. My own systems with ~4000 checks and ~20.000 performance relevant data sets went down from 30% to less than 2% wait I/O with rrdcached and ramdisk use. Cheers, -Matthias -- Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Creating multiple force checks quickly
I suggest putting them all in a service group - then you can issue a force check command from the Nagios UI. Scriot is not a bad way to go either though - the Nagios externalcommand API (via the named pipe) is easy to use and well documented. Max On 9/22/10, Brian McKee brian.mc...@gmail.com wrote: Hi All I'm running an old version of Nagios (2.4) Sometimes I end up with the same service warning on multiple machines and I'd like to force a check on all of them. Using the web interface is a lot of clicks when it's more than just a couple machines. I'm thinking I should be able to gin up a for loop in bash with curl or wget to do it for me, or am I wrong? Does this exist already somewhere? Pointers in the right direction appreciated Brian -- Hey, it's your computer isn't it? -- Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Creating multiple force checks quickly
On Wed, Sep 22, 2010 at 11:58 AM, Brian McKee brian.mc...@gmail.com wrote: On Wed, Sep 22, 2010 at 9:34 AM, Max perld...@webwizarddesign.com wrote: PS Still curious about the service group thing though I looked further into this and the service group control panel does not include an 'schedule the next check of all services in this group' option, so the script sounds like the right way to go. - Max -- Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Enumerating Interfaces
What I have done in the past and currently do is that each port that is connected to another networking device is modelled as a Nagios host - the host check is then the SNMP port status and the IF stats are collected as services on the host. The parent of the port is the network device itself and that way any child devices can be associated with the port and will be marked as unreachable should the port be administratively or operationally turned down. - Max On Wed, Sep 8, 2010 at 12:22 PM, Jason Frisvold frisv...@lafayette.edu wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 09/07/2010 05:45 PM, Greg Pangrazio wrote: Well we use similar switches here. what we do is create the host then services for the interfaces and if we have port channels I use the multiple check_service_cluster for all the interfaces in the port channel. I just use check snmp for the interfaces. If you want more specifics i can give you config definitions and command defs etc. Yeah, that's kind of what I settled with so far.. Testing it out on the SAN switches and maybe I'll move that config to other switches too.. I'm using PNP4Nagios, so I wrote a custom script to check the ifOperStatus and get the traffic information on the port as well. So a single check is telling me status, throughput, errors, and discards... I am running into one problem, though, and I'm not sure how to handle it.. How are you handling parent/child for these ports, or are you? In other words, I'd like to pair the device connected to the port with the port itself. However, I'm enumerating each switch for the ports, so I don't have individual services for each device/port combo... My config looks something like this : define service { use service-standard hosts fcswitch1.example.com,fcswitch2.example.com servicegroups fc-switch-standard service_description Switch Port fc1/1 check_command check_snmp_traffic!fc1/1!-C $USER2$ } I think I'm going to have to enumerate each host/port combo, or live without parent/child... *sigh* Greg Pangrazio - -- - --- Jason Frisvold Network Engineer frisv...@lafayette.edu - --- What I cannot create, I do not understand - Richard Feynman -BEGIN PGP SIGNATURE- Version: GnuPG v2.0.14 (GNU/Linux) Comment: Using GnuPG with Fedora - http://enigmail.mozdev.org/ iEYEARECAAYFAkyHuFwACgkQO80o6DJ8UvmAsACaAugytCgxrgsePwlNbrw27Cpi rkgAn0e4leboXWricN/+5CtiC07TkUm0 =rEMU -END PGP SIGNATURE- -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios plain files vs database storage
Fyi - plain text files are less system intensive than a db - with our standalone instances we monitor about 2k hosts per poller (about 10k active checks). A db back end makes management and command / control easier with large deployments but .not needed for just. 400 hosts. If you want a nice UI, check out fruity or lillac for configuration. Max On 9/7/10, yuris yu...@smtp.com wrote: Hello, I'm using Nagios for several months with plain files setup, and nagiosgrapher. It works pretty nice and I have no problem monitoring ~50 hosts; ~10 service_checks per host. Now I'm thinking about migration of other hosts from Zabbix to Nagios and totally it is going to be ~350 hosts. Since the number of hosts is significant plain test configs does not seem to be a good option, and keeping hosts/hostgroups in database seems preferable but from experience with Zabbix I know that storing everything in database might be not that good idea but pretty obvious bottleneck :/ Could someone please give me an idea what drawbacks I may face if switch from plain nagios3 to ndoutils-nagios3-mysql with 350 - 400 hosts ? Can I store in mysql db only hosts and hostgroups and manipulate them (add/remove) via web interface while keep everything else in old way (plain test configs) to reduce load on mysql? -- wbr, Yuris Linux System Engineer, SMTP.com -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Distributed Setup of Nagios
On Thu, Aug 19, 2010 at 10:39 AM, Kyle O'Donnell nag...@isprime.org wrote: We have done some other significant changes to GW to scale as large as we did, but these had less to do with nagios and more do to with GW. We are using a ramdisk for the nagios log directory, which helps quite a bit for nagios performance. I can second that - RAM disk for nagios temp files and log files, objects.cache and retention.dat (we rsync to disk every N minutes for disaster recovery) makes a very noticable difference. - max -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Centreon nagiosadmin login says invalid user!!!
On Thu, Aug 19, 2010 at 10:41 PM, Toonz IT it.to...@gmail.com wrote: We can login to Nagios using user name nagiosadmin and password nagiosadmin. But Centreon returns Invalid User! We are using FAN 2.0 What did the users / developers on the Centreon / FAN mailing list say about this? They might be able to help. -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Distributed Setup of Nagios
On Wed, Aug 18, 2010 at 7:22 AM, Ton Voon tonv...@gmail.com wrote: You may want to look at Opsview (http://opsview.com). From a single point of configuration, it pushes out the nagios configuration to the remote slaves which are independently running their own copy of Nagios. We have users going up to 25 slaves! Cool - how many active service checks / active host checks per poller? - Max -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Distributed Setup of Nagios
On Wed, Aug 18, 2010 at 11:07 AM, Kyle O'Donnell nag...@isprime.org wrote: we have ~ 3 services and ~3000 hosts we have 6 pollers (each have a backup) processing checks and forwarding back to a central nagios host. our busiest poller has ~1000 hosts and ~9000 services... avg service check interval is 5 minutes, but there are a bunch at 1 and 2 minute intervals. avg service check latency is less than 1 second This is ~3yr old hardware too, i suspect we could increase capacity by 50% if we move to the new intel nahalems Nice - appreciate you sharing your numbers - everyone who does distributed code around Nagios adds overhead, so it is nice to see real numbers as opposed to 'as many as can be done' as we all know how wildly that varies :) - I have spent many many hours with my colleagues tuning the 'as many as can be done' numbers. We have done a distributed variant of Nagios as well - our non-distributed pollers (Compaq 380s with 8 GB RAM + RAID 10) poll 2k host checks (every 10 minutes) and 11k service checks (avg interval 5 minutes), all checks send performance data through a NEB module as well to our performance data processing tier - with our distributed code in place that falls to around 1.5k host checks and 8-9k service checks per poller. Average non-distributed host and service check latency around 1.2 seconds, distributed around 2.4 seconds. Our new hardware consists of Dell R710s - dual 8 core processors, wow do those rock - with our distributed code we are getting around 2x those numbers per poller even with the overhead of the distribution mechanism in place. We will be releasing our distributed variant as open source software in the next month or so - i suspect that our methodology is org specific enough that it will not work for many places, but for higher volume polling it might be worthwhile to adopt and some of the concepts and metholodigies in it we hope will lead to sparking ideas in others for better ways to do distributed Nagios. We also take the approach of pushing out configs to remote pollers - we have a redundant UI tier where we stage a configuration - after the configuration is staged, we have code (will allow for manual operator adjustment in a dot release) that will equally distribute checks among pollers desginated as being available for use - that code then builds out a common retention.dat file for all pollers along with objects.pre-cache files for each poller - those files are pushed out to each poller and the pollers are restarted (yes, we have thought through and worked out all the synchronization issues involved). Our UI then lets users take the actions the Nagios Ui does and knows where to send the commands to affect the real poller instances. Working well so far, and as with all the alternate Nagios UIs, we are able to make a much more intuitive and flexible UI. Code should be available in early October. - Max -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Distributed Setup of Nagios
On Tue, Aug 17, 2010 at 10:26 AM, Bradley Radjoo bradley.rad...@is.co.za wrote: Hello All, Please can you assist me as I a 1st timer to Nagios itself as I need assistance on a Distributed Monitoring Setup of Nagios. Where one server manages other instances of Nagios Servers in differant locations. Any help or documentation on installation and configuration would be of great help ;-) DNX might work for you: http://dnx.sourceforge.net/ - Max -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Applying a multiplier to check_snmp results
I have written and maintain a perl library that extends Nagios::Plugin - it is called Nagios::Plugin::SNMP. It is available on CPAN and while it does not take the complexity out of SNMP data it does simplify writing plugins that retrieve SNMP data. The team I am on at my job uses it extensively - you might find it helpful for creating custom plugins tht need to not only retrieve but also cook SNMP data before doing thresholding on it. - Max On 8/12/10, Hall, JC jh...@iodatacenters.com wrote: I certainly wish I could do that. I'm just a novice. I can basically accomplish what I want (in regards to multiplying the output) with snmpget. printf scale=2;%d*0.01\n $(snmpget -v 1 -c public 192.168.104.37 .enterprises.6347.1.1.17 -O Qve)|bc 208.72 Any thoughts surrounding how I could run this as an external command and pipe it back into nagios? -Original Message- From: Matthew J. Salerno [mailto:vagabond_k...@yahoo.com] Sent: Thursday, August 12, 2010 2:17 PM To: Nagios Users List Subject: Re: [Nagios-users] Applying a multiplier to check_snmp results I ran into the same thing, I ended up writing my own plugins. Just look for a non binary plugin (perl/python etc) and then hack away. - Original Message From: Jim Avery j...@jimavery.me.uk To: Nagios Users List nagios-users@lists.sourceforge.net Sent: Thu, August 12, 2010 5:05:42 PM Subject: Re: [Nagios-users] Applying a multiplier to check_snmp results On 12 August 2010 21:27, Hall, JC jh...@iodatacenters.com wrote: I have a situation where I would like to apply a multiplier to the results of a check_snmp output. The situation is this: The value received from my check_snmp = 20717. The integer multiplier for this particular OID is 0.01. The output I want reported to Nagios is 20717 x 0.01 = 207.17. Looking through the options of the check_snmp plugin, I can’t determine how I could accomplish this. Any thoughts are appreciated. +1 to that! I was doing some monitoring on a UPS yesterday and found the current is given as an integer which is 10x the Amps, so it would be great to be able to divide by 10 (or multiply by whatever) in check_snmp. -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] checks per host
On Mon, Jul 12, 2010 at 1:39 PM, Joel Brooks jbro...@oddelement.com wrote: hey gang, I'm trying to get a sense of what's normal for the number of checks per host. I'm pushing nagios to a number of servers and the list of things I want to monitor keeps growing. For some servers, I've got 30 checks - some 50. what is normal out there? is there a practical limit? The limit is what your HW can handle and what your people can handle seeing per host. Our teams try to make checks that check multiple elements at once to reduce the numbers of checks per host without loss of alarm granularity - for example, our disk checks check all partitions on a host and allow for rich thresholding - user can specify many thresholds to check per service. Same for critical processes - one check to look for all critical processes and report any that have failed. - Max -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Running Nagios on Vmware
Kent Saunders wrote: We run a large part of our environment on VMware, however there are certain things that we deem worthy of hardware, Nagios being one of them. Since we rely on Nagios to inform us of problems across our environment, we need it to be highly reliable, with performance being a lesser consideration. We've experience issues with VMware that cause large numbers of guests to lock up, and if this happened to Nagios we wouldn't know, or have as deep a level of alerting. However, if you really like the idea of running it in a vm, I'd suggest putting a simple is Nagios alive type test outside VMware somewhere, to give you an extra layer of protection should the guest have troubles. Same here. We take the philosophy with wanting Nagios on separate hardware running nothing else but monitoring and graphing/trending. For the exact same reasons listed, we want to know if there's a problem with one of our virtualization servers, and if Nagios was a VM on the same box that's having a problem, we wouldn't get those notifications. My Nagios instance is on an older server with a RAID 1, and of course backed up, so it could easily be restored should the machine die. Of course running it as a VM allows the use of snapshots and very very easy backup and restores, but I still think having it isolated is important. Regards, Max -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] wiki down?
On Wed, Jun 30, 2010 at 8:45 AM, Roy Sigurd Karlsbakk r...@karlsbakk.net wrote: This is getting silly - can someone please fix that wiki, or should we all move over to Icinga and do some real work for once? Why would you assume that any of us are not doing real work? -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] IP and hostname mapping control
On Tue, Jun 29, 2010 at 1:40 AM, Network Operation Center FMC Luxemburg n...@eurofmc.com wrote: Hi everybody, I'm looking for a way to check out the mapping between a hostname and IP address. Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not associated with this IP, I would have an alarm. Indeed the script below returns no alarm : Take a look at check_dns. - Max -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] how to fix excessive latency
Clock skew can be an issue as well depending on the virtualization platform. On 6/29/10, Giorgio Zarrelli zarre...@linux.it wrote: I agree, better not to use Nagios on virtual machine. The I/O layer of vms have poor performances. Ciao, Giorgio Il giorno 29/giu/2010, alle ore 14:23, Andreas Ericsson a...@op5.se ha scritto: On 06/29/2010 03:57 AM, wwanghongrui wrote: Thanks your reply. We are writing to mysql database by ndoutils.We don't use nsca. About external_command_buffer_slots, we don't set it up. status_update_interval =15 I use vmstate to capture system performance,like below.Maybe the bottleneck is not at system. Endeavour to not run Nagios on a virtual server. If you must use a virtual server, make very sure that your checkresult spooldirectory and status data files are on a ramdisk, or you will certainly run into trouble. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage
is Ready OK | 0:0 | Connector 0 [SAS] on controller 0 is Ready OK | 0:1 | Connector 1 [SAS] on controller 0 is Ready OK | 1:0 | Connector 0 [SAS] on controller 1 is Ready OK | 1:1 | Connector 1 [SAS] on controller 1 is Ready OK |0:0:0 | Enclosure 0:0:0 [Backplane] on controller 0 is Ready OK |1:0:1 | Enclosure 1:0:1 [MD1000] on controller 1 is Ready OK |1:0:0 | Enclosure 1:0:0 [MD1000] on controller 1 is Ready OK |1:1:0 | Enclosure 1:1:0 [MD1000] on controller 1 is Ready OK | 1:0:1:1 | Fan 1 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:2 | Fan 2 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:3 | Fan 3 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:4 | Fan 4 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:0:1 | Fan 1 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:2 | Fan 2 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:3 | Fan 3 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:4 | Fan 4 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:1:0:1 | Fan 1 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:2 | Fan 2 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:3 | Fan 3 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:4 | Fan 4 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:0:1:1 | Power Supply 1 in enclosure 1 [MD1000] is Ready OK | 1:0:1:2 | Power Supply 2 in enclosure 1 [MD1000] is Ready OK | 1:0:0:1 | Power Supply 1 in enclosure 2 [MD1000] is Ready OK | 1:0:0:2 | Power Supply 2 in enclosure 2 [MD1000] is Ready OK | 1:1:0:1 | Power Supply 1 in enclosure 3 [MD1000] is Ready OK | 1:1:0:2 | Power Supply 2 in enclosure 3 [MD1000] is Ready OK | 1:0:1:0 | Temperature Probe 0 in enclosure 1 [MD1000]: 31 C (65 max) OK | 1:0:1:1 | Temperature Probe 1 in enclosure 1 [MD1000]: 32 C (65 max) OK | 1:0:1:2 | Temperature Probe 2 in enclosure 1 [MD1000]: 24 C (55 max) OK | 1:0:1:3 | Temperature Probe 3 in enclosure 1 [MD1000]: 24 C (55 max) OK | 1:0:0:0 | Temperature Probe 0 in enclosure 2 [MD1000]: 31 C (65 max) OK | 1:0:0:1 | Temperature Probe 1 in enclosure 2 [MD1000]: 33 C (65 max) OK | 1:0:0:2 | Temperature Probe 2 in enclosure 2 [MD1000]: 25 C (55 max) OK | 1:0:0:3 | Temperature Probe 3 in enclosure 2 [MD1000]: 25 C (55 max) OK | 1:1:0:0 | Temperature Probe 0 in enclosure 3 [MD1000]: 31 C (65 max) OK | 1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive OK | 1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]: C ( max) OK | 1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]: C ( max) OK | 1:0:1:0 | EMM 0 in enclosure 1 [MD1000] is Ready OK | 1:0:1:1 | EMM 1 in enclosure 1 [MD1000] is Ready OK | 1:0:0:0 | EMM 0 in enclosure 2 [MD1000] is Ready OK | 1:0:0:1 | EMM 1 in enclosure 2 [MD1000] is Ready OK | 1:1:0:0 | EMM 0 in enclosure 3 [MD1000] is Ready CRITICAL | 1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed - Chassis Components = STATE | ID | MESSAGE TEXT -+--+ OK |1 | Memory module 1 [DIMM1, 4096 MB] is Ok OK |2 | Memory module 2 [DIMM2, 4096 MB] is Ok OK |3 | Memory module 3 [DIMM3, 4096 MB] is Ok OK |4 | Memory module 4 [DIMM4, 4096 MB] is Ok OK |5 | Memory module 5 [DIMM5, 4096 MB] is Ok OK |6 | Memory module 6 [DIMM6, 4096 MB] is Ok OK |7 | Memory module 7 [DIMM7, 4096 MB] is Ok OK |8 | Memory module 8 [DIMM8, 4096 MB] is Ok OK |1 | Chassis fan 1 [System Board FAN 1 RPM]: 8700 OK |2 | Chassis fan 2 [System Board FAN 2 RPM]: 8850 OK |3 | Chassis fan 3 [System Board FAN 3 RPM]: 8775 OK |4 | Chassis fan 4 [System Board FAN 4 RPM]: 8400 OK |0 | Power Supply 0 [AC]: Presence detected OK |1 | Power Supply 1 [AC]: Presence detected OK |0 | Temperature Probe 0 [System Board Ambient Temp] reads 16 C (min=8/3, max=42/47) OK |0 | Processor 0 [Intel Xeon E5420 2.50GHz] is Present OK |0 | Voltage sensor 0 [CPU1 VCORE] is Good OK |1 | Voltage sensor 1 [System Board CPU VTT] is Good OK |2 | Voltage sensor 2 [System Board 1.5V PG] is Good OK |3 | Voltage sensor 3 [System Board 1.8V PG] is Good OK |4 | Voltage sensor 4 [System Board 3.3V PG] is Good OK |5 | Voltage sensor 5 [System Board 5V PG] is Good OK |6 | Voltage sensor 6 [Riser 1.5V PXH PG] is Good OK |7 | Voltage sensor 7 [Riser 5V Riser PG
Re: [Nagios-users] wiki down?
On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons standalone.sysad...@gmail.com wrote: If only there were some kind of software available to let us know when websites were down... Or people to respond to alerts from the software :) -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage
Excellent, sorted, everything reports as OK now. Thanks so much Trond, amazing support and an amazingly useful plugin! Best Regards, Max Williams -Original Message- From: Trond Hasle Amundsen [mailto:t.h.amund...@usit.uio.no] Sent: 28 June 2010 15:21 To: Nagios Users List Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Max Williams max.willi...@mflow.com writes: Here is the output, the inactive temperature probe is sorted but the missing EMM still produces an alert: OK | 1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive This one works as expected :) OK | 1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]: C ( max) OK | 1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]: C ( max) Hmm... something strange going on here. I wonder why this happens, in the SNMP output you attached previously the values are there. Anyway, I've added some extra checking in the code to make it report better if the reading is unavailable for some reason. It should now report simply: Temperature Probe 0 in enclosure 2:0:0 [MD1000] is Ready if the temp reading is not an integer and OMSA reports the status as OK. CRITICAL | 1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed Ah.. I misread the SNMP output.. The status is Unknown when reported by omreport, but Other when reported with SNMP. One little annoying difference between the two.. The output should be: EMM 0 in enclosure 2:0:0 [MD1000] is Not Installed with an OK state. I've created a second test version: http://folk.uio.no/trondham/software/beta/check_openmanage Please give this one a try and see if it performs better. Cheers, -- Trond H. Amundsen t.h.amund...@usit.uio.no Center for Information Technology Services, University of Oslo -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage
Hi, After adding more storage to a couple of our servers we are getting this error: [r...@host ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2 Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at ( max) EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. [r...@host ~]# We didn't get this error before adding a new cabinet of disks which now brings the total up to 47 (2x internal disk and 3x full MD1000s). Has any one else come across this error? I am not perl literate so not sure how to debug or fix this. Cheers, Max -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.5 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.1 = STRING: A.04 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.2 = STRING: A.04 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.3 = STRING: A.04 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.4 = STRING: A.04 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.5 = STRING: A.04 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.1 = STRING: \\1\\0\\1\\0 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.2 = STRING: \\1\\0\\1\\1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.3 = STRING: \\1\\0\\0\\0 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.4 = STRING: \\1\\0\\0\\1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.5 = STRING: \\1\\1\\0\\0 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.6 = STRING: \\1\\1\\0\\1 Let me know if you need anything else. Best Regards, Max Williams -Original Message- From: Trond Hasle Amundsen [mailto:t.h.amund...@usit.uio.no] Sent: 25 June 2010 16:02 To: Nagios Users List Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Max Williams max.willi...@mflow.com writes: Hi, After adding more storage to a couple of our servers we are getting this error: [r...@host ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2 Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at ( max) EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ plugins/check_openmanage line 2318. [r...@host ~]# We didn?t get this error before adding a new cabinet of disks which now brings the total up to 47 (2x internal disk and 3x full MD1000s). Has any one else come across this error? I am not perl literate so not sure how to debug or fix this. Hi Max, This is interesting. I've never seen Inactive temperature sensors in external enclosures. Also, that the plugin reports missing EMMs seems like a misfeature. Can you post the output from the following commands: On the monitored host: omreport storage enclosure controller=id enclosure=id info=temps omreport storage enclosure controller=id enclosure=id info=emms Replace id with controller/enclosure pairs. You'll get the enclosure and controller IDs with commands omreport storage controller omreport storage enclosure Also, since you're checking with SNMP, I'll need the output from an snmpwalk of the enclosures wrt. temperatures and EMMs. From the Nagios server: snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.11 snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.13 If you are uncomfortable with posting this information on the mailinglist, feel free to email me directly. Debug output from the plugin could also be useful: check_openmanage -H hostname -C community -d Cheers, -- Trond H. Amundsen t.h.amund...@usit.uio.no Center for Information Technology Services, University of Oslo -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list
Re: [Nagios-users] Help - I just blew away my configs
Shows running settings (config.cgi) Objects.cache contains a flattened version of all the configuration objects your configuration tree had; the primary loss is that there are no templates in objects.cache, so you will have to recreate those. Isave a copy of objects.cache in a very safe place off server and save a copy on server - nagios has a command line option that will let you start it using an objects.cache formatted file ( search for pre-cached configs)m If you had plugins in your config tree those are lost, cgi.cfg settings are lost, nagios.cfg settings are lost, and resource.cfg settings are lost - minus resource.cfg the rest are pretty eady to redo (as resource.cfg might have had passwords or other unique info. Definitely put your configs in svn, cvs, git or another version control system as you re-create your config templates - and makes sure you backup your vcs repository daily. Max On 6/24/10, Sean Carley scar...@gmi-mr.com wrote: You can also try /nagios/cgi-bin/config.cgi, not sure if that shows the running or on-disk configs. -s -Original Message- From: Gius, Mark [mailto:mg...@createspace.com] Sent: Thursday, June 24, 2010 4:11 PM To: Nagios Users List Subject: Re: [Nagios-users] Help - I just blew away my configs A pretty significant portion of the configurations are stored in the objects cache (/var/log/Nagios/objects.cache for me). This won't be as clean as your configs (and I'm not sure Nagios can use this file as a config directly), but you should be able to recover a pretty good amount of your running configs from there, and start reconstructing the config files. May I be the first to suggest placing your configuration in revision control, which will help mitigate this problem in the future. I would suggest Subversion (http://subversion.tigris.org/) or git (http://git-scm.com/). Good luck! -Gius -Original Message- From: Litwin, Matthew [mailto:mlit...@stubhub.com] Sent: Thursday, June 24, 2010 3:17 PM To: Nagios Users List Subject: [Nagios-users] Help - I just blew away my configs I just blew away all my nagios config files. Nagios is still running. Is there any way I can make nagios spit up the configs that were loaded from the command line or is all hope lost? --- --- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Postemsg
The main things you will not get from Nagios that you almost always get with Tivoli: * High recurring licensing fees * On-site Tivoli consultants :) Nagios does not give you out of the box the visualization dashboards that Tivoli has but with Nagviz you can you make very nice graphical dashboards at a much much lower cost to your company. Nagios also does not do auto-discovery out of the box but there are projects that give you that capabililty - again at a much lower cost. Distributed Nagios - there are a few choices, you will need to take the time to evaluate them all and choose the right one for you, but again, cost will be lower than Tivoli. The team I am on is building out a distributed architecture for Nagios based on our unique requirements - self service model where many SAs can all change configs on their schedule without our intervention, clustering, fast redistribution of hosts/services across pollers, centralized transparent (to the end user) command and control across all pollers. We are using some existing tools (Nagios and Merlin) and 4 developers and even then the TCO and maintenance cost will be magnitudes of order cheaper than Tivoli with much more functionality than most Tivoli shops offer. A polling model always has some challenges when it comes to scaling big but compared to Tivoli I think you will find Nagios to be both a lot more fun, a lot more flexible, a lot better fit, and, if politics don't interfere, your management should be much more happy with a fixed cost development price tag than the high $$ open ended maintenance costs of a commercial product like Tivoli. - Max On Fri, Jun 11, 2010 at 10:19 AM, steve f a31mod...@hotmail.com wrote: Hello All, I am currently looking for an alternative to using Tivoli , TEC postemsg for a rather large ( 6000 + ) remote environment. I have had great success with Nagios in my small local/remote test environment and the obvious cost savings without having TEC anymore is huge. Can I use the existing postemsg tests that are running on the boxes and via I guess External Commands have Nagios process the messages? For those familiar with both Tivoli Nagios, Is there anything that Tivoli gives me that I cant do with Nagios? I don't see it if there is. Thanks for the help, Steve -- The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with Hotmail. Get busy.http://www.windowslive.com/campaign/thenewbusy?tile=multicalendarocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5 -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Large Installation
I can attest / confirm what Andreas states about the merlin daemon. BTW, Andreas, I just patched our code base to contain your 0.6.7 changes and I will be posting that on Github for you and anyone else interested to check out over the weekend. Our tests so far are showing that with the Merlin NEB and daemon on a poller we lose less than 10% capacity on the poller compared to the poller without the NEB module and Merlind - our test poller is running 10k active services checks and 1k active host checks in less than 5 minutes with polling headroom to spare. - Max -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Large Installation
Our changes to Merlin allow N pollers to all write to the same database without conflicts. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Can an entity have more than one parent?
On Fri, May 28, 2010 at 2:22 PM, shadih rahman shadhi...@gmail.com wrote: 1) Can a host have two parents? http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host 2) What happen if only one of those parents go down? http://nagios.sourceforge.net/docs/3_0/networkreachability.html -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Sending NAGIOS alerts to Gmail addresses can get your mail server listed as suspect.
Drew Weaver wrote: I noticed that a lot of our mail was ending up in users' junk/spam folder at GMail and it seems that if you send Nagios warning messages to Gmail they somehow assume that your server is malicious and spamming. Is it SOP to use a different SMTP server to deliver Nagios messages? I'd assume you'd have to take that up with GMail. Or check the spam filtering settings on the user accounts. I don't use GMail, but I assume there are junk spam filtering rules that you can do to classify mail on the settings, like all other e-mail clients and services. Regards, Max -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] newbie -- checking log files
On Thu, May 27, 2010 at 12:00 PM, Jay G. Scott g...@arlut.utexas.edu wrote: my boss says nagios (dunno which version he used) doesn't monitor log files. is that no longer true, i hope? http://www.google.com/#hl=enq=Nagios+monitor+log+files -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Full Throttle Nagios
Really look forward to seeing this code - this is the process pool code, yes? On 5/19/10, Andreas Ericsson a...@op5.se wrote: There are several tweaks one can do to make Nagios more or less saturate the CPU, but none of them are enough if the number of checks rise above a certain level. We've noticed that we can run about 60K service-checks so long as we don't have many state-changes in the network. That's not nearly good enough, so it will be improved. Work is under way to revamp the Nagios check handling though. It will get better, but it's a series of large patches that need to be thoroughly tested before they get released. I'll holler when I have them in a testable state, since I guess a lot of people would be quite interested in trying out the new checking engine I'm working on. If it works as good as it seems to, it should alleviate a lot of performance issues. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Full Throttle Nagios
Patch can be found here: http://github.com/netoisstools/Nagios-3-Patches/blob/master/events.c.patch Download it to your nagios source directory then run patch -p0 events.c.patch if you use the patch, make sure to set use_large_installation_tweaks to 1 as it will only skip the sleep time if large installation tweaks is enabled. Regards, Max -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] how to restart httpd service on remote server
As can puppet, Chef, Monit, Net-SNMP, and many others :) .. always a balance with deciding where process restarts should happen - the larger the network, the more benefit there is to having autonomous agents handle restarts and have Nagios focus purely on polling / asynchronous event handling. -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugin fails with epn
If you look at p1.pl, which is in the Nagios bin directory you can enable ePn logging (EPN_LEAVE_MESSAGES or something similar for debug level) and have epn log each command execution to a log file ... This often will make the source of an epn error more obvious. The other thing to do is add Use warnings; To your script or run it from the command line with perl -w As any output to STDERR will cause ePN to halt execution. That said, 'not execute properly' errors are often permissions or path errors, so when you test from the CLI make sure you run the script as the same user your Nagios damon runs as. Max On 4/11/10, Joseph L. Casale jcas...@activenetwerx.com wrote: I've written a plugin that very closely resembles other contrib based Perl plugins and runs as expected at the cli. It fails with a Service check did not exit properly unless I force it to run w/o the epn. I have read the guidelines, but not really knowing much about Perl, I can't see where its tanking. Is there a way to run it at the cli with the epn in hopes to see more output to glean a clue as to what's going wrong? Or is there a better way to glean more useful info? Thanks, jlc -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugin fails with epn
Meant exit not execute :) ... Good luck. On 4/11/10, Joseph L. Casale jcas...@activenetwerx.com wrote: If you look at p1.pl, which is in the Nagios bin directory you can enable ePn logging (EPN_LEAVE_MESSAGES or something similar for debug level) and have epn log each command execution to a log file ... This often will make the source of an epn error more obvious. Ok, looks like there are some factors surrounding when the o0utput even gets dumped? Plugin was touched, now I am waiting... The other thing to do is add Use warnings; Got that in the plugin as per the guidelines. As any output to STDERR will cause ePN to halt execution. Works fine:/ That said, 'not execute properly' errors are often permissions or path errors, so when you test from the CLI make sure you run the script as the same user your Nagios damon runs as. Actually, it is 'exit' not execute, it has perms granting execution for all. We'll see what happens after I come back to it. Thanks for the help! jlc -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugin fails with epn
The 'not stayed shared' is the one warning that ePN will not die on at run time so you can safely continue with that warning being emitted. - Max On Sun, Apr 11, 2010 at 4:48 PM, Joseph L. Casale jcas...@activenetwerx.com wrote: Meant exit not execute :) ... Good luck. I guess the epn isn't reread on a 'reload', needed a 'restart', nuances :) Ok, so it complains: Variable $opt_hostname will not stay shared at (eval 1) line 23,. The plugin dump shows: 23 my ($opt_hostname,$opt_port,$opt_password,$opt_command,$opt_regex,$opt_help,$opt_version,$opt_verbose); Missed #4 at http://nagios.sourceforge.net/docs/3_0/epnplugins.html Not being fluent in Perl, I can't say I fully comprehend what that's about, but it works now:) Thanks! jlc -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Inteface traffic
IF-MIB On 4/8/10, Paras pradhan pradhanpa...@gmail.com wrote: Which is the correct SNMP MIB to see Data IN and Data out from interfaces? Thanks Paras. On Wed, Apr 7, 2010 at 12:14 PM, Paras pradhan pradhanpa...@gmail.comwrote: I have few Linux and Solaris hosts that I need to grab the total in and out traffic from there nics. Which plugin should i use? Do i need to configure snmp on all of my hosts or there are some non snmp plugins. Thanks! Paras. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugin to check file for string?
Guy Waugh wrote: Does anyone know of a nagios plugin that will grep a file for a string? I have a cronjob that logs output to a file and I basically want to grep for 'cronjob successful' in this file, alerting Critical if the string isn't found and OK if it is. The check_file plugin that is part of OpenFusion's plugin pack can check a file for a regular expression. I've used it for this purpose. http://www.openfusion.com.au/labs/nagios/ Regards, Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3.2.1 - browser refresh
Trisha Hoang wrote: I have tried both and restarted apache. 1) rename file $prefix/share/index.php to index.html, and 2) placing this line ?php header(Cache-Control: max-age=7200, public); ? on the first line of index.php. Same here, neither changing the filename to .html or adding the php line in the first line work for me either. CentOS 5.4 and Nagios 3.2.1. Regards, Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3.2.1 - browser refresh
Marc Powell wrote: Looking at CVS, there have been no changes to index.php in 14 months (several releases). Whatever's changed. it's not this. Smells like CentOS to me...either Apache or PHP. I know there was an httpd update this past week for CentOS 5, which I loaded. Is everyone else experiencing this on CentOS updated? Regards, Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to monitor a log file on a windows box?
Mirza Dedic wrote: I have a few Windows boxes where I want to monitor a specific log files for matching errors, and if they are there; I want Nagios to warn the appropriate users. I have accomplished this on our AIX box using consol's check_logfile plugin, however I need this function for Windows. My windows boxes are monitored by NSClient++ (latest build). Any idea on how I could accomplish this? Examples? Nagios 3.2 NSClient 0.3.8 (running NRPE and NSCA) I have not used this, but check a these links out and see if they can do what you're looking to accomplish. http://exchange.nagios.org/directory/Addons/Monitoring-Agents/Windows-Eventlog-Addon/details http://exchange.nagios.org/directory/Addons/Log-File-Management Regards, Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Pre Flight Check -Errors.
On Wed, Mar 31, 2010 at 1:29 AM, ravishankar.gundlap...@wipro.com wrote: Hi, Request someone to provide the solution... Please take some time to re-read the diagnostic message Nagios provided you, it tells you exactly what is wrong. Error: Contact group 'admins' specified in service 'C:\ Drive Space' for host 'LTCPDCERP01' is not defined anywhere! You need to define the contact group, as Giorgio stated in his email rresponse. If you do not understand how to correct it, the usual suggestions are: 1) Read the manual and learn how to correct this - http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#contactgroup 2) If you are not comfortable with learning on your own or find you are not able to fix this yourself, nagios.com offers paid service contracts and people who will be happy to provide the solution to you for a fee - http://nagios.com/services/support Regards, Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_cluster configuration.
On Mon, Mar 29, 2010 at 7:23 AM, HyunSung Jang hsj...@gmail.com wrote: then what should be the host_name for service definition ? my server hostname start from www1 ~ www100 and at this time I just added one more service under www1 definition. but this doesn't look good. Can i make it like a standalone ? I mean, i don't want put cluster service under one of my www server. it looks like a service that belongs to just one host. any idea about this~? One common pattern for this is to give the clustered host the name of the URL it represents, e.g. if the cluster represented the web service http://www.example.com I would use example.com as the name. If the cluster is behind a VIP, even better, you can then use the VIP IP as the 'virtual host' object IP. - Max -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Two IP for the Same server
On Fri, Mar 12, 2010 at 11:52 AM, Digital Edge reachta...@hotmail.com wrote: Dear List, I have a server , which is having 2 NIC with IP assign. I wanted to add in nagios those two IP with same hostname relevant service checks. Will it be possible. Not with a single host definition; however this can be modeled with 3 host definitions: * One virtual parent host object that has as a host check ping checks for both IPs * One virtual host object that represents NIC1 on the host * One virtual host object that represents NIC2 on the host Parent -- A - NIC 1 | |--- B - NIC 2 The two hosts representing the NICs would have the parent host as their parent so that if the ping checks fail for the two hosts you only get notified that the parent host object is down. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitor Java Process
You could use check_proc via NRPE or al custom SNMP base check SNMP agent (HR MIB or an agent specific MIB). A number of SnMP agents can be configured to send SnMP trpas when monitored processes exit. The Sun JVM also has a built in SNMP agent that exposes thread count, heap size and some other base JVM metrics. This agent can easily be proxied through an agent like Net-SNMP so you don't have multiple agent ports to work with. If you are using weblogic, it's built in SNMP agent can be configured to send a variety of SNMP traps, including starting up and shutting down traps. Max On 3/2/10, Jatin Davey jasho...@cisco.com wrote: Hi I want to know how i can monitor the java process running on remote box , basically i want to keep monitoring it and raise an email alert if it has re-started or stopped functioning. Please let me know how this can be achieved using nagios. Thanks Jatin -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE/NSCA replacement thoughts?
On Fri, Feb 19, 2010 at 1:11 PM, Kevin Keane subscript...@kkeane.com wrote: Personally, I'm not sure if there is a single best way. That is why I keep promoting separating the protocol from the transport. Basically, Nagios protocol data fits into Layer 6 or 7 of the OSI model, while the encryption and authentication really should be handled by layer 4 or 5. We have written in addtion to our other agents an HTTP-based agent that uses either RESTful or non RESTful style URLs for requests. Creating an HTTP RESTful style interface spec would be a pretty clean way to define a standard without getting into implementation discussions and would provide a common ground for plugins. As with SNMP or SSH HTTP comes with a performance price but more and more with Nagios I am seeing (because it is so flexible) that defining standards for a user group / org that uses Nagios is much harder than performance tuning is. -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE compilation on HPUX
On Wed, Feb 10, 2010 at 5:30 PM, Ciro Iriarte cyru...@gmail.com wrote: Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what missing library can cause this? Colleague of mine used the NRPE package here: http://mayoxide.com/naghpux/ and said it works well. - max -- SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Three tier alert system, and how to invoke an external script?
Mr Gabriel wrote: Is it possible to do the following; There are five people, three groups of people who should be notified in case of any issues. the three groups of people each have three ways of being notified. Group One - Help Desk Team Group Two - IT Support Team Group Three - Management First notification is email, the second is text message, and the third one is by phone. Email is already configured. We have a perl script that can be run to send messages, and also a perl script that can connect to our asterisk boxes to make calls. On the first alert, notify group one by email, and ignore the second and third group. If no acknowledgment is recieved, run perl script for SMS with group one by, and group two notify by email If again no acknowledgment is received, run script for phone calls, and script for SMS/email to group three. I would like to know if such a setup is possible, and any hints and tips as to where I should be looking to figure it out. Before I forget, how can I invoke an external script, which is stored on the same server as nagios? You should be able to do all that you want to do by using escalations. http://nagios.sourceforge.net/docs/3_0/escalations.html I'm not certain how to invoke external commands using escalations, but you can notify the first to your e-mail, wait however many minutes for the second notification and have it go to group 2, and then wait another x amount of minutes, then notify group 3. Escalations should be the ticket for what you want. Regards, Max -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Integration with IBM TEC ( Tivoli )
Snmp traps are the easiest way to integrate Nagios with other tools that use SNMP. With a little leg work you can use SnMPTT to also receive traps from other SNMP sources and submit them toi Nagios as passive checks. We currently use SNMP traps to integrate Nagios with Spectrum, works quite nicely. Max On 1/11/10, steve f a31mod...@hotmail.com wrote: Hello, We have IBM TEC ( Tivoli Enterprise Console) for alot of our current monitoring. Does anyone have any experience / history integrating Nagios with IBM TEC? We use postd messages to get the info from the clients back to TEC. Can Nagios read the postd messages? I guess I could somehow use a passive check to do this correct ?? Would Nagios be considered a viable replacement for TEC? I have not been able to find any Marketing type of info as far as Nagios as a replacement for other monitoring solutions. Thanks, Steve _ Hotmail: Powerful Free email with security by Microsoft. http://clk.atdmt.com/GBL/go/196390710/direct/01/ -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 30 seconds initial delay
On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas ampra...@gmail.com wrote: When I click a page I get the error Error: Could not read host and service status information! . After 30 seconds, all pages appear again!! - I don't have duplicated nagios service running - This delay didn't show up to older versions - My linux is Debian 5.0.3 (stable) - This error appears every time I restart nagios We typically see about the same start up time, both on 3.0.3 and 3.2.0 on hosts with 1-2k hosts and ~10k services where we have retention.dat files that are weeks or months old and we are using regular expressions and service - hostgroup mappings extensively. - Max -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios list of devices/IP addresses
On Mon, Nov 2, 2009 at 5:05 PM, Gareth Fletcher gareth.fletc...@onenet.co.nz wrote: As is always the case in IT now we want to extend this and insert the IP addresses into this table (so we can build a DNS zone with our Nagios hosts). Does anyone know a way I can easily grab the IP address of a given hostname? Or else an easier way to get all the hostnames/IPs? * You can parse objects.cache * You could switch to NDO or merlin * You could install and use Nagios2JSON as that exposes host and service configuration information. - Max -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] pnp4nagios over nrpe
On Fri, Oct 2, 2009 at 10:49 AM, Leonardo Carneiro lscarne...@veltrac.com.br wrote: I'm using pnp4nagios in some local services and it's running just fine. now i'm trying to get perfdata from a service running on a remote host, but doing so via nrpe. the command it's a simple shell script that checks how many files exist in a especified folder. When i try to see the graph, the last item show me an error: RRD Database/usr/local/nagios/share/perfdata/gwserver01/MSG_para_o_banco.rrd not found. I now i could be thousand of problems. what i wanna know if is possible to gather this kind of data via nrpe. tks in advance and forgive me for my poor english. yes, absolutely NRPE can be used to collect performance data .. does your shell script return performance data as part of it's output? You should see a pipe symbol | and then metrics in the output from the script, e,g. MY-CHECK - OK - Blah blah | 'metric1'=10;90;100 'metric2'=4;80;100' - Max -- Come build with us! The BlackBerryreg; Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9#45;12, 2009. Register now#33; http://p.sf.net/sfu/devconf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] SNMP for External IP Address
On Thu, Sep 24, 2009 at 12:46 PM, Kevin Holleran kdaw...@gmail.com wrote: Anyway, one of my tasks is to obtain the WAN address for a series of routers at remote locations (~100 locations). I have been trying to find the appropriate SNMP query for this but have come up short. When I do an snmpwalk, the only thing I can find that has the Internet IP is this: .1.3.6.1.2.1.4.20.1.1.THE.INTERNET.IP.ADDRESS = IpAddress: THE.INTERNET.IP.ADDRESS where THE.INTERNET.IP.ADDRESS is the actual address I am looking for. This is in the IP-MIB::ipAdEntAddr table. However, I cannot query this since what I am looking for is part of the OID that I am trying to query. Are you looking for the IP addresses associated with one or more WAN ports on the router? - Max -- Come build with us! The BlackBerryreg; Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9#45;12, 2009. Register now#33; http://p.sf.net/sfu/devconf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using Nagios with SNMP traps to monitor storage
On Thu, Sep 24, 2009 at 11:51 AM, Mikey Sklar sklarm-nag...@screwdecaf.cx wrote: Is anyone currently using nagios snmp traps to monitor this type of storage equipment? - EMC CX500 We are doing trap receipt for this, your SNMPTT rules will have to be pretty detailed as the EMC CX* series only sends out 3 traps, with specific error and recovery information embedded in each trap. I've tried setting up nagios + snmptrapd + snmptt as detailed here. http://xavier.dusart.free.fr/joomla/index.php/en/nagios/47-traps-snmp-dans-nagios I also converted two mibs with snmptt for EMC / Hitachi devices. The MIBs were not for the exact hardware models we have. Should I just being doing a snmpwalk since I cannot locate the proper MIBs. You will not be able to get this information from an SNMP walk as traps are send only events :) so if they are codified it is only in the vendor provided MIB(s). Also, I continue to see this message on the nagios console for the SAN devices. Is that what I should expect to see until a successful trap occurs? Status: PENDING Status Information: Service is not scheduled to be checked. Yes, that means no passive check has been received for the service. If you specify initial_stateo in your service config Nagios should set the initial state for the service to the specified state, which is o == OK in the above config snippet. I say should because currently with our Nagios 3.0.3 instance we see cases where passive services stay in pending instead of being initialized to OK per our configs. - Max -- Come build with us! The BlackBerryreg; Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9#45;12, 2009. Register now#33; http://p.sf.net/sfu/devconf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugin to monitor a file
Selvakumar A wrote: Dear All, I would like to monitor a file in the server,whether it is opened or modified.Is there any plugin available for that? Best place to check for these things are at NagiosExchange. http://www.monitoringexchange.org/cgi-bin/search.cgi?query=check_fileGo=Go Regards, Max -- Come build with us! The BlackBerryreg; Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9#45;12, 2009. Register now#33; http://p.sf.net/sfu/devconf ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to make 1 exception for 1 host when mapping services to hostgroups?
On Wed, Sep 9, 2009 at 2:42 AM, Janet Sullivan jsulli...@mndigital.com wrote: What kind of hardware are you on? We have 2 machines: * 1 does polling. nagios GUI, and trap ingest (SNMPTT) * 1 does PNP graphing and storage, MySQL storage for SNMPTT, notifications, and acts as a relay to send performance data off to our in house data warehouse. Each machines has: * Intel dual quad core CPU * 16 GB RAM * SCSI disks (no RAID) * Gigabit NIC We have a new set of hosts we are setting up now that have RAID 10, hoping to see some improvements on the RRD database performance with RAID 10 in place on our reporting and notification server (load is very low on the PNP servers though we do see some I/O wait .. hoping that will decrease with RAID 10). We are planning on doing HA between a redundant pair of PNP/notifications hosts using GlusterFS (still testing that out in our dev/test env). -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null