from:"Max Schubert"

[Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

Hi,

I like it when people periodically post numbers and architecture
summaries. I am guessing that, with the distributed frameworks now
available for Nagios, this thread might see bigger numbers than past
threads have.

With our custom-built distributed Nagios-based monitoring system, we
are currently monitoring 18000+ hosts and 100k+ active services
(plus plenty of passive services) every 5 minutes.  We also collect
performance data from every check and pass it on to a highly
distributed and scalable time-series data warehouse that another team
in our organization has built (which is why we have the 5-minute
interval requirement).
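
A rough sketch of the arithmetic behind those numbers, assuming the
rounded lower bounds quoted above and an even spread of checks across
the interval (the function name is illustrative, and the result is a
floor rather than a measurement):

```python
def checks_per_second(num_checks: int, interval_seconds: float) -> float:
    """Average check rate needed to run every check once per interval."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return num_checks / interval_seconds


# Rounded lower bounds from the post: 18000+ hosts and 100k+ active
# services, each polled every 5 minutes (300 seconds).
host_rate = checks_per_second(18_000, 300)
service_rate = checks_per_second(100_000, 300)

print(f"host checks/sec:    {host_rate:.1f}")
print(f"service checks/sec: {service_rate:.1f}")
```

Spread across several pollers, the per-poller rate is this total
divided by the number of pollers, which is the main reason to
distribute the load.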

We also do trap ingest using SNMPTT with a few custom mods, but I am
not going to include those numbers, as trap ingest has never required
the optimizations that polling has.

This isn't a monolithic instance: we have 6 projects using instances
of our distributed Nagios-based software, called Racon (soon my
manager will let our team package it as open source - so I hear at
least).  We built it on core Nagios with a custom database layer based
on a very, very early version of Merlin's database abstraction layer
(thank you Andreas!).  We have a custom client/server network-based
notification framework in use (we will release that as well), along
with a custom NEB/Perl-based client-server framework (also releasable,
we just need time scheduled) for sending and processing performance
data.  The performance and notification frameworks are both
horizontally scalable and network fault tolerant.

What kinds of numbers of hosts and services are you all monitoring?
Which add-ons / distributed frameworks are you using?

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

On Thu, May 17, 2012 at 11:57 AM, C. Bensend be...@bennyvision.com wrote:
 Yes, I know, VM bad.  :)  Just not bad enough to spend real dollars on
 more physical hosts.

Nothing bad about using a VM, they just fall over horribly (generally
speaking) when you try to push the virtual machine's virtual CPU cores
and disk hard :p - kudos to you for making that work and pretty
interesting setup!

Thanks for sharing.


Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

On Thu, May 17, 2012 at 12:19 PM, Mike Guthrie mguth...@nagios.com wrote:
 Had to reply to this one since I actually studied Max's blog articles
 for ideas when I was researching performance tuning.  Thanks Max for not
 only pioneering some of these ideas, but actually documenting what you
 did.  ; )
 http://www.semintelligent.com/blog/articles/38/nagios-performance-tuning-early-lessons-learned-lessons-shared-part-4-scalable-performance-data-graphing

Wow!  I forgot about that; glad the ideas in it helped you.  Nothing
pioneering (though thank you for saying that): I culled work from
others, and my teammate and I then built on it.

I am hoping we can release our network performance data sending
framework first.  It was initially based on the PNP modpnpsender
module and has since been rewritten from scratch by several of us,
based on Andreas's test NEB module.  That module showed how to have a
NEB module write to a pipe with a separate process listening on the
pipe, which minimizes the time spent inside Nagios and therefore the
latency added by the NEB-based add-in.
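
A minimal sketch of the listening side of that pattern, under stated
assumptions: the tab-separated record format (host, service, perfdata)
is hypothetical, and an anonymous pipe stands in for whatever pipe the
real NEB module writes to.  The point is only that the writer does a
cheap write and returns, while all parsing happens in the reader:

```python
import os


def parse_record(line: str) -> dict:
    """Split one hypothetical tab-separated record: host, service, perfdata."""
    host, service, perfdata = line.rstrip("\n").split("\t", 2)
    return {"host": host, "service": service, "perfdata": perfdata}


def drain(reader) -> list:
    """Read and parse records until the writer closes its end of the pipe."""
    return [parse_record(line) for line in reader if line.strip()]


# Demo: the "writer" plays the role of the NEB module and only writes
# raw lines; the "reader" plays the role of the listening process.
read_fd, write_fd = os.pipe()
with os.fdopen(write_fd, "w") as writer:
    writer.write("web01\tHTTP\ttime=0.012s;;;0\n")
    writer.write("db01\tLoad\tload1=0.42;5;10;0\n")
with os.fdopen(read_fd) as reader:
    records = drain(reader)

print(records)
```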

  Nagios XI Benchmarking Test Box
 - Running on one of our old Desktops:  1 Dual Core CPU, 4GB of RAM,
 CentOS 5.7
 - Ndoutils/MySQL offloaded to a second server
 - 1800 Hosts, 8200 services, 10k checks running every 5mn
 - Processing perfdata for all checks
 - rrdcached in use with PNP
 - RAM Disk being used for status.dat, objects.cache, host-perfdata,
 service-perfdata, all spool and tmp directories

Nice!


Re: [Nagios-users] check_load -w $ARG1$ -c $ARG2$

2012-05-08 Thread Max Schubert

Load average is a relative measure of how much work over time a system is doing.

Read these to start:

http://www.linuxjournal.com/article/9001
http://www.teamquest.com/pdfs/whitepaper/ldavg1.pdf
http://www.teamquest.com/pdfs/whitepaper/ldavg2.pdf
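
One way to read "relative" here: the same load average means different
things on machines with different CPU counts.  A small sketch,
assuming you want to compare load per CPU against warning and critical
thresholds (the function name and the threshold values are
illustrative, not check_load's own logic):

```python
def load_state(load: float, cpus: int, warn: float, crit: float) -> str:
    """Classify a load average after normalizing it by CPU count."""
    if cpus < 1:
        raise ValueError("cpus must be at least 1")
    per_cpu = load / cpus
    if per_cpu >= crit:
        return "CRITICAL"
    if per_cpu >= warn:
        return "WARNING"
    return "OK"


# The same raw load of 3.2 looks very different on 4 CPUs and on 1 CPU.
four_cpu_state = load_state(3.2, cpus=4, warn=1.0, crit=2.0)
one_cpu_state = load_state(3.2, cpus=1, warn=1.0, crit=2.0)
print(four_cpu_state, one_cpu_state)
```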


Re: [Nagios-users] Centralized Dashboard

2012-02-08 Thread Max Schubert

+1 for Thruk + multi-site - a well-done project, easy to set up and
easy to replicate a setup across multiple hosts.

I haven't tried it with a large Nagios installation.

Query capabilities are very cool as well.

Max

On 2/8/12, Joerg Linge pitchf...@ederdrom.de wrote:

 On 08.02.2012 at 14:40, trm asn wrote:

 Dear Friend,

 I have 7 Nagios core server running at 7 different location. I have
 installed MNTOS to get the consolidated tactical views. Is there any other
 way to do so.

 Yes!

 mathias-kettner.de/checkmk_multisite.html
 http://thruk.org/

 Joerg

Re: [Nagios-users] Nagios Optimization on CentOS

2011-12-27 Thread Max Schubert

I have written a number of blog posts about optimizing and tuning
Nagios performance as well - you might find some of them useful:

http://www.semintelligent.com/blog/

- Max


Re: [Nagios-users] Feeding status information to SCOM?

2011-12-20 Thread Max Schubert

There is a commercial Nagios-to-SCOM connector, and we use it at my
place of work.  Avoid it and work on a custom solution (which is the
path you are on); the commercial connector is awful:
* Only allows for one Nagios system to send to SCOM using SNMP traps
* Does not translate statuses properly
* Does not allow for event chaining

- Max


Re: [Nagios-users] Average Check latency and execution time growth - 3.2.3

2011-10-08 Thread Max Schubert

What minor RHEL rev are you running?  We had one poller running RHEL
5.3, a Compaq / AMD-based host, that had constantly increasing
latency.  None of the optimizations / configuration changes we made
to the other pollers we ran at the time seemed to help this one.  We
updated the poller in-box from 5.3 to 5.4 and voila - issue gone.

As Joerge mentioned, it was probably a memory leak / bug in a library
the parent Nagios poller process was using.  We never did determine
which one, and we haven't hit that same issue since then with any 5.4
or 5.5 pollers.

Even with stable software we end up bouncing our pollers every 2-3
days: 1) because we have an active customer base who make config
changes often, and 2) because we take the metrics from the checks and
put them in a time-series data warehouse that is sensitive to interval
skew.  Any poller that hits 10 seconds of latency has to be bounced.
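
The bounce rule can be sketched as a simple threshold check.  This is
an illustration only: the poller names and latency figures are made
up, and how the latency is actually collected (for example from
nagiostats) is left out:

```python
MAX_LATENCY_SECONDS = 10.0  # threshold quoted in the post


def pollers_to_bounce(latency_by_poller: dict) -> list:
    """Return the names of pollers whose latency has reached the threshold."""
    return sorted(
        name
        for name, latency in latency_by_poller.items()
        if latency >= MAX_LATENCY_SECONDS
    )


# Hypothetical average check latencies, in seconds, per poller.
sample = {"poller01": 0.4, "poller02": 12.7, "poller03": 10.0}
to_bounce = pollers_to_bounce(sample)
print(to_bounce)
```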

We are at 12 pollers or so right now and we will be up to almost 20 by
next year at this time.

Max

On 10/2/11, Stuart Browne stuart.bro...@ausregistry.com.au wrote:
 Hi,

 I know this topic has been covered many times, but I've tried those tweaks
 and I have the remaining issue.

 After a few days, the latency on checks explodes.  It goes along quite
 happily with small values, then after (about) 3 days, the values rise quite
 sharply.  I've recently been graphing performance statistics (nagiostats,
 mrtg) and as you can see by the two attachments (day, week), it's rather
 surprising.

 We restart Nagios every few days (for other reasons) so thankfully the issue
 never gets completely out of control, but as you can see, it gets a bit
 crazy.

 I can't think of any combination of settings that would cause such growth
 after such a long period of time.  Does anybody have any knowledge as to why
 it would suddenly increase after running for days without issue?

 Basic Nagios system stats:
   2 x dual-core Xeon 5160 (3Ghz)
   6GB Memory
   4 x SAS, RAID1 (hardware, BBU, LVM over RAID1)
   RHEL5, fully patched
   Load average between 0.5 and 3.2

 'nagios -s /etc/nagios/nagios.cfg' output (trimmed):

 HOST SCHEDULING INFORMATION
 ---
 Total hosts: 252
 Total scheduled hosts:   252
 Host inter-check delay method:   SMART
 Average host check interval: 300.00 sec
 Host inter-check delay:  1.19 sec
 Max host check spread:   30 min
 First scheduled check:   Mon Oct  3 14:31:17 2011
 Last scheduled check:Mon Oct  3 14:36:15 2011


 SERVICE SCHEDULING INFORMATION
 ---
 Total services: 1575
 Total scheduled services:   1386
 Service inter-check delay method:   SMART
 Average service check interval: 878.40 sec
 Inter-check delay:  0.63 sec
 Interleave factor method:   SMART
 Average services per host:  6.25
 Service interleave factor:  6
 Max service check spread:   30 min
 First scheduled check:  Mon Oct  3 14:33:43 2011
 Last scheduled check:   Mon Oct  3 14:48:21 2011

 CHECK PROCESSING INFORMATION
 
 Check result reaper interval:   5 sec
 Max concurrent service checks:  Unlimited


 PERFORMANCE SUGGESTIONS
 ---
 I have no suggestions - things look okay.

 Stuart J. Browne
 Senior Linux Administrator
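
The figures in the scheduling output above are consistent with simple
averages.  A small sketch of that arithmetic, as a reading of the
numbers rather than a copy of the Nagios scheduling code (the function
name is illustrative):

```python
import math


def inter_check_delay(avg_check_interval_s: float, scheduled: int) -> float:
    """Spread one interval's worth of checks evenly across the interval."""
    return avg_check_interval_s / scheduled


# Values taken from the quoted 'nagios -s' output.
host_delay = inter_check_delay(300.00, 252)
service_delay = inter_check_delay(878.40, 1386)
services_per_host = 1575 / 252          # total services / total hosts
interleave = math.ceil(1386 / 252)      # scheduled services per host, rounded up

print(f"host inter-check delay:    {host_delay:.2f} sec")
print(f"service inter-check delay: {service_delay:.2f} sec")
print(f"average services per host: {services_per_host:.2f}")
print(f"service interleave factor: {interleave}")
```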



Re: [Nagios-users] Observing latency

2011-09-05 Thread Max Schubert

A RAM disk for the NPCD queue plus rrdcached is a necessity for scaling
a PNP backend.  PNP in asynchronous mode is a must as well, preferably
on a server other than the primary pollers - for example, use Merlin
to distribute events and then designate one or more servers purely to
processing PNP's RRD-based graphs.

- Max


Re: [Nagios-users] Help required on Nagios Setup

2011-06-14 Thread Max Schubert

On Tue, Jun 14, 2011 at 10:28 PM, sunilang...@rediffmail.com wrote:

 Hi All

 Let me introduce myself. My name is Sunil Angane and I am working with
 Softenger (I) Pvt. Ltd., an IT Infrastructure Management Services company.

 We are doing Nagios setup at one of our prestigious client in Mumbai to
 cover OS and DB monitoring. As we are doing this for the first time, we need
 guidance from you all on this.

 Under OS part we are covering AIX, LINUX, SOLARIS and WINDOWS, under DB
 part ORACLE, SQL, SYBASE and DB2. Kindly share the installation /
 configuration documents related to above listed OS and DB flavors.

 Kindly let me know if anyone from negios user is residing in Mumbai or
 India so that it will be easier for me to contact.


Do you really expect the community to execute your contract for you?
Interesting!

Start here!

http://wiki.nagios.org/index.php/Main_Page

If you have directed questions that show you have done some reading and
educated yourself you will get useful answers from the group.

- Max

Re: [Nagios-users] Is the nagios log file keeps growing over time

2011-06-10 Thread Max Schubert

On Fri, Jun 10, 2011 at 2:59 PM, Manish Kumar manikuma...@gmail.com wrote:

 Hello Frnds,
 I have got a concern, Will the nagios log file (nagios.log) in
 /usr/local/nagios/var/nagios.log
 will keep increasing in size or there is some auto-rotation facility. If it
 is there how it can be done.

http://nagios.sourceforge.net/docs/3_0/configmain.html#log_rotation_method

- Max


Re: [Nagios-users] [Nagios-devel] Q: Service Escalation Recovery Notifications.

2011-05-20 Thread Max Schubert

Hi,

On Thu, May 19, 2011 at 10:10 AM, Andreas Ericsson a...@op5.se wrote:
 On 05/19/2011 03:32 PM, Paul M. Dubuc wrote:
 OK, but wouldn't it be nice if all contacts who got an error notification 
 were
 able to get the recovery message instead of just the one last notified?  Is
 there any way to do that?  Setting up an explicit serviceescalation for
 recovery notifications doesn't seem to work.


 Max Schubert is working on a patch that does something similar to that.
 If he doesn't complete it, I might take a look at adding it myself.

I will send out my partial patch to the list sometime today along with
an explanation of my thinking / approach for it - feel free to use it
or discard it as you see fit :)!

- Max


Re: [Nagios-users] Notification configuration (Was [Nagios-devel] RFC/RFP: Service parents)

2011-05-18 Thread Max Schubert

Andreas,

On Tue, May 17, 2011 at 7:57 AM, Andreas Ericsson a...@op5.se wrote:
 Any plans to detach notification attributes from service / host
 definitions in 4.x and make them their own top-level configuration
 class, like escalations, to make it easier to scale notification
 definitions for large projects?


 Not really. What would such an object look like? How would it add
 additional benefit compared to using templates for hosts and services?
 I think if I could just see some sort of example definition of it I'd
 get an inkling of why some seem to think it's such a great idea. Right
 now, I see no additional benefit to it.

It would look just like an escalation.  Embedding notification
policies in host and service objects does not work well for large
configurations in the following scenario (which is the one we are in
at work, by design):
* Multiple configuration editors who own various parts of the Nagios
configuration tree - in our case this used to be one big tree, now we
have set up separate trees for separate projects - we have about 20-30
people who can edit their project-specific configurations.
* A set of services that are global in nature (service -> hostgroup
-> host): baseline monitoring required by all projects, using
standards established by multiple organizations in our company.  For
our example, base host monitoring with an SNMP agent (6 services
across every host).  We have other global services as well, and a core
team who develop, maintain and augment both our distributed Nagios
software and these global services and configurations.
* A set of services that are specific to each project using our
distributed variant of Nagios - managed by subject matter experts on
each team.

With this scenario, how do we let each group that is responsible for
hosts that have these global services on them create individually
tailored notification policies when there is only one notification
policy per service?  Here is what we do:
* We configure our base service and host to 'notify' on every state
change using the command name do_nothing
* We created a custom patch so that when the string 'do_nothing' is
seen in the command name, the state change only increments the
notification count - it does not trigger any external command to run
* We created a patch (partial - no serialization to disk) for
escalation logic that tracks in memory when a fault escalation was
sent, so that OK escalations are only sent in response to something
that was in a fault state.  We are working on completing this patch so
that the state is saved across restarts.
* We have all groups use escalations to define their notification
policies - the service and host notification commands then trigger our
distributed pollers to send escalation requests to a network-based
notification service we have, which then lets the notification
requests trigger email, SMS, SNMP traps, etc. without having to
reconfigure Nagios for every notification transport / method change.
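
The in-memory tracking described in the third bullet can be sketched
roughly as follows.  This is an illustration of the idea only: the
real patch lives inside Nagios and is not shown in this post, and the
class and method names are hypothetical:

```python
class EscalationTracker:
    """Remember which services had a fault escalation sent, so that an
    OK (recovery) escalation only goes out in response to a real fault."""

    def __init__(self):
        # In-memory only, like the partial patch: lost on restart.
        self._faulted = set()

    def should_send(self, host: str, service: str, state: str) -> bool:
        key = (host, service)
        if state != "OK":
            self._faulted.add(key)
            return True
        if key in self._faulted:
            self._faulted.discard(key)
            return True
        return False


tracker = EscalationTracker()
results = [
    tracker.should_send("web01", "HTTP", "OK"),        # no prior fault
    tracker.should_send("web01", "HTTP", "CRITICAL"),  # fault escalation
    tracker.should_send("web01", "HTTP", "OK"),        # matching recovery
    tracker.should_send("web01", "HTTP", "OK"),        # already recovered
]
print(results)
```

Because the set lives only in memory, a restart between the fault and
the recovery would drop the recovery escalation, which is the gap the
unfinished serialization work is meant to close.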

Yeah, it is very ugly, and why?  Because there is only 1 notification
policy per service, and that doesn't scale well when taking advantage
of service -> hostgroup -> host mappings, which is a critical pattern
to use when scaling a configuration.

We have over 9000 hosts being monitored by our distributed framework
(and growing) with around 30 configuration editors and 120+ users.
Our distributed framework used to be centralized, a "one project for
all" setup, but is now a cluster of distributed setups, one per
project, which is scaling nicely.  Our largest distributed
installations have 3900 and 5100 hosts in them respectively - we have
4 other distributed instances that are just getting ramped up and only
have a few dozen hosts apiece at this point.

So while this is ugly, it works!  All editors can define escalation
objects that take into account both their individual needs for global
service notifications as well as any project-specific notifications -
and by putting project-specific hosts in project-specific host groups,
for most groups, two escalation policy definitions are all that are
needed per project - one for hosts, one for services.

If all notifications were just done through an escalation-like
configuration object, life for a big project would be much easier.
1) Having notifications clearly separated as their own configuration
template in the Nagios DSL makes it much less confusing for people new
to Nagios to understand 'where to configure notifications'
2) The configuration flexibility of the escalation template makes it
very easy to work with for a large configuration.

Our global and project-specific scenario and all the notification
changes we made are also serving us very well as we grow.

Notifications as separate objects would let us back out a number of
patches, would really simplify our configuration, and would let our
pollers run hotter.

- Max


Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents

2011-05-17 Thread Max Schubert

Andreas,

How does this differ functionally from the current ability to
describe dependencies with servicedependency?  Is it just easier to
define?  (I agree that would be a win.)


Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents

2011-05-17 Thread Max Schubert

On Tue, May 17, 2011 at 5:48 AM, Andreas Ericsson a...@op5.se wrote:
 On 05/17/2011 11:34 AM, Max Schubert wrote:
 Andreas,

 How does this differ functionally from the current ability to
 describe dependencies with servicedependency?  Is it just easier to
 define?  (I agree that would be a win.)


 It's just easier to describe, and it ties in nicely with different
 additions to the configuration syntax being planned. It does have
 some limitations, but in 99.5% of all dependencies I've seen, it
 matches the usecase people put dependencies to so it should be quite
 a nice win anyway.

I agree - the current syntax is very flexible, but for some users
understanding how to implement it properly involves a very painful
learning curve.

Any plans to detach notification attributes from service / host
definitions in 4.x and make them their own top-level configuration
class, like escalations, to make it easier to scale notification
definitions for large projects?

- Max


Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation

2011-02-28 Thread Max Schubert

Please respond to the list with the version of Nagios you are using
and an example contact and/or serviceescalation definition that you
are using.

- Max


Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation

2011-02-28 Thread Max Schubert

Have you tried removing the trailing commas from the 'members'
attribute of the two contactgroup definitions?
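
A quick way to spot such lines, as a sketch: the helper below scans
config text for 'members' values that end in a comma.  It assumes the
directive and its value sit on one line, and it only finds candidates;
whether the trailing comma is what triggers the warning still has to
be confirmed by removing it and re-checking the config.  The function
name and sample text are illustrative:

```python
import re

# Matches a 'members' directive and captures its value without
# surrounding whitespace.
MEMBERS_RE = re.compile(r"^\s*members\s+(?P<value>.*\S)\s*$")


def trailing_comma_lines(config_text: str) -> list:
    """Return 1-based line numbers whose 'members' value ends with a comma."""
    hits = []
    for number, line in enumerate(config_text.splitlines(), start=1):
        match = MEMBERS_RE.match(line)
        if match and match.group("value").endswith(","):
            hits.append(number)
    return hits


SAMPLE = (
    "define contactgroup{\n"
    "    contactgroup_name   oncall\n"
    "    alias   Oncall\n"
    "    members user1,user2,\n"
    "}\n"
)
found = trailing_comma_lines(SAMPLE)
print(found)
```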

- Max

On Mon, Feb 28, 2011 at 4:47 PM, Chad Rhyner crhy...@box.net wrote:
 On Mon, Feb 28, 2011 at 12:27 PM, Max Schubert m...@webwizarddesign.com
 wrote:

 Please respond to the list with the version of Nagios you are using
 and an example contact and/or serviceescalation definition that you
 are using.

 - Max

 Certainly.  The version I am using is 3.0.6.  Here is a copy of the output:

 Nagios 3.0.6
 Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org)
 Last Modified: 12-01-2008
 License: GPL

 This is being run on a 64-bit Linux machine.  Here is an example of the
 serviceescalation definition:

 define serviceescalation{
     hostgroup_name  sms
     service_description HTTP
     first_notification  1
     last_notification   2
     notification_interval   10
     contact_groups  oncall,email
 }

 Here are the contactgroups related to that serviceescalation:

 define contactgroup{
     contactgroup_name   oncall
     alias   Oncall
     members user1,user2,
 }

 define contactgroup{
     contactgroup_name   email
     alias   Email
     members
 user1_email,user2_email,user3_email,...usern_email,
 }

 For both contactgroups oncall and email, there is a contact defined for each
 user that is named.  I have changed the usernames for privacy reasons.

 ~Chad



Re: [Nagios-users] Which GUI to configure Nagios 3 ?

2011-02-25 Thread Max Schubert

We do have 60+ sysadmins using a shared instance of Nagios without a
configuration UI - all Nagios configuration files are in SVN, and we
currently monitor around 5300 hosts.

We found with a little training the sysadmins were happy to take the
hit on learning in order to have the flexibility of the full power of
the config language.

We avoided implementing or integrating a web-based UI because all of
them take away various capabilities of the language - we did not want
to treat our audience like dummies, so instead we went the path of
providing training and mentoring to them and empowered them to use the
language themselves.

Was this a lot of work for us, especially when we started?  YES.

Was it an approach that respected our user base and gave them the
ability to do what they needed to do with minimal intervention from
us? YES.

- Max

On 2/24/11, Breandan Dezendorf brean...@dezendorf.com wrote:
 On Thu, Feb 24, 2011 at 1:54 PM, Ennis Ibarra en...@nmcourts.gov wrote:
 My two cents:

 Once you lock up your configuration to a given tool, then you stick with
 the
 tool capabilities to configure stuff. The real power is understanding
 the Nagios structure and files, leave the GUIs for presentation only.

 I agree - but when you're trying to serve a large community of
 sysadmins, a GUI tool is a useful tool for getting the job done.  I
 can't teach 60+ admins how to write clean nagios configs, but I can
 teach them to use web-based tools.

 --
 Breandan Dezendorf
 brean...@dezendorf.com
 bwdez...@gmail.com


Re: [Nagios-users] check_hpasm and check_openmanage over nrpe in windows

2011-02-25 Thread Max Schubert

You can configure the SNMP agent to send SNMPv3 traps with encryption.

On 2/25/11, Assaf Flatto nag...@flatto.net wrote:
 The main issue is that SNMP is a clear text protocol with no way to
 encrypt the data, while you can encrypt the NRPE traffic with SSL.

 Assaf


 Alexandre wrote:
 hello,
 I think using nrpe is not a good idea. I use snmp Trap of dell
 openmanage for alert in Nagios.

 Alex

 On 25/02/2011 12:38, Assaf Flatto wrote:

 Hello All

 I have encountered an issue with wanting to monitor HP and Dell servers
 running Windows.
 The main issue is that for security reasons we cannot use the
 NSClient++ internal functionality, only the NRPE module option.
 I have not used NRPE on Windows boxes extensively before and wanted
 to know if anyone has deployed check_hpasm and check_openmanage on
 Windows boxes where the Nagios server can only access the NRPE port and
 does not have direct SNMP access to the server?


 Thanks

 Assaf




Re: [Nagios-users] Multi-site frontend options

2011-02-25 Thread Max Schubert

Definitely check out Merlin -  through configuration you can have Nagios
instances communicate events with each other using a few types of
relationships and you can then serialize the data from an instance to a
database for either using their custom UI or making one of your own.

http://www.op5.org/community/plugin-inventory/op5-projects/merlin

When we used it back in early 2010 it was not quite ready with regards to
event distribution but I would think it is more mature by now - definitely
worth spending some proof of concept time on to explore as an option.

We have been using the database serialization portion of Merlin for the last
1.5 years, we have made mods for it to meet our purposes (hope to share some
of that code soon on our github account - one cool thing we added was
threaded MySQL connections from Merlind to the database which increased
throughput tremendously).

- Max

Re: [Nagios-users] NAGIOS does not register Critical | Warning status.

2011-02-22 Thread Max Schubert

There is a debug level you can set in nagios.cfg that will show you
all commands being executed - an easier way would be to make a shell
wrapper for check_int that logs all args and then execs them with the
real check_int.  Something like

1.  Copy check_int to check_int.bin
2.  Make check_int a shell script like

#!/bin/bash

# Log the full argument list, then hand off to the real plugin
echo "$*" >> /path/to/logfile.log
exec /path/to/check_int.bin "$@"


Now whenever check_int is called, the full command line and args will
be appended to the log each run for you to inspect - without cranking
up debug or restarting nagios.
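
Since the thread is about a state not registering, it can help to record the plugin's output and exit status as well as its arguments. Here is a minimal sketch of such a variant, assuming the real plugin was copied to check_int.bin as in step 1. The function name run_and_log and the paths in the usage comment are placeholders, not part of any Nagios tooling:

```shell
#!/bin/bash
# Hypothetical variant of the wrapper: log arguments, output, and exit status.
# run_and_log LOGFILE COMMAND [ARGS...]
run_and_log() {
    local logfile output status
    logfile=$1
    shift
    # Run the real plugin, capturing what it prints and how it exits
    output=$("$@" 2>&1)
    status=$?
    # One log line per run: timestamp, command line, exit status, output
    printf '%s cmd=[%s] status=%d output=[%s]\n' \
        "$(date '+%Y-%m-%d %H:%M:%S')" "$*" "$status" "$output" >> "$logfile"
    # Hand the plugin's output and exit status back to Nagios unchanged
    printf '%s\n' "$output"
    return "$status"
}

# Example use inside the wrapper script (paths are placeholders):
#   run_and_log /path/to/logfile.log /path/to/check_int.bin "$@"
#   exit $?
```

Unlike the exec version, this keeps the wrapper process alive for the duration of the check, which is the price of being able to see the exit status.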

Max

On 2/22/11, socomm soc...@crowbar.cc wrote:
 On Mon, 21 Feb 2011 20:23:54 -0500, Allan Clark
 all...@chickenandporn.com
 wrote:
 Can you confirm that %% is needed for a single % token?  I don't
 remember that being needed, but I don't use % in my commands.  I
 could see $ needing doubling up due to token replacement.

 Apparently check_nt returns the same code whether I pass it the '%' sign
 or not; the double '%' was a typo, which I've corrected.

 Also, see about increasing logging, see if you can log what the actual
 command is running as, and what it responds as a result code.  Maybe
 proving that the check_nt script is being called in the way you think
 it is can help bisect the problem.

 I am looking for a way to log commands, but best I've found is the debug
 option which will give me option to log commands to a debug file. However,
 the output I get is the following:

 -  OUTPUT  -

 Raw Command Input: /usr/lib/nagios/plugins/check_nrpe -H $HOSTADDRESS$ -c
 $ARG1$
 Expanded Command Output: /usr/lib/nagios/plugins/check_nrpe -H
 $HOSTADDRESS$ -c $ARG1$

 

 I'm not sure why the variables are not being expanded to the actual
 arguments being passed. Is anyone more familiar with what I may need to do to
 view the expanded variables being passed?





Re: [Nagios-users] Nagios::Plugin and Nagios Embedded perl interpretor

2011-02-08 Thread Max Schubert

Marc,

On Tue, Feb 8, 2011 at 6:16 AM, Marc-André Doll m...@b-care.net wrote:
 Hi list,

 I'm developing some Nagios plugins with the Nagios::Plugin perl module
 and it's saving me a great amount of time.

 However, I encountered a little problem when I tried to activate the
 embedded perl interpreter in Nagios: all checks returned an UNKNOWN
 state with an error from the ePN saying that an error occurred in the
 Getopt::Long perl module about the use of an uninitialized value at line
 459.

 I suppose the problem comes from my usage of the add_arg() function but I
 don't know how to correct it.

 Has anybody already encountered that kind of behaviour, or does anyone have an
 idea on how to debug a perl script when it's launched by the ePN?

Check your command lines, especially if you use custom attribute
substitution in them - look for attributes that are empty (no spaces).
An empty attribute gets translated into a NULL or some other non-printable
character that Getopt::Long chokes on:

http://www.semintelligent.com/blog/articles/50/epn-why-do-i-get-an-unitialized-value-error-message-from-my-getopt-longpm-when-nagios-runs-my-perl-based-plugin-under-epn

- Max


Re: [Nagios-users] Backing up Nagios on Ubuntu 10.04

2011-01-05 Thread Max Schubert

While a live backup is definitely a good idea, if you start storing
your configuration tree in a version control system off server - CVS,
Git, SVN, or any other one you choose - then the backup issue on the
live server becomes only one of backing up retention.dat, which holds the
changes you / your users have made to the states of notifications,
flapping, etc. from the CGIs or via the named pipe.
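
With the configuration in version control, the remaining piece can be a small scheduled copy of retention.dat. This is a minimal sketch: the function name, the paths in the usage comment, and the default keep count of 14 are assumptions to adapt to your install (the retention file location is whatever state_retention_file points to in nagios.cfg):

```shell
#!/bin/bash
# Hypothetical helper: keep timestamped copies of retention.dat.
# backup_retention SOURCE_FILE BACKUP_DIR [KEEP]
backup_retention() {
    local src dest_dir keep
    src=$1
    dest_dir=$2
    keep=${3:-14}
    [ -f "$src" ] || { echo "missing $src" >&2; return 1; }
    mkdir -p "$dest_dir" || return 1
    # Copy with a timestamp suffix so older states stay recoverable
    cp -p "$src" "$dest_dir/retention.dat.$(date +%Y%m%d%H%M%S)" || return 1
    # Prune all but the newest $keep copies (names sort chronologically)
    ls -1 "$dest_dir"/retention.dat.* | sort -r | tail -n +"$((keep + 1))" |
        while IFS= read -r old; do rm -f -- "$old"; done
}

# Example use from cron (both paths are placeholders):
#   backup_retention /path/to/retention.dat /path/to/backups 14
```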

- Max


Re: [Nagios-users] Alert host down with passive check

2010-12-10 Thread Max Schubert

Read up on freshness checking:

http://nagios.sourceforge.net/docs/3_0/freshness.html
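
As a rough illustration of how this fits together: with check_freshness and freshness_threshold set on the passive host (and check_host_freshness enabled in nagios.cfg), Nagios runs the host's check_command once no result has arrived in time, so that command only has to report failure. Below is a minimal sketch of such a command. The function name, the message text, and the 300-second value are assumptions:

```shell
#!/bin/bash
# Hypothetical stale-result handler for a passively checked host.
# Host definition directives this relies on (300 s is only an example):
#   active_checks_enabled   0
#   passive_checks_enabled  1
#   check_freshness         1
#   freshness_threshold     300
#   check_command           <a command that runs this script>
report_stale() {
    local threshold
    threshold=${1:-300}
    echo "CRITICAL: no passive check result received in ${threshold}s"
    # Exit status 2 is treated as DOWN for a host check
    return 2
}

# In the script itself:
#   report_stale "$@"; exit $?
```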

max

On 12/10/10, Rikard Dahlberg ej_...@hotmail.com wrote:

 Heya guys!

 Is there any possible way to configure nagios to report a host as DOWN if
 nagios hasn't got any passive check result within something like 5 minutes?
 Can I change that somehow? For now, when a host dies in my current
 configuration, it actually doesn't get reported as DOWN, since I'm using passive
 checks. If I haven't got a new passive check result in 5 minutes, I want
 nagios to automatically set the host as DOWN...

 Any idea please? :)

 Regards
 Rikard
   


Re: [Nagios-users] high latency

2010-12-03 Thread Max Schubert

I find it interesting that a number of users get performance
improvements with embedded perl off - we lose 20-40% polling capacity
per poller with it off.

- Max


Re: [Nagios-users] high latency

2010-12-03 Thread Max Schubert

Latency increases much more quickly for us without epn as execution
times are noticeably longer per check.

We use rhel 5.x, so the perl is 5.8.8.

We have semi-daily updates to our pollers and with epn that means
cold restarts - memory leaks have not been noticeable given that
scenario, and on test hosts or hosts where we are doing burn-ins the leak is
negligible enough that we can go for 2-3 days with no memory issues -
we always hit service latency thresholds first.

7 seconds of latency is in general the point where we have to force a
restart of our pollers to prevent metric collection and snmp delta calculation
issues.
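
A threshold like this could be watched with a small script. This is only a sketch, assuming the average active service check latency is available in milliseconds (for example from nagiostats in MRTG mode with the AVGACTSVCLAT variable). The function name and the 7000 ms default are illustrative, not a description of any existing tooling:

```shell
#!/bin/bash
# Hypothetical helper: succeed when latency is above the restart threshold.
# latency_exceeded LATENCY_MS [THRESHOLD_MS]
latency_exceeded() {
    local latency_ms threshold_ms
    latency_ms=$1
    threshold_ms=${2:-7000}
    [ "$latency_ms" -gt "$threshold_ms" ]
}

# Example use (assumes nagiostats is on PATH and can read the status data):
#   ms=$(nagiostats -m -d AVGACTSVCLAT)
#   if latency_exceeded "$ms"; then echo "latency high, restart poller"; fi
```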

Max

On 12/3/10, Andreas Ericsson a...@op5.se wrote:
 On 12/03/2010 12:46 PM, Max Schubert wrote:
 I find it interesting that a number of users get performance
 improvements with embedded perl off - we lose 20-40% polling capacity
 per poller with it off.


 How do you mean that you're losing capacity? Does latency start to creep
 upwards or is load increasing?

 Out of interest; How much memory does epn leak nowadays, and which perl
 version is it compiled against?

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] How can i show remote nagios results in one interface?

2010-12-02 Thread Max Schubert

How about Thruk?

http://www.thruk.org/

- Max


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-29 Thread Max Schubert

Thank you, Andreas - I actually really enjoy writing tests and
understand their value - will see if I can put a few in place using
tap over the next week or so and update the patch with them.

On 11/29/10, Andreas Ericsson a...@op5.se wrote:
 On 11/29/2010 09:20 AM, Andreas Ericsson wrote:

 This looks nice. It's in my compile and test queue right now, so
 assuming it works out ok it'll be committed before the week is out.


 And now it's out there. More testing would be much appreciated though,
 but considering the patch is fairly well written I have few qualms
 about it.

 Cheers, and thanks again.

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-28 Thread Max Schubert

Hi,

Long overdue and anxiously waited on by at least 3 people :), here is
the patch, diffed against the CVS HEAD (as of tonight) revision of
Nagios, with a documentation patch for configmain.html as well (as
requested by Andreas):

The option is now named

allow_empty_hostgroup_assignment

Thanks to Dan Wittenberg and Michael Friedrich for their input on the
option name and the patch itself - thanks to my managers at Comcast (a
great place to work as a developer), Eric Scholz and Mike Fischer, for
allowing the team I am on to contribute code back to the community,
and thanks to the team I am on for their hard work and dedication with
our distributed Nagios setup and the many other projects we are
involved with.

Andreas, please let me know if you'd like any further modifications or
documentation for the patch beyond what is provided.

https://github.com/netoisstools/Nagios-3-Patches/blob/master/CVS-HEAD-201011292220/201011281330-allow_empty_hostgroup_assignment.patch

- Max


Re: [Nagios-users] More than one command defined

2010-11-19 Thread Max Schubert

We went through a couple of rounds of changes with this:
* We first hacked our copy of process_perfdata.pl and added logging to
it, then built a relay to our long-term trending data warehouse that
forwards Nagios perfdata to it after PNP has processed it for
short-term views of the data

Nagios -> PNP NEB -> network -> performance data server -> NPCD queue
-> process_perfdata.pl -> log -> relay -> network -> data warehouse

* Now we have a POE-based solution in place that processes the
perfdata for PNP and our relay as two data-driven events

Nagios -> PNP NEB -> network -> performance data server -> POE server
POE server -> RRD files (replaces npcd and process_perfdata.pl)
POE server -> network -> data warehouse network listener

For both methods we first built off of the (at the time very young)
modpnpsender.c NEB module source included with PNP; we then migrated
to a NEB module whose code was based on some NEB code Andreas wrote
that showed how to write to the input of a second external program
over a pipe, plus an external perl script that relayed the data to the
destination servers over the network (or queued it if the network was
down).
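
The relay-or-queue behaviour at the end of that chain can be pictured with a small sketch. This is not the perl code described above, only a shell illustration under assumptions: the function names, the spool file, and the convention of passing the sender as a command that reads one line on stdin are all hypothetical:

```shell
#!/bin/bash
# relay_or_queue SPOOL_FILE SENDER [ARGS...]
# Reads perfdata lines on stdin; sends each with SENDER, spooling on failure.
relay_or_queue() {
    local spool line
    spool=$1
    shift
    while IFS= read -r line; do
        if ! printf '%s\n' "$line" | "$@"; then
            # Destination unreachable: keep the line for a later flush
            printf '%s\n' "$line" >> "$spool"
        fi
    done
}

# flush_queue SPOOL_FILE SENDER [ARGS...]
# Retries spooled lines; anything that fails again is spooled once more.
flush_queue() {
    local spool pending
    spool=$1
    [ -s "$spool" ] || return 0
    pending="$spool.$$"
    mv "$spool" "$pending" || return 1
    relay_or_queue "$@" < "$pending"
    rm -f "$pending"
}
```

Moving the spool aside before replaying it means lines that fail a second time land in a fresh spool file instead of being appended to the file being read.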

Code will be available open source this year for people who are interested.

- Max


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-19 Thread Max Schubert

Will try to get this done this weekend - sorry all, it is an irony to
be working for a company that allows us to release code from work to
open source and be so busy at the job that I rarely have time to do so
:).

My apologies for the delay - and thanks for your input, help, and
feedback on the patch, Dan.  Glad you find it useful.

- Max

On Fri, Nov 19, 2010 at 10:40 AM, Daniel Wittenberg
daniel.wittenberg.r...@statefarm.com wrote:
 I'll check with the original author of the patch...I re-rolled it
 already against head and tested great, been a lifesaver.  I sent the
 updates to him so he could post and move forward with so if he's not had
 time I'll see about submitting directly.


Re: [Nagios-users] Notifications on passive service checks

2010-11-16 Thread Max Schubert

Escalations can resolve this - if you set an escalation to trigger at
notification 1 and set a re-notify interval in the escalation you will
be re-notified at the set interval while the service remains in the
configured alarm state.

- Max

On Tue, Nov 16, 2010 at 4:41 PM, Alberto Menichetti
alb.meniche...@tai.it wrote:
 Hi all,

 I noticed the same strange behavior, but I don't think it's the right
 behavior.
 Operating in this way, a linkDown trap will be notified only once (in fact
 the sender device will generate a single trap in response to state change).
 Is it possible to modify this behavior?



 Hall, JC wrote:

 After some testing, it looks like it will only re-notify after receiving
 another passive check result.  It won't simply re-notify because it's still
 in a non-ok state after the notification_interval has expired.  So to combat
 this I just used the check freshness attribute to re-execute my external
 script and feed the passive check result into nagios and thus re-sending a
 non-ok notification at what would have been the interval for notifications.

 So technically my external scripts are running at every interval to check
 the freshness, not only when they're called by my event_handler from another
 active service check...  which I'm ok with.

 -Original Message-
 From: Andreas Ericsson [mailto:a...@op5.se]
 Sent: Friday, November 12, 2010 5:00 AM
 To: Nagios Users List
 Cc: Hall, JC
 Subject: Re: [Nagios-users] Notifications on passive service checks

 On 11/11/2010 11:27 PM, Hall, JC wrote:


 Is it accurate that Nagios will only send 1 notification for a
 passive service check?

 IE, the notification_interval definition for a passively checked
 service won't instruct Nagios to re-send a notification such as with
 actively checked services?



 To be honest, I haven't got the faintest idea. An educated guess is
 that it will re-send the notification if it receives another passive
 check-result and enough time has passed though, or that it simply
 re-sends the notification when enough time has passed.

 If you try and find out, let me know either way and I'll amend the
 docs.



 --
 TAI S.r.l.

 Alberto Menichetti
 Area Mercato - Ingegneria dei Sistemi
 System Engineer

 50141 Firenze - Via Pazzagli, 2
 Voice: +39 055 42661 - Fax +39 055 4266356
 56125 Pisa - Viale Gramsci, 12
 Voice: +39 050 220221 - Fax: +39 050 24421

 e-mail: alb.meniche...@tai.it
 http://www.tai.it


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-03 Thread Max Schubert

Will do my best to do this this week - if I don't get to it this week
I will forget about it for sure. All the patches our team has made to
make Nagios work for us (some very, very specific to our env) will be
available on our team github account; I will do my best to get that
updated this week as well and post that url along with the patch /
docs to this list, or I will forget to do that for another few months
too.

Max

On 11/3/10, Andreas Ericsson a...@op5.se wrote:
 On 11/02/2010 04:54 PM, Max Schubert wrote:

 allow_empty_hostgroups = 1


 I'll take this, but rename the option to something more mnemonic, as
 empty hostgroups are already allowed and it's a bit ambiguous what it
 means.

 Any chance you could re-roll the patch against latest CVS head and
 include a brief documentation update? Name the var allow_empty_hostgroups
 in the docs so I can find it when I sed it in.

 If you want any special attributions (company or person thanks, beyond
 the addition of your name to the THANKS file and your name and email in
 the commit log message), let me know.


 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-02 Thread Max Schubert

On Tue, Nov 2, 2010 at 10:52 AM, Fredrik Barrud fred...@barrud.se wrote:
 Hi,

 I would also like to be able to create empty hostgroups with services 
 assigned.

 The reason for this is that I've setup a nagios installation for a customer 
 where a host is only member to one hostgroup to inherit services from other 
 hostgroups, like all-windows-citrix-server and for a few special 
 servertypes I would like to prepare the chain with hostgroup and services so 
 that when the servers are ready to be monitored everything is prepared and 
 the servers only needs to be added to the apropriate hostgroup.

We have a local patch to turn the error into a warning, patched
against 3.2.0 - we needed this because we have customers who can use
host -> hostgroup -> N related services mappings but we do not control
their configs, we only define the global service sets - so we could
not have this be fatal and fail everyone's changes just because one
customer stopped using the mapping.

We have been using this patch in production for a year without issues.

To apply:
1) Put patch in top level source directory
2) patch -p0 < name-of-patch
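
Spelled out as commands, the two steps can be run with a dry run first, so a mismatch against your source tree shows up before anything is modified. This is a sketch only: the function name is made up, and --dry-run assumes GNU patch:

```shell
#!/bin/bash
# apply_patch SOURCE_DIR PATCH_FILE
# PATCH_FILE is absolute or relative to SOURCE_DIR.
apply_patch() {
    local src_dir patch_file
    src_dir=$1
    patch_file=$2
    (
        cd "$src_dir" || exit 1
        # Check that every hunk applies before touching any file
        patch -p0 --dry-run < "$patch_file" || exit 1
        patch -p0 < "$patch_file"
    )
}

# Example use (directory and patch name are placeholders):
#   apply_patch /path/to/nagios-source name-of-patch
```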

- Max


20101026171548-allow_empty_hostgroups.patch
Description: Binary data

Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-02 Thread Max Schubert

To activate the downgrade from error to warning, put

allow_empty_hostgroups = 1

in your nagios.cfg after compiling Nagios with the patch in place.

- Max

