
[Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

Hi,

I like it when people periodically post numbers and architecture
summaries.  I am guessing that, with the distributed frameworks now
available for Nagios, this thread might see bigger numbers than past
threads have.

With our custom-built distributed Nagios-based monitoring system, we
are currently monitoring 18000+ hosts and 100k+ active services
(plus plenty of passive services), all on a 5-minute interval.  We
collect performance data from every check and pass it on to a highly
distributed and scalable time-series data warehouse that another team
in our organization has built (which is why we have the 5-minute
interval requirement).

We also do trap ingest using SNMPTT with a few custom mods, but I am
not going to include those numbers, as traps have never required the
optimizations that polling has.

This isn't a monolithic instance: we have 6 projects using instances
of our distributed Nagios-based software, called Racon (soon my
manager will give our team permission to package it as open source - so
I hear at least).  We built it on core Nagios with a custom database
layer based on a very, very early version of Merlin's database
abstraction layer (thank you Andreas!).  We have a custom client/server
network-based notification framework in use (we will release that as
well), along with a custom NEB/Perl-based client/server framework (also
releasable, just need time scheduled) for sending and processing
performance data.  The performance and notification frameworks are both
horizontally scalable and network fault tolerant.

What kinds of numbers of hosts and services are you all monitoring?
Which add-ons / distributed frameworks are you using?

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

On Thu, May 17, 2012 at 11:57 AM, C. Bensend be...@bennyvision.com wrote:
 Yes, I know, VM bad.  :)  Just not bad enough to spend real dollars on
 more physical hosts.

Nothing bad about using a VM, they just fall over horribly (generally
speaking) when you try to push the virtual machine's virtual CPU cores
and disk hard :p - kudos to you for making that work and pretty
interesting setup!

Thanks for sharing.


Re: [Nagios-users] How many hosts and services are you monitoring with Nagios?

2012-05-17 Thread Max Schubert

On Thu, May 17, 2012 at 12:19 PM, Mike Guthrie mguth...@nagios.com wrote:
 Had to reply to this one since I actually studied Max's blog articles
 for ideas when I was researching performance tuning.  Thanks Max for not
 only pioneering some of these ideas, but actually documenting what you
 did.  ; )
 http://www.semintelligent.com/blog/articles/38/nagios-performance-tuning-early-lessons-learned-lessons-shared-part-4-scalable-performance-data-graphing

Wow!  Forgot about that, glad the ideas in it helped you - Nothing
pioneering (though thank you for saying that), I culled work from
others and my teammate and I then took that and built on it.

I am hoping we can release our network performance data sending
framework first.  It was initially based on the PNP modpnpsender
module and has since been rewritten from scratch by several of us,
based on Andreas's test NEB module, which showed how to have a NEB
module write to a pipe with a separate process listening on that pipe.
That minimizes the time spent inside Nagios and therefore the latency
added by the NEB-based add-in.

  Nagios XI Benchmarking Test Box
 - Running on one of our old Desktops:  1 Dual Core CPU, 4GB of RAM,
 CentOS 5.7
 - Ndoutils/MySQL offloaded to a second server
 - 1800 Hosts, 8200 services, 10k checks running every 5mn
 - Processing perfdata for all checks
 - rrdcached in use with PNP
 - RAM Disk being used for status.dat, objects.cache, host-perfdata,
 service-perfdata, all spool and tmp directories

Nice!


Re: [Nagios-users] check_load -w $ARG1$ -c $ARG2$

2012-05-08 Thread Max Schubert

Load average is a relative measure of how much work over time a system is doing.

Read these to start:

http://www.linuxjournal.com/article/9001
http://www.teamquest.com/pdfs/whitepaper/ldavg1.pdf
http://www.teamquest.com/pdfs/whitepaper/ldavg2.pdf
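
As a rough illustration of what the -w and -c arguments in the subject line are compared against, here is a minimal Python sketch. It assumes the thresholds are 1-, 5- and 15-minute triples, and the function names (classify_load, check_local_load) are made up for this example. It is loosely modelled on the idea of a load check, not taken from the check_load plugin source, and the exact comparison operator the plugin uses may differ.

```python
import os


def classify_load(load, warn, crit):
    """Compare 1/5/15-minute load averages against warning/critical triples.

    Returns 'CRITICAL' if any average reaches its critical threshold,
    'WARNING' if any reaches its warning threshold, otherwise 'OK'.
    """
    if any(value >= limit for value, limit in zip(load, crit)):
        return "CRITICAL"
    if any(value >= limit for value, limit in zip(load, warn)):
        return "WARNING"
    return "OK"


def per_cpu(load, cpus):
    """Scale raw load averages by CPU count so hosts of different sizes compare."""
    return tuple(value / cpus for value in load)


def check_local_load(warn, crit):
    """Classify this machine's current load (Unix only: uses os.getloadavg)."""
    cpus = os.cpu_count() or 1
    return classify_load(per_cpu(os.getloadavg(), cpus), warn, crit)
```

Because load is relative, dividing by the CPU count (as per_cpu does) is one common way to pick thresholds that mean the same thing on a 2-core and a 16-core host.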


Re: [Nagios-users] Centralized Dashboard

2012-02-08 Thread Max Schubert

+1 for Thruk + multi-site - a well-done project, easy to set up and easy
to replicate a setup across multiple hosts.

I haven't tried it with a large Nagios installation.

Query capabilities are very cool as well.

Max

On 2/8/12, Joerg Linge pitchf...@ederdrom.de wrote:

 Am 08.02.2012 um 14:40 schrieb trm asn:

 Dear Friend,

 I have 7 Nagios core server running at 7 different location. I have
 installed MNTOS to get the consolidated tactical views. Is there any other
 way to do so.

 Yes!

 mathias-kettner.de/checkmk_multisite.html
 http://thruk.org/

 Joerg

Re: [Nagios-users] Nagios Optimization on CentOS

2011-12-27 Thread Max Schubert

I have written a number of blog posts about optimizing and tuning
Nagios performance as well - you might find some of them useful:

http://www.semintelligent.com/blog/

- Max


Re: [Nagios-users] Feeding status information to SCOM?

2011-12-20 Thread Max Schubert

There is a commercial Nagios to SCOM connector; we use it at my place
of work.  Avoid it and work on a custom solution (which is the path
you are on).  The commercial connector is awful:
* Only allows for one Nagios system to send to SCOM using SNMP traps
* Does not translate statuses properly
* Does not allow for event chaining
- Max


Re: [Nagios-users] Average Check latency and execution time growth - 3.2.3

2011-10-08 Thread Max Schubert

What minor RHEL rev are you running?  We had one poller running RHEL
5.3 that had constantly increasing latency - a Compaq / AMD-based
host.  None of the optimizations / configuration changes we made to
the other pollers we ran at the time seemed to help this one.  We
updated the poller in-box from 5.3 to 5.4 and voila - issue gone.

As Joerge mentioned, it was probably a memory leak / bug in a library
the parent Nagios poller process was using.  We never did determine
which one, and we haven't hit that same issue since then with any 5.4
or 5.5 pollers.

Even with stable software we end up bouncing our pollers every 2-3
days: 1) because we have an active customer base who make config
changes often, and 2) because we take the metrics from the checks and
put them in a time-series data warehouse that is sensitive to interval
skew.  Any poller that hits 10 seconds of latency has to be bounced.
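
A latency rule like the 10-second one could be automated along these lines. This is a minimal sketch, not the tooling described in this message. It assumes that `nagiostats --mrtg --data=AVGACTSVCLAT` prints the average active service check latency in milliseconds on its own line. The function names and the decision to act on the average rather than the maximum are illustrative assumptions.

```python
import subprocess

LATENCY_LIMIT_SECONDS = 10.0  # threshold taken from the rule of thumb above


def parse_latency_ms(mrtg_output):
    """Return the first non-empty line of MRTG-style output as milliseconds."""
    for line in mrtg_output.splitlines():
        line = line.strip()
        if line:
            return float(line)
    raise ValueError("no latency value found in output")


def needs_bounce(mrtg_output, limit_seconds=LATENCY_LIMIT_SECONDS):
    """True when the reported latency has reached the restart threshold."""
    return parse_latency_ms(mrtg_output) / 1000.0 >= limit_seconds


def read_average_latency():
    """Ask nagiostats for the average active service check latency.

    Assumes nagiostats is on PATH and can find its default config file.
    """
    result = subprocess.run(
        ["nagiostats", "--mrtg", "--data=AVGACTSVCLAT"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

A cron job or a Nagios check could call needs_bounce(read_average_latency()) and schedule a restart when it returns True.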

We are at 12 pollers or so right now and we will be up to almost 20 by
next year at this time.

Max

On 10/2/11, Stuart Browne stuart.bro...@ausregistry.com.au wrote:
 Hi,

 I know this topic has been covered many times, but I've tried those tweaks
 and I have the remaining issue.

 After a few days, the latency on checks explodes.  It goes along quite
 happily with small values, then after (about) 3 days, the values rise quite
 sharply.  I've recently been graphing performance statistics (nagiostats,
 mrtg) and as you can see by the two attachments (day, week), it's rather
 surprising.

 We restart Nagios every few days (for other reasons) so thankfully the issue
 never gets completely out of control, but as you can see, it gets a bit
 crazy.

 I can't think of any combination of settings that would cause such growth
 after such a long period of time.  Does anybody have any knowledge as to why
 it would suddenly increase after running for days without issue?

 Basic Nagios system stats:
   2 x dual-core Xeon 5160 (3Ghz)
   6GB Memory
   4 x SAS, RAID1 (hardware, BBU, LVM over RAID1)
   RHEL5, fully patched
   Load average between 0.5 and 3.2

 'nagios -s /etc/nagios/nagios.cfg' output (trimmed):

 HOST SCHEDULING INFORMATION
 ---
 Total hosts: 252
 Total scheduled hosts:   252
 Host inter-check delay method:   SMART
 Average host check interval: 300.00 sec
 Host inter-check delay:  1.19 sec
 Max host check spread:   30 min
 First scheduled check:   Mon Oct  3 14:31:17 2011
 Last scheduled check:Mon Oct  3 14:36:15 2011


 SERVICE SCHEDULING INFORMATION
 ---
 Total services: 1575
 Total scheduled services:   1386
 Service inter-check delay method:   SMART
 Average service check interval: 878.40 sec
 Inter-check delay:  0.63 sec
 Interleave factor method:   SMART
 Average services per host:  6.25
 Service interleave factor:  6
 Max service check spread:   30 min
 First scheduled check:  Mon Oct  3 14:33:43 2011
 Last scheduled check:   Mon Oct  3 14:48:21 2011

 CHECK PROCESSING INFORMATION
 
 Check result reaper interval:   5 sec
 Max concurrent service checks:  Unlimited


 PERFORMANCE SUGGESTIONS
 ---
 I have no suggestions - things look okay.

 Stuart J. Browne
 Senior Linux Administrator



Re: [Nagios-users] Observing latency

2011-09-05 Thread Max Schubert

A RAM disk for the NPCD queue plus rrdcached is a necessity for scaling
a PNP backend.  PNP in asynchronous mode is a must as well, preferably
on a server other than the primary pollers - for example, use Merlin
to distribute events and then designate one or more servers purely to
processing PNP's RRD-based graphs.

- Max



Re: [Nagios-users] Help required on Nagios Setup

2011-06-14 Thread Max Schubert

On Tue, Jun 14, 2011 at 10:28 PM, sunilang...@rediffmail.com wrote:

 Hi All

 Let me introduce myself. My name is Sunil Angane and I am working with
 Softenger (I) Pvt. Ltd., an IT Infrastructure Management Services company.

 We are doing Nagios setup at one of our prestigious client in Mumbai to
 cover OS and DB monitoring. As we are doing this for the first time, we need
 guidance from you all on this.

 Under OS part we are covering AIX, LINUX, SOLARIS and WINDOWS, under DB
 part ORACLE, SQL, SYBASE and DB2. Kindly share the installation /
 configuration documents related to above listed OS and DB flavors.

 Kindly let me know if anyone from negios user is residing in Mumbai or
 India so that it will be easier for me to contact.


Do you really expect the community to execute your contract for you?
Interesting!

Start here!

http://wiki.nagios.org/index.php/Main_Page

If you have directed questions that show you have done some reading and
educated yourself you will get useful answers from the group.

- Max

Re: [Nagios-users] Is the nagios log file keeps growing over time

2011-06-10 Thread Max Schubert

On Fri, Jun 10, 2011 at 2:59 PM, Manish Kumar manikuma...@gmail.com wrote:

 Hello Frnds,
 I have got a concern, Will the nagios log file (nagios.log) in
 /usr/local/nagios/var/nagios.log
 will keep increasing in size or there is some auto-rotation facility. If it
 is there how it can be done.

http://nagios.sourceforge.net/docs/3_0/configmain.html#log_rotation_method

- Max


Re: [Nagios-users] [Nagios-devel] Q: Service Escalation Recovery Notifications.

2011-05-20 Thread Max Schubert

Hi,

On Thu, May 19, 2011 at 10:10 AM, Andreas Ericsson a...@op5.se wrote:
 On 05/19/2011 03:32 PM, Paul M. Dubuc wrote:
 OK, but wouldn't it be nice if all contacts who got an error notification
 were able to get the recovery message instead of just the one last notified?
 Is there any way to do that?  Setting up an explicit serviceescalation for
 recovery notifications doesn't seem to work.


 Max Schubert is working on a patch that does something similar to that.
 If he doesn't complete it, I might take a look at adding it myself.

I will send out my partial patch to the list sometime today along with
an explanation of my thinking / approach for it - feel free to use it
or discard it as you see fit :)!

- Max


Re: [Nagios-users] Notification configuration (Was [Nagios-devel] RFC/RFP: Service parents)

2011-05-18 Thread Max Schubert

Andreas,

On Tue, May 17, 2011 at 7:57 AM, Andreas Ericsson a...@op5.se wrote:
 Any plans to detach notification attributes from service / host
 definitions in 4.x and make them their own top-level configuration
 class, like escalations, to make it easier to scale notification
 definitions for large projects?


 Not really. What would such an object look like? How would it add
 additional benefit compared to using templates for hosts and services?
 I think if I could just see some sort of example definition of it I'd
 get an inkling of why some seem to think it's such a great idea. Right
 now, I see no additional benefit to it.

It would look just like an escalation.  Having notification policies
stuck inside host and service objects does not work well for large
configurations in this scenario (which is the one we are in at work,
by design):
* Multiple configuration editors who own various parts of the Nagios
configuration tree - in our case this used to be one big tree, now we
have set up separate trees for separate projects - we have about 20-30
people who can edit their project-specific configurations.
* A set of services that are global in nature - service -> hostgroup
-> host - baseline monitoring required by all projects, using
standards established by multiple organizations in our company.  For
our example, base host monitoring with an SNMP agent (6 services
across every host).  We have other global services as well, and a core
team who develop, maintain and augment both our distributed Nagios
software and these global services and configurations
* A set of services that are specific to each project using our
distributed variant of Nagios - managed by subject matter experts on
each team.

With this scenario, how do we let each group that is responsible for
hosts that have these global services on them create individually
tailored notification policies since there is one notification policy
per service?
* We configure our base service and host to 'notify' on every state
change using the command name do_nothing
* We created a custom patch so that when the string 'do_nothing' is
seen in the command name, the state change only increments the
notification count - it does not trigger any external command to run
* We created a patch (partial - no serialization to disk) for
escalation logic that tracks in memory when a fault escalation was
sent so that OK escalations are only sent in response to something
that was in a fault state.  We are working on completing this patch so
that across restarts the state is saved.
* We have all groups use escalations to define their notification
policies - the service and host notification commands then trigger our
distributed pollers to send escalation requests to a network-based
notification service we have, which then lets the notification requests
trigger email, SMS, SNMP traps, etc. without having to re-configure
Nagios for every notification transport / method change.
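
The behaviour of the two patches described in the list above can be sketched in a few lines of Python. This only illustrates the logic as described. The real patches live inside Nagios itself, and the class and function names here (EscalationTracker, run_notification) are hypothetical. Like the partial patch, the sketch keeps its state in memory only, so it forgets faults across restarts.

```python
class EscalationTracker:
    """Remember which host/service pairs have had a fault escalation sent."""

    def __init__(self):
        self._faulted = set()

    def should_send(self, host, service, state):
        """Decide whether an escalation for this state change should go out.

        Fault states are always sent and remembered. An OK is sent only if
        a fault escalation was previously sent for the same host/service.
        """
        key = (host, service)
        if state != "OK":
            self._faulted.add(key)
            return True
        if key in self._faulted:
            self._faulted.discard(key)
            return True
        return False


def run_notification(command_name, runner):
    """Skip the external command when the name contains 'do_nothing'.

    Returns True if the runner was invoked, False if it was suppressed.
    """
    if "do_nothing" in command_name:
        return False
    runner(command_name)
    return True
```

With this split, the base host and service objects can "notify" through do_nothing on every state change, while the escalations carry the real policy.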

Yeah, it is very ugly, and why?  Because there is 1 notification
policy per service, and that doesn't scale well when taking advantage
of service -> hostgroup -> host mappings, which is a critical pattern
to use when scaling a configuration.

We have over 9000 hosts being monitored by our distributed framework
(and growing) with around 30 configuration editors and 120+ users.
Our distributed framework used to be centralized, a 'one project for
all' setup, but now is a cluster of distributed setups, one per
project, which is scaling nicely.  Our largest distributed
installations have 3900 and 5100 hosts in them respectively - we have
4 other distributed instances that are just getting ramped up and only
have a few dozen hosts apiece at this point.

So while this is ugly, it works!  All editors can define escalation
objects that take into account both their individual needs for global
service notifications as well as any project-specific notifications -
and by putting project-specific hosts in project-specific host groups,
for most groups, two escalation policy definitions are all that are
needed per project - one for hosts, one for services.

If all notifications were just done through an escalation like
configuration object, life for a big project would be much easier.
1) Having notifications clearly separated as their own configuration
template in the Nagios DSL makes it much less confusing for people new
to Nagios to understand 'where to configure notifications'
2) The configuration flexibility of the escalation template makes it
very easy to work with for a large configuration.

Our global and project-specific scenario and all the notification
changes we made are also serving us very well as we grow.

Notifications as separate objects would let us back out a number of
patches, would really simplify our configuration, and would let our
pollers run hotter.

- Max


Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents

2011-05-17 Thread Max Schubert

Andreas,

How does this differ functionally from the current ability to
describe dependencies with servicedependency?  Is it just easier to
define?  (I agree that would be a win.)


Re: [Nagios-users] [Nagios-devel] RFC/RFP: Service parents

2011-05-17 Thread Max Schubert

On Tue, May 17, 2011 at 5:48 AM, Andreas Ericsson a...@op5.se wrote:
 On 05/17/2011 11:34 AM, Max Schubert wrote:
 Andreas,

 How does this differ functionally from the current ability to
 describe dependencies with servicedependency?  Is it just easier to
 define?  (I agree that would be a win.)


 It's just easier to describe, and it ties in nicely with different
 additions to the configuration syntax being planned. It does have
 some limitations, but in 99.5% of all dependencies I've seen, it
 matches the usecase people put dependencies to so it should be quite
 a nice win anyway.

I agree - the current syntax is very flexible but understanding how to
implement it properly for some users results in a very painful
learning curve.

Any plans to detach notification attributes from service / host
definitions in 4.x and make them their own top-level configuration
class, like escalations, to make it easier to scale notification
definitions for large projects?

- Max


Re: [Nagios-users] Opsview 3.9 -- Configuring the Check OpenManage Plugin for dell servers

2011-03-30 Thread Max Hetrick

On 03/30/2011 02:00 PM, m...@mnnwahl.com wrote:
 Has anyone had success in getting this to work? I have never gotten this
 to work and would love to monitor the Dell OpenManage parts in Opsview.

I'd suggest contacting the Opsview community forum, as this really isn't 
a Nagios issue.

Regards,
Max


Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation

2011-02-28 Thread Max Schubert

Please respond to the list with the version of Nagios you are using
and an example contact and/or serviceescalation definition that you
are using.

- Max


Re: [Nagios-users] WARNING: Unknown contact keyword: serviceescalation

2011-02-28 Thread Max Schubert

Have you tried removing the trailing commas from the contact attribute
'members' for the two contactgroup definitions?
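
Whether or not the trailing commas turn out to be the cause here, they are easy to hunt for mechanically. Below is a minimal Python sketch that flags any configuration line ending in a comma. The function name is made up for this example, and it assumes the usual Nagios comment conventions ('#' at the start of a line, ';' for inline comments). It is not a Nagios config parser.

```python
def find_trailing_commas(config_text):
    """Return (line_number, text) for each directive line ending in a comma."""
    hits = []
    for number, line in enumerate(config_text.splitlines(), start=1):
        # Drop inline ';' comments, then trailing whitespace.
        stripped = line.split(";", 1)[0].rstrip()
        if stripped.lstrip().startswith("#"):
            continue  # whole-line comment
        if stripped.endswith(","):
            hits.append((number, stripped.strip()))
    return hits


# Sample built from the contactgroup definition quoted in this thread.
sample = "\n".join([
    "define contactgroup{",
    "    contactgroup_name   oncall",
    "    alias   Oncall",
    "    members user1,user2,",
    "}",
])

for number, text in find_trailing_commas(sample):
    print(f"line {number}: {text}")
```

Running it over each object file before a reload gives a short list of lines to inspect by hand.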

- Max

On Mon, Feb 28, 2011 at 4:47 PM, Chad Rhyner crhy...@box.net wrote:
 On Mon, Feb 28, 2011 at 12:27 PM, Max Schubert m...@webwizarddesign.com
 wrote:

 Please respond to the list with the version of Nagios you are using
 and an example contact and/or serviceescalation definition that you
 are using.

 - Max

 Certainly.  The version I am using is 3.0.6.  Here is a copy of the output:

 Nagios 3.0.6
 Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org)
 Last Modified: 12-01-2008
 License: GPL

 This is being run on a 64-bit Linux machine.  Here is an example of the
 serviceescalation definition:

 define serviceescalation{
     hostgroup_name  sms
     service_description HTTP
     first_notification  1
     last_notification   2
     notification_interval   10
     contact_groups  oncall,email
 }

 Here are the contactgroups related to that serviceescalation:

 define contactgroup{
     contactgroup_name   oncall
     alias   Oncall
     members user1,user2,
 }

 define contactgroup{
     contactgroup_name   email
     alias   Email
     members user1_email,user2_email,user3_email,...usern_email,
 }

 For both contactgroups oncall and email, there is a contact defined for each
 user that is named.  I have changed the usernames for privacy reasons.

 ~Chad



Re: [Nagios-users] Which GUI to configure Nagios 3 ?

2011-02-25 Thread Max Schubert

We do have 60+ sysadmins using a shared instance of Nagios without a
configuration UI - all Nagios configuration files are in SVN, and
currently we monitor around 5300 hosts.

We found with a little training the sysadmins were happy to take the
hit on learning in order to have the flexibility of the full power of
the config language.

We avoided implementing or integrating a web-based UI because all of
them take away various capabilities of the language - we did not want
to treat our audience like dummies, so instead we went the path of
providing training and mentoring to them and empowered them to use the
language themselves.

Was this a lot of work for us, especially when we started?  YES.

Was it an approach that respected our user base and gave them the
ability to do what they needed to do with minimal intervention from
us? YES.

- Max

On 2/24/11, Breandan Dezendorf brean...@dezendorf.com wrote:
 On Thu, Feb 24, 2011 at 1:54 PM, Ennis Ibarra en...@nmcourts.gov wrote:
 My two cents:

 Once you lock up your configuration to a given tool, you are stuck
 with that tool's capabilities to configure stuff. The real power is
 understanding the Nagios structure and files; leave the GUIs for
 presentation only.

 I agree - but when you're trying to serve a large community of
 sysadmins, a GUI tool is a useful tool for getting the job done.  I
 can't teach 60+ admins how to write clean nagios configs, but I can
 teach them to use web-based tools.

 --
 Breandan Dezendorf
 brean...@dezendorf.com
 bwdez...@gmail.com


Re: [Nagios-users] check_hpasm and check_openmanage over nrpe in windows

2011-02-25 Thread Max Schubert

You can configure the SNMP agent to send SNMPv3 traps with encryption.

On 2/25/11, Assaf Flatto nag...@flatto.net wrote:
 the main issue is that SNMP is a clear-text protocol with no way to
 encrypt the data, while you can encrypt the NRPE traffic with SSL.

 Assaf


 Alexandre wrote:
 hello,
 I think using NRPE is not a good idea. I use the SNMP traps from Dell
 OpenManage for alerts in Nagios.

 Alex

 On 25/02/2011 12:38, Assaf Flatto wrote:

 Hello All

 I have encountered an issue with wanting to monitor HP and Dell servers
 running Windows.
 The main issue is that, due to security restrictions, we cannot use the
 NSClient++ internal functionality, only the NRPE module option.
 I have not used NRPE on Windows boxes extensively before and wanted
 to know if anyone has deployed check_hpasm and check_openmanage on
 Windows boxes where the Nagios server can only access the NRPE port and
 does not have direct SNMP access to the server?


 Thanks

 Assaf




Re: [Nagios-users] Multi-site frontend options

2011-02-25 Thread Max Schubert

Definitely check out Merlin -  through configuration you can have Nagios
instances communicate events with each other using a few types of
relationships and you can then serialize the data from an instance to a
database for either using their custom UI or making one of your own.

http://www.op5.org/community/plugin-inventory/op5-projects/merlin

When we used it back in early 2010 it was not quite ready with regards to
event distribution but I would think it is more mature by now - definitely
worth spending some proof of concept time on to explore as an option.

We have been using the database serialization portion of Merlin for the last
1.5 years, we have made mods for it to meet our purposes (hope to share some
of that code soon on our github account - one cool thing we added was
threaded MySQL connections from Merlind to the database which increased
throughput tremendously).

- Max

Re: [Nagios-users] NAGIOS does not register Critical | Warning status.

2011-02-22 Thread Max Schubert

There is a debug level you can set in nagios.cfg that will show you
all commands being executed - an easier way would be to make a shell
wrapper for check_int that logs all args and then execs the real
check_int with them.  Something like:

1.  Copy check_int to check_int.bin
2.  Make check_int a shell script like

#!/bin/bash
# Log the full argument list, then hand off to the real plugin.
echo "$*" >> /path/to/logfile.log
exec /path/to/check_int.bin "$@"


Now whenever check_int is called, the full command line and args will
be appended to the log for you to inspect - without cranking
up debug or restarting nagios.
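
For the debug-level route mentioned above, a minimal nagios.cfg sketch
might look like the following. The level values are taken from the
Nagios 3 documentation (256 = commands, 2048 = macros) and the file
path is only a placeholder, so check both against your own install
before relying on them.

```cfg
# nagios.cfg - debug logging sketch (the path is a placeholder)
# 256 (commands) + 2048 (macros) = 2304
debug_level=2304
# 0 = brief, 1 = more detailed, 2 = very detailed
debug_verbosity=2
debug_file=/usr/local/nagios/var/nagios.debug
# rotate the debug file once it reaches about 1 MB
max_debug_file_size=1000000
```

Unlike the wrapper, this needs a Nagios restart or reload to take
effect, and the debug file grows quickly on a busy poller.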

Max

On 2/22/11, socomm soc...@crowbar.cc wrote:
 On Mon, 21 Feb 2011 20:23:54 -0500, Allan Clark
 all...@chickenandporn.com
 wrote:
 Can you confirm that %% is needed for a single % token?  I don't
 remember that being needed, but I don't use % in my commands.  I
 could see $ needing doubling up due to token replacement.

 Apparently check_nt returns the same code whether I pass it the '%' sign
 or not; the double '%' was a typo, which I've corrected.

 Also, see about increasing logging, see if you can log what the actual
 command is running as, and what it responds as a result code.  Maybe
 proving that the check_nt script is being called in the way you think
 it is can help bisect the problem.

 I am looking for a way to log commands, but the best I've found is the debug
 option, which lets me log commands to a debug file. However,
 the output I get is the following:

 -  OUTPUT  -

 Raw Command Input: /usr/lib/nagios/plugins/check_nrpe -H $HOSTADDRESS$ -c
 $ARG1$
 Expanded Command Output: /usr/lib/nagios/plugins/check_nrpe -H
 $HOSTADDRESS$ -c $ARG1$

 

 I'm not sure why the variables are not being expanded to the actual
 arguments being passed. Does anyone more familiar with this know what I
 need to do to view the expanded variables?





Re: [Nagios-users] Nagios::Plugin and Nagios Embedded perl interpretor

2011-02-08 Thread Max Schubert

Marc,

On Tue, Feb 8, 2011 at 6:16 AM, Marc-André Doll m...@b-care.net wrote:
 Hi list,

 I'm developing some Nagios plugins with the Nagios::Plugin Perl module
 and it's saving me a great amount of time.

 However, I encountered a little problem when I tried to activate the
 embedded Perl interpreter in Nagios: all checks returned an UNKNOWN
 state with an error from the ePN saying that an error occurred in the
 Getopt::Long Perl module about the use of an uninitialized value at line
 459.

 I suppose the problem comes from my usage of the add_arg() function but I
 don't know how to correct it.

 Has anybody already encountered that kind of behaviour, or have an
 idea on how to debug a Perl script when it's launched by the ePN?

Check your command lines, especially if you use custom attribute
substitution in them - look for attributes that are empty (no spaces).
An empty attribute gets translated into a NULL or some other
non-printable character that Getopt::Long chokes on:

http://www.semintelligent.com/blog/articles/50/epn-why-do-i-get-an-unitialized-value-error-message-from-my-getopt-longpm-when-nagios-runs-my-perl-based-plugin-under-epn

- Max


Re: [Nagios-users] Backing up Nagios on Ubuntu 10.04

2011-01-06 Thread Max Hetrick

On 01/05/2011 05:39 PM, Max Schubert wrote:
 While a live backup is definitely a good idea, if you start storing
 your configuration tree in a version control system off server - CVS,
 Git, SVN, any other one you choose - then the backup issue on the
 live server becomes only one of backing up retention.dat, which holds
 the changes you / your users have made to the states of notifications,
 flapping, etc. from the CGIs or via the named pipe.

That's what we do at my employer: we use SVN for the Nagios
configuration files. I still back up all data off-site, though.

Where SVN comes into play for us is that every change made to the
Nagios configs has a comment on the commit. This allows us to easily
navigate back knowing what changes were made, by whom, and when.

I'd still opt for backing up all the system files off-site, however, and
depending on your company, this may be policy for disaster recovery
anyway. But SVN can also serve as a backup. You can never have too
many, right? :)

Regards,
Max


Re: [Nagios-users] How to Monitor VMware ESX Server and Storage Units

2011-01-05 Thread Max Hetrick

On 01/05/2011 10:28 AM, Marc Haber wrote:
 Are there other possibilities of connecting Nagios to ESX and MSA
 devices? I'd appreciate your input.

Have a look at the check_esx.pl plugin. You can do quite a bit with it.

http://www.op5.org/community/plugin-inventory/op5-projects/check-esx-plugin

It just requires the VMware Perl SDK to be installed on the machine
you want to connect from, in this instance the Nagios server.

Regards,
Max


Re: [Nagios-users] Backing up Nagios on Ubuntu 10.04

2011-01-05 Thread Max Schubert

While a live backup is definitely a good idea, if you start storing
your configuration tree in a version control system off server - CVS,
Git, SVN, any other one you choose - then the backup issue on the
live server becomes only one of backing up retention.dat, which holds
the changes you / your users have made to the states of notifications,
flapping, etc. from the CGIs or via the named pipe.
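
As a pointer to where that file lives, the state retention settings in
nagios.cfg look roughly like this. The path shown is the default of a
source install and is an assumption here; packaged installs (such as
Ubuntu's) usually put it elsewhere.

```cfg
# nagios.cfg - state retention sketch (the path is an assumption)
retain_state_information=1
state_retention_file=/usr/local/nagios/var/retention.dat
# write the retention file every 60 minutes, in addition to at shutdown
retention_update_interval=60
```

Whatever state_retention_file points at is the file to include in the
backup job alongside the version-controlled configuration.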

- Max


Re: [Nagios-users] URL monitoring

2011-01-04 Thread Max Hetrick

On 01/03/2011 10:01 PM, anish kumar wrote:
 Hi folks,

 I just want to know how to monitor URLs using Nagios, i.e. web
 applications running under Tomcat.

 Can anyone please provide me links or docs that describe how to
 configure Nagios to monitor our web applications?

 We have already configured Nagios to monitor remote servers' load, login
 details, disk space, etc.

 But we could not find any documents for configuring Nagios to
 monitor web applications.

 Your valuable suggestions please.

Look around Monitoring Exchange. Here are two URL checks I found after
a quick search on check_url.

https://www.monitoringexchange.org/search?query=check_url
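
As a rough illustration of what a URL check looks like once it is wired
into the configuration, the sketch below uses check_http from the
standard Nagios plugins rather than the check_url plugins linked above.
The host name tomcat01, the port, the /myapp/ path and the
generic-service template are all placeholders, not anything taken from
the original question.

```cfg
# Command wrapper around check_http: $ARG1$ = port, $ARG2$ = URL path
define command{
    command_name    check_http_url
    command_line    $USER1$/check_http -H $HOSTADDRESS$ -p $ARG1$ -u $ARG2$
}

# Hypothetical service checking a Tomcat web application
define service{
    use                     generic-service
    host_name               tomcat01
    service_description     Webapp URL
    check_command           check_http_url!8080!/myapp/
}
```

check_http also accepts -s to require a string in the response body,
which helps catch an application that answers 200 OK with an error page.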

Regards,
Max


Re: [Nagios-users] Alert host down with passive check

2010-12-10 Thread Max Schubert

Read up on freshness checking:

http://nagios.sourceforge.net/docs/3_0/freshness.html
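
A minimal sketch of what that page describes, applied to a passively
checked host, could look like this. The host name, address,
generic-host template and the host_is_stale command name are
placeholders; check_dummy is from the standard Nagios plugins, and the
300 second threshold matches the 5 minutes asked about.

```cfg
# nagios.cfg: host freshness checking must be enabled globally
#   check_host_freshness=1
#   host_freshness_check_interval=60

define host{
    use                     generic-host
    host_name               passive-host01
    address                 192.0.2.10
    active_checks_enabled   0
    passive_checks_enabled  1
    check_freshness         1
    ; seconds without a passive result before the host counts as stale
    freshness_threshold     300
    ; run only when the last result is stale
    check_command           host_is_stale
}

# Always returns state 2, which Nagios treats as DOWN for a host check
define command{
    command_name    host_is_stale
    command_line    $USER1$/check_dummy 2 "No passive check result received in 5 minutes"
}
```

When the threshold is exceeded Nagios forces the check_command to run
even though active checks are disabled for the host, so the dummy
result is what flips the host to DOWN.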

max

On 12/10/10, Rikard Dahlberg ej_...@hotmail.com wrote:

 Heya guys!

 Is there any way to configure Nagios to report a host as DOWN if
 Nagios hasn't received a passive check result within, say, 5 minutes?
 For now, when a host dies in my current configuration, it isn't
 reported as DOWN, since I'm using passive checks. If I haven't received
 a new passive check result in 5 minutes, I want Nagios to automatically
 set the host to DOWN...

 Any idea please? :)

 Regards
 Rikard
   


Re: [Nagios-users] low-cost snmp-enabled temperature sensor?

2010-12-09 Thread Max Hetrick

On 12/09/2010 01:16 PM, Furnish, Trever G wrote:
 Can anyone recommend a low-cost external temperature sensor that doesn't
 require the buyer to break out a soldering iron?

 Preferably it would be SNMP-enabled so I can poll it from anything.
 Power-over-ethernet would be great too.

 The least expensive snmp-enabled sensor I've found so far is this one at
 195 USD:
 http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm

 I've also noted probes like this one at 15 USD:
 http://www.ibuttonlink.com/t-sense.aspx

 ...but the systems in the site where I'd hook this up are all Windows,
 so I'm not sure how I'd get from there into Nagios.

I use these guys here.

http://store.enviromon.net/cart.php?target=product&product_id=255&category_id=78

They are a little more expensive, but are Ethernet-connected and
integrate into Nagios over SNMP without issues.

We have two sensors hooked up to it, one is temp/humidity and the other 
is a water sensor.

Regards,
Max


Re: [Nagios-users] high latency

2010-12-03 Thread Max Schubert

I find it interesting that a number of users get performance
improvements with embedded Perl off - we lose 20-40% polling capacity
per poller with it off.
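
For anyone wanting to compare the two modes on their own pollers, the
switches live in nagios.cfg. This is only a sketch and assumes Nagios
was compiled with embedded Perl support; without that the options have
no effect.

```cfg
# nagios.cfg - embedded Perl (ePN) toggles
# 1 = use the embedded interpreter, 0 = fork a separate perl per check
enable_embedded_perl=1
# 1 = run every Perl plugin under ePN unless the plugin opts out
use_embedded_perl_implicitly=1
```

An individual plugin can opt in or out with a "# nagios: +epn" or
"# nagios: -epn" comment in its first ten lines, which makes it
possible to measure one plugin at a time.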

- Max


Re: [Nagios-users] high latency

2010-12-03 Thread Max Schubert

Latency increases much more quickly for us without ePN, as execution
times are noticeably longer per check.

We use RHEL 5.x, so the Perl is 5.8.8.

We have semi-daily updates to our pollers, and with ePN that means
cold restarts - memory leaks have not been noticeable given that
scenario. Even on test hosts or hosts where we are doing burn-ins, the
leak is negligible enough that we can go for 2-3 days with no memory
issues - we always hit service latency thresholds first.

7 seconds of latency is in general where we have to force a restart of
our pollers to prevent metric collection and SNMP delta calculation
issues.

Max

On 12/3/10, Andreas Ericsson a...@op5.se wrote:
 On 12/03/2010 12:46 PM, Max Schubert wrote:
 I find it interesting that a number of users get performance
 improvements with embedded Perl off - we lose 20-40% polling capacity
 per poller with it off.


 How do you mean that you're losing capacity? Does latency start to creep
 upwards or is load increasing?

 Out of interest; How much memory does epn leak nowadays, and which perl
 version is it compiled against?

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] Check size of large directory

2010-12-02 Thread Max Hetrick

On 12/02/2010 10:31 AM, Sebastian Ries wrote:

 Does anyone know about a plugin that can do this?

Check out the check_file plugins from the nagios-of-plugins.

http://www.openfusion.com.au/labs/nagios/

I'm pretty sure it will work on a directory the same as a file. Should 
be something like:

check_file -s -30MB -f /path/to/dir

Regards,
Max


Re: [Nagios-users] How can i show remote nagios results in one interface?

2010-12-02 Thread Max Schubert

How about Thruk?

http://www.thruk.org/

- Max


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-29 Thread Max Schubert

Thank you, Andreas - I actually really enjoy writing tests and
understand their value - will see if I can put a few in place using
tap over the next week or so and update the patch with them.

On 11/29/10, Andreas Ericsson a...@op5.se wrote:
 On 11/29/2010 09:20 AM, Andreas Ericsson wrote:

 This looks nice. It's in my compile and test queue right now, so
 assuming it works out ok it'll be committed before the week is out.


 And now it's out there. More testing would be much appreciated though,
 but considering the patch is fairly well written I have few qualms
 about it.

 Cheers, and thanks again.

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-28 Thread Max Schubert

Hi,

Long overdue and anxiously waited on by at least 3 people :), here is
the patch, diffed against the CVS HEAD (as of tonight) revision of
Nagios, with a documentation patch for configmain.html as well (as
requested by Andreas):

The option is now named

allow_empty_hostgroup_assignment
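
As a sketch of how the option is meant to be used: it goes in the main
configuration file, and with it set to 1 a service attached to a
hostgroup that has no members no longer stops Nagios with a
configuration error. The hostgroup name, the generic-service template
and the check_http command below are placeholders for illustration.

```cfg
# nagios.cfg
allow_empty_hostgroup_assignment=1

# objects: a hostgroup that has no members yet
define hostgroup{
    hostgroup_name  future-webservers
    alias           Web servers not yet deployed
}

# accepted by the config check only because of the option above
define service{
    use                     generic-service
    hostgroup_name          future-webservers
    service_description     HTTP
    check_command           check_http
}
```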

Thanks to Dan Wittenberg and Michael Friedrich for their input on the
option name and the patch itself - thanks to my managers at Comcast (a
great place to work as a developer) Eric Scholz and Mike Fischer for
allowing the team I am on to contribute code back to the community,
and thanks to the team I am on for their hard work and dedication with
our distributed Nagios set up and the many other projects we are
involved with.

Andreas, please let me know if you'd like any further modifications or
documentation for the patch beyond what is provided.

https://github.com/netoisstools/Nagios-3-Patches/blob/master/CVS-HEAD-201011292220/201011281330-allow_empty_hostgroup_assignment.patch

- Max


Re: [Nagios-users] More than one command defined

2010-11-19 Thread Max Schubert

We went through a couple of rounds of changes with this:
* We first hacked our copy of process_perfdata.pl and added logging to
it, then built a relay to our long-term trending data warehouse that
forwards Nagios perfdata to it after PNP has processed it for
short-term views of the data

Nagios -> PNP NEB -> network -> performance data server -> NPCD queue
-> process_perfdata.pl -> log -> relay -> network -> data warehouse

* Now we have a POE-based solution in place that processes the
perfdata for PNP and our relay as two data-driven events

Nagios -> PNP NEB -> network -> performance data server -> POE server
POE server -> RRD files (replaces npcd and process_perfdata.pl)
POE server -> network -> data warehouse network listener

For both methods we built off of the (at the time very young)
modpnpsender.c NEB module source included with PNP; we then migrated
to a NEB module whose code was based on some NEB code Andreas wrote
that showed how to write to the input of a second external program
over a pipe, plus an external Perl script that relayed the data to the
destination servers over the network (or queued it if the network was
down).
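
On the Nagios side, a NEB-based pipeline like the ones above is hooked
in through nagios.cfg. The sketch below shows only the generic
directives; the module path and file name are hypothetical and are not
the actual module described here.

```cfg
# nagios.cfg - loading a NEB module (module path is hypothetical)
process_performance_data=1
# -1 = pass all event types to loaded broker modules
event_broker_options=-1
broker_module=/usr/local/nagios/lib/perfdata_sender.o
```

Because the module receives check results inside the Nagios process, no
service_perfdata_command has to be forked per check, which is the main
reason for going the NEB route at this scale.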

Code will be available open source this year for people who are interested.

- Max


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-19 Thread Max Schubert

Will try to get this done this weekend - sorry all, it is an irony to
be working for a company that allows us to release code from work to
open source and be so busy at the job that I rarely have time to do so
:).

My apologies for the delay - and thanks for your input, help, and
feedback on the patch, Dan.  Glad you find it useful.

- Max

On Fri, Nov 19, 2010 at 10:40 AM, Daniel Wittenberg
daniel.wittenberg.r...@statefarm.com wrote:
 I'll check with the original author of the patch...I re-rolled it
 already against head and tested great, been a lifesaver.  I sent the
 updates to him so he could post and move forward with so if he's not had
 time I'll see about submitting directly.


Re: [Nagios-users] Notifications on passive service checks

2010-11-16 Thread Max Schubert

Escalations can resolve this - if you set an escalation to trigger at
notification 1 and set a re-notify interval in the escalation you will
be re-notified at the set interval while the service remains in the
configured alarm state.
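
A minimal sketch of such an escalation follows. The host name, service
description and contact group are placeholders; last_notification 0
keeps the escalation active for as long as the problem lasts, and
notification_interval is in time units (minutes by default).

```cfg
define serviceescalation{
    host_name               switch01
    service_description     linkDown trap
    ; start at the very first notification
    first_notification      1
    ; 0 = never stop escalating while the problem persists
    last_notification       0
    ; re-notify every 10 minutes
    notification_interval   10
    contact_groups          oncall
}
```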

- Max

On Tue, Nov 16, 2010 at 4:41 PM, Alberto Menichetti
alb.meniche...@tai.it wrote:
 Hi all,

 I noticed the same strange behavior, but I don't think it's the right
 behavior.
 Operating this way, a linkDown trap will be notified only once (in fact
 the sending device generates a single trap in response to a state change).
 Is it possible to modify this behavior?



 Hall, JC wrote:

 After some testing, it looks like it will only re-notify after receiving
 another passive check result.  It won't simply re-notify because it's still
 in a non-ok state after the notification_interval has expired.  So to combat
 this I just used the check freshness attribute to re-execute my external
 script and feed the passive check result into nagios and thus re-sending a
 non-ok notification at what would have been the interval for notifications.

 So technically my external scripts are running at every interval to check
 the freshness, not only when called by my event_handler from another
 active service check...  which I'm OK with.

 -Original Message-
 From: Andreas Ericsson [mailto:a...@op5.se]
 Sent: Friday, November 12, 2010 5:00 AM
 To: Nagios Users List
 Cc: Hall, JC
 Subject: Re: [Nagios-users] Notifications on passive service checks

 On 11/11/2010 11:27 PM, Hall, JC wrote:


 Is it accurate that Nagios will only send 1 notification for a
 passive service check?

 IE, the notification_interval definition for a passively checked
 service won't instruct Nagios to re-send a notification such as with
 actively checked services?



 To be honest, I haven't got the faintest idea. An educated guess is
 that it will re-send the notification if it receives another passive
 check-result and enough time has passed though, or that it simply
 re-sends the notification when enough time has passed.

 If you try and find out, let me know either way and I'll amend the
 docs.



 --
 TAI S.r.l.

 Alberto Menichetti
 Area Mercato - Ingegneria dei Sistemi
 System Engineer

 50141 Firenze - Via Pazzagli, 2
 Voice: +39 055 42661 - Fax +39 055 4266356
 56125 Pisa - Viale Gramsci, 12
 Voice: +39 050 220221 - Fax: +39 050 24421

 e-mail: alb.meniche...@tai.it
 http://www.tai.it


Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-03 Thread Max Schubert

I will do my best to get to this this week - if I don't, I will forget
about it for sure. All the patches our team has made to get Nagios
working for us (some very specific to our environment) will be
available on our team GitHub account. I will try to update that this
week as well and post the URL, along with the patch and docs, to this
list, or I will forget to do that for another few months too.

Max

On 11/3/10, Andreas Ericsson a...@op5.se wrote:
 On 11/02/2010 04:54 PM, Max Schubert wrote:

 allow_empty_hostgroups = 1


 I'll take this, but rename the option to something more mnemonic, as
 empty hostgroups are already allowed and it's a bit ambiguous what it
 means.

 Any chance you could re-roll the patch against latest CVS head and
 include a brief documentation update? Name the var allow_empty_hostgroups
 in the docs so I can find it when I sed it in.

 If you want any special attributions (company or person thanks, beyond
 the addition of your name to the THANKS file and your name and email in
 the commit log message), let me know.


 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.



Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-02 Thread Max Schubert

On Tue, Nov 2, 2010 at 10:52 AM, Fredrik Barrud fred...@barrud.se wrote:
 Hi,

 I would also like to be able to create empty hostgroups with services 
 assigned.

 The reason for this is that I've set up a Nagios installation for a customer
 where a host is only a member of one hostgroup to inherit services from other
 hostgroups, like all-windows-citrix-server, and for a few special
 server types I would like to prepare the chain with hostgroup and services so
 that when the servers are ready to be monitored everything is prepared and
 the servers only need to be added to the appropriate hostgroup.

We have a local patch to turn the error into a warning, patched
against 3.2.0 - we needed this because we have customers who can use
host -> hostgroup -> N related services mappings but we do not control
their configs, we only define the global service sets - so we could
not have this be fatal and fail everyone's changes just because one
customer stopped using the mapping.

We have been using this patch in production for a year without issues.

To apply:
1) Put the patch in the top-level source directory
2) patch -p0 < name-of-patch

- Max


20101026171548-allow_empty_hostgroups.patch
Description: Binary data

Re: [Nagios-users] hostgroup with no members - Enhancement

2010-11-02 Thread Max Schubert

To activate the downgrade from error to warning, put

allow_empty_hostgroups = 1

in your nagios.cfg after compiling Nagios with the patch in place.

- Max


Re: [Nagios-users] Detecting Windows Backups?

2010-10-12 Thread Max Hetrick

On 10/12/2010 08:16 AM, Kevin Keane wrote:
 I’m wondering if there is a way to detect if a Windows backup is
 currently running. Every night during the backup window, my checks for
 disk and CPU activity go haywire. That is expected, of course.

 What I’m looking for is a way to get Nagios to either automatically
 schedule a downtime when the backup starts, or to use a service
 dependency or something like that to suppress these false alarms.

Does your backup run at the same time every night?

Max



Re: [Nagios-users] Detecting Windows Backups?

2010-10-12 Thread Max Hetrick

On 10/12/2010 08:16 AM, Kevin Keane wrote:
 I’m wondering if there is a way to detect if a Windows backup is
 currently running. Every night during the backup window, my checks for
 disk and CPU activity go haywire. That is expected, of course.

 What I’m looking for is a way to get Nagios to either automatically
 schedule a downtime when the backup starts, or to use a service
 dependency or something like that to suppress these false alarms.

I had to dig up the link for the script I use for scheduling downtime 
remotely.

Anyways, if you know your backup times are always the same, or always 
the same window, I'd just use a cron job to run this nagios_downtime 
script written in Perl.

http://nagios.larsmichelsen.com/nagios-downtime/
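
That script is not reproduced here. As a separate, hedged sketch of
what a cron-driven job could do when it runs on the Nagios server
itself, the Python below builds a SCHEDULE_HOST_SVC_DOWNTIME external
command, which puts all services on one host into fixed downtime. The
host name, author and comment are hypothetical, and the command file
path in the note underneath is only the common source-install default.

```python
import time


def backup_downtime_command(host, start, duration_s, author, comment, now=None):
    """Build one external-command line for a fixed downtime window.

    Field order: host;start;end;fixed;trigger_id;duration;author;comment
    """
    now = int(time.time()) if now is None else int(now)
    end = start + duration_s
    return (f"[{now}] SCHEDULE_HOST_SVC_DOWNTIME;{host};{start};{end};"
            f"1;0;{duration_s};{author};{comment}")


if __name__ == "__main__":
    start = int(time.time())
    # Two-hour window starting now; names are placeholders.
    print(backup_downtime_command("winsrv1", start, 2 * 3600,
                                  "cron", "nightly backup"))
```

The printed line would then be written to the Nagios command file (for
example /usr/local/nagios/var/rw/nagios.cmd on a default source install)
by the same cron job.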

Regards,
Max




Re: [Nagios-users] Disabling Nagios in the event of network card failure.

2010-10-12 Thread Max Hetrick

On 10/12/2010 07:56 AM, Kristan Webb wrote:
 Hi All,

 Hopefully an easy query.

 We run Nagios 3 on an Ubuntu box and I'm looking for a way to stop
 Nagios falsely reporting services/hosts as down/unknown if, for example,
 the network cable was pulled out of the server / the card failed. This
 has happened recently for over an hour and now all hosts/services have a
 false hours downtime.

 I'm not too bothered, but I like to try and keep things as neat as
 possible.

 Does anyone know of a way of preventing this? All I can think is some
 way of detecting when the server has lost network and then automatically
 quitting / disabling Nagios?

This is pretty much the entire point of Nagios: to report when
something has failed.

If you're intentionally taking down the network on the host, then I
would suggest scheduling downtime for that host when you or others know
you're going to take it down for a period of time. Otherwise, Nagios is
doing its job.

Regards,
Max


Re: [Nagios-users] Deploying Nagios on Windows box

2010-10-12 Thread Max Hetrick

On 10/12/2010 08:20 PM, Yu Watanabe wrote:
 Hello all

 Does anyone have experience deploying Nagios 3.x + Apache on a Windows
 box using Cygwin?
 Perhaps on Windows Server 2003 (32-bit, 64-bit) or later?

I don't think you're going to be able to do so, at least not easily, but 
I did run across this on Nagios Exchange. It looks very experimental, so 
beware, but maybe a possibility for you.

http://exchange.nagios.org/directory/Distributions/Nagios-Core-32bit-Windows-Installer/details

Regards,
Max



Re: [Nagios-users] Alleviating Nagios i/o contention problem

2010-09-25 Thread Max

I like the suggestions Matthias makes; those suggestions have worked
well for us.

RRD updates are very expensive - I am pretty sure without knowing
anything more about your system that the RRD writes are causing most
of the I/O load.

Our current largest Nagios-based system has around 7500 hosts and
around 40k active services spread across 4 pollers - the four pollers
send perfdata to two report servers that do nothing but host the trap
databases for traps from SNMPTT from the pollers, RRD files / PNP web
UI, and the server side of our client/server notification system.  The
SNMPTT DBs and notification server DBs are replicated master-master
between the two hosts.  Even with rrdcached and RAID 10 these hosts
regularly have 3 - 10 pct I/O wait.

We hope to lower that number a bit by moving the DBs onto separate
dedicated DB hosts.
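
For anyone wanting to track the I/O wait figure themselves, here is a
rough sketch (not how we measure it, just one way) that computes the
I/O wait percentage between two samples of the aggregate "cpu" line in
/proc/stat on Linux. The sample values in the comment are made up.

```python
def iowait_percent(before, after):
    """Percent of CPU time spent in iowait between two /proc/stat 'cpu' lines.

    The first eight counters are user, nice, system, idle, iowait, irq,
    softirq and steal; guest time is already included in user, so it is
    left out of the total.
    """
    b = [int(x) for x in before.split()[1:9]]
    a = [int(x) for x in after.split()[1:9]]
    deltas = [y - x for x, y in zip(b, a)]
    total = sum(deltas)
    return 100.0 * deltas[4] / total if total else 0.0


def read_cpu_line(path="/proc/stat"):
    """Return the aggregate 'cpu' line (first line of /proc/stat)."""
    with open(path) as fh:
        return fh.readline().strip()


# Example with made-up counters:
#   iowait_percent("cpu 100 0 50 800 50 0 0 0", "cpu 200 0 100 1500 200 0 0 0")
# Typical use: sample read_cpu_line(), sleep a while, sample again, compare.
```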

- Max


On 9/25/10, Matthias Flacke matthias.fla...@gmx.de wrote:
 On 9/25/10 2:30 PM, Frost, Mark {PBC} wrote:
 Greetings, listers,



 We've got an on-going issue with i/o contention.  There's the obvious
 problem that we've got a whole lot of things all writing to the same
 partition.  In this case, there's just one big chunk of RAID 5 disk on a
 single controller so I don't believe that making more partitions is
 going to help.



 On this same partition we have:



 1) Nagios 3.2.1 running as the central/reporting server for a couple of
 other Nagios nodes that are sending check results via NSCA.
 Approximately 6-7K checks.



 2) pnp4nagios 0.6.2 (with rrd 1.4.2) writing graph data.



 There's a 2nd server configured identically to the first that's acting
 as a hot spare so it also receives check data from the 2 distributed
 nodes and writes its own copy of the graph data locally as well.



 At the moment I'm concerned about the graphdata, but because I can only
 see i/o utilization as an aggregate, I can't tell what is the worst
 component on that filesystem -- status.dat updates?  graph data?  writes
 to the var/spool directory?  We also look at continued growth so this is
 only going to get worse.



 These systems are quite lightly loaded from a CPU (2 dual-core CPUs) and
 memory (4GB) perspective, but the i/o to the nagios filesystem is
 queuing now.



 We're about to order new hardware for these servers and I want to make a
 reasonable choice.  I'd like to make some reasonable changes without
 requiring too exotic of a setup.  I believe these servers are currently
 Dell 2950s and they're all running Suse Linux 10.3 SP2.



 My first thought was to potentially move the graphs to a NAS share which
 would shift that i/o to the network.  I don't know how that would work
 though and it would ultimately be an experiment.



 What experiences do people out there have handling this kind of i/o and
 what have you done to ease it?

 You didn't say how many of your checks create perfdata - but I assume
 that most of your disk I/O is related to RRD updates.
 RRD cached (see http://docs.pnp4nagios.org/pnp-0.6/rrdcached for PNP
 integration) is a good means to collect multiple RRD updates and burst
 write the RRD files.

 status.dat and the checkresults directory are always good candidates to
 be stored on a ramdisk, especially since they're volatile data. As a
 side note: status.dat on ramdisk is a pure boost for the CGIs :).
 I know people who also store nagios.log on a ramdisk and regularly
 save it via rsync onto a hard disk.

 My own systems with ~4000 checks and ~20.000 performance relevant data
 sets went down from 30% to less than 2% wait I/O with rrdcached and
 ramdisk use.

 Cheers,
 -Matthias


Re: [Nagios-users] Creating multiple force checks quickly

2010-09-22 Thread Max

I suggest putting them all in a service group - then you can issue a
force check command from the Nagios UI.

A script is not a bad way to go either, though - the Nagios
external command API (via the named pipe) is easy to use and well
documented.
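
A minimal sketch of the script route, assuming it runs on the Nagios
host with write access to the command file. It uses the
SCHEDULE_FORCED_SVC_CHECK external command; the host/service pairs and
the command file path in the comment are placeholders, and on an older
release it is worth confirming the command is supported before using it.

```python
import time


def forced_check_commands(pairs, when=None):
    """Build one SCHEDULE_FORCED_SVC_CHECK line per (host, service) pair."""
    when = int(time.time()) if when is None else int(when)
    return [f"[{when}] SCHEDULE_FORCED_SVC_CHECK;{host};{service};{when}"
            for host, service in pairs]


def submit(commands, command_file):
    """Write command lines to the Nagios named pipe.

    Opening the pipe blocks until Nagios has it open for reading.
    """
    with open(command_file, "w") as pipe:
        for line in commands:
            pipe.write(line + "\n")


# Example (path is the usual source-install default, adjust to your setup):
#   submit(forced_check_commands([("web1", "HTTP"), ("web2", "HTTP")]),
#          "/usr/local/nagios/var/rw/nagios.cmd")
```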

Max

On 9/22/10, Brian McKee brian.mc...@gmail.com wrote:
 Hi All

 I'm running an old version of Nagios (2.4)
 Sometimes I end up with the same service warning on multiple machines and
 I'd like to force a check on all of them.
 Using the web interface is a lot of clicks when it's more than just a couple
 machines.
 I'm thinking I should be able to gin up a for loop in bash with curl or wget
 to do it for me, or am I wrong?
 Does this exist already somewhere?
 Pointers in the right direction appreciated

 Brian
 --
 Hey, it's your computer isn't it?



Re: [Nagios-users] Creating multiple force checks quickly

2010-09-22 Thread Max

On Wed, Sep 22, 2010 at 11:58 AM, Brian McKee brian.mc...@gmail.com wrote:


 On Wed, Sep 22, 2010 at 9:34 AM, Max perld...@webwizarddesign.com wrote:
 PS Still curious about the service group thing though

I looked further into this and the service group control panel does
not include a 'schedule the next check of all services in this
group' option, so the script sounds like the right way to go.

- Max


Re: [Nagios-users] Enumerating Interfaces

2010-09-08 Thread Max

What I have done in the past and currently do is that each port that
is connected to another networking device is modelled as a Nagios host
- the host check is then the SNMP port status and the IF stats are
collected as services on the host.  The parent of the port is the
network device itself and that way any child devices can be associated
with the port and will be marked as unreachable should the port be
administratively or operationally turned down.
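
As a rough illustration of that layout (not our actual tooling), the
Python below renders two host objects: one for the uplink port, whose
parent is the switch, and one for the device behind the port, whose
parent is the port. The template name generic-host and the command
check_port_status are hypothetical; check-host-alive is the command
from the sample configs.

```python
def render_host(name, address, parent, check_command, template="generic-host"):
    """Render one Nagios host definition as text."""
    rows = [
        ("use", template),
        ("host_name", name),
        ("address", address),
        ("parents", parent),
        ("check_command", check_command),
    ]
    body = "\n".join(f"    {key:<15}{value}" for key, value in rows)
    return "define host {\n" + body + "\n}"


def model_uplink(device, device_address, port, child, child_address):
    """Model a port as a host between a network device and its child."""
    port_host = f"{device}-{port.replace('/', '_')}"
    return [
        # The port is polled via the device's address; its host check is
        # the (hypothetical) SNMP port status command.
        render_host(port_host, device_address, device,
                    f"check_port_status!{port}"),
        # The downstream device hangs off the port, so it becomes
        # UNREACHABLE rather than DOWN when the port goes down.
        render_host(child, child_address, port_host, "check-host-alive"),
    ]


if __name__ == "__main__":
    print("\n\n".join(model_uplink("core-sw1", "10.0.0.1", "fc1/1",
                                   "storage1", "10.0.1.10")))
```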

- Max

On Wed, Sep 8, 2010 at 12:22 PM, Jason Frisvold frisv...@lafayette.edu wrote:

 On 09/07/2010 05:45 PM, Greg Pangrazio wrote:
 Well we use similar switches here.  what we do is create the host then
 services for the interfaces and if we have port channels I use the
 multiple check_service_cluster for all the interfaces in the port
 channel.  I just use check snmp for the interfaces.  If you want more
 specifics i can give you config definitions and command defs etc.

 Yeah, that's kind of what I settled with so far..  Testing it out on the
 SAN switches and maybe I'll move that config to other switches too..
 I'm using PNP4Nagios, so I wrote a custom script to check the
 ifOperStatus and get the traffic information on the port as well.  So a
 single check is telling me status, throughput, errors, and discards...

 I am running into one problem, though, and I'm not sure how to handle
 it..  How are you handling parent/child for these ports, or are you?  In
 other words, I'd like to pair the device connected to the port with the
 port itself.  However, I'm enumerating each switch for the ports, so I
 don't have individual services for each device/port combo...  My config
 looks something like this :

 define service {
   use                     service-standard
   hosts                   fcswitch1.example.com,fcswitch2.example.com
   servicegroups           fc-switch-standard
   service_description     Switch Port fc1/1
   check_command           check_snmp_traffic!fc1/1!-C $USER2$
 }

 I think I'm going to have to enumerate each host/port combo, or live
 without parent/child...  *sigh*

 Greg Pangrazio


 --
 Jason Frisvold
 Network Engineer
 frisv...@lafayette.edu
 What I cannot create, I do not understand
   - Richard Feynman

Re: [Nagios-users] Nagios plain files vs database storage

2010-09-07 Thread Max

FYI - plain text files are less system-intensive than a DB - with our
standalone instances we monitor about 2k hosts per poller (about 10k
active checks).  A DB back end makes management and command / control
easier with large deployments but is not needed for just 400 hosts.

If you want a nice UI, check out Fruity or Lilac for configuration.

Max

On 9/7/10, yuris yu...@smtp.com wrote:
 Hello,

 I'm using Nagios for several months with plain files setup, and
 nagiosgrapher. It works pretty nice and I have no problem monitoring ~50
 hosts; ~10 service_checks per host.

 Now I'm thinking about migrating other hosts from Zabbix to Nagios,
 and in total it is going to be ~350 hosts. Since the number of hosts is
 significant, plain text configs do not seem to be a good option, and
 keeping hosts/hostgroups in a database seems preferable, but from
 experience with Zabbix I know that storing everything in a database might
 not be such a good idea and is a pretty obvious bottleneck :/

 Could someone please give me an idea what drawbacks I may face if I switch
 from plain nagios3 to ndoutils-nagios3-mysql with 350 - 400 hosts? Can
 I store only hosts and hostgroups in the MySQL DB and manipulate them
 (add/remove) via a web interface while keeping everything else the old way
 (plain text configs) to reduce load on MySQL?

 --
 wbr,
 Yuris

 Linux System Engineer,
 SMTP.com


Re: [Nagios-users] Distributed Setup of Nagios

2010-08-19 Thread Max

On Thu, Aug 19, 2010 at 10:39 AM, Kyle O'Donnell nag...@isprime.org wrote:
 We have done some other significant changes to GW to scale as large as we
 did, but these had less to do with nagios and more do to with GW.  We are
 using a ramdisk for the nagios log directory, which helps quite a bit for
 nagios performance.

I can second that - RAM disk for nagios temp files and log files,
objects.cache and retention.dat (we rsync to disk every N minutes for
disaster recovery) makes a very noticeable difference.

- max


Re: [Nagios-users] Centreon nagiosadmin login says invalid user!!!

2010-08-19 Thread Max

On Thu, Aug 19, 2010 at 10:41 PM, Toonz IT it.to...@gmail.com wrote:
 We can login to Nagios using user name nagiosadmin and password nagiosadmin.
 But Centreon returns Invalid User!

 We are using FAN 2.0

What did the users / developers on the Centreon / FAN mailing list say
about this?  They might be able to help.


Re: [Nagios-users] Distributed Setup of Nagios

2010-08-18 Thread Max

On Wed, Aug 18, 2010 at 7:22 AM, Ton Voon tonv...@gmail.com wrote:
 You may want to look at Opsview (http://opsview.com).

  From a single point of configuration, it pushes out the nagios
 configuration to the remote slaves which are independently running
 their own copy of Nagios. We have users going up to 25 slaves!

Cool - how many active service checks / active host checks per poller?

- Max


Re: [Nagios-users] Distributed Setup of Nagios

2010-08-18 Thread Max

On Wed, Aug 18, 2010 at 11:07 AM, Kyle O'Donnell nag...@isprime.org wrote:
 we have ~ 3 services and ~3000 hosts

 we have 6 pollers (each have a backup) processing checks and forwarding
 back to a central nagios host.

 our busiest poller has ~1000 hosts and ~9000 services... avg service check
 interval is 5 minutes, but there are a bunch at 1 and 2 minute intervals.

 avg service check latency is less than 1 second

 This is ~3yr old hardware too, I suspect we could increase capacity by 50%
 if we move to the new Intel Nehalems

Nice - appreciate you sharing your numbers - everyone who does
distributed code around Nagios adds overhead, so it is nice to see
real numbers as opposed to 'as many as can be done' as we all know how
wildly that varies :) - I have spent many many hours with my
colleagues tuning the 'as many as can be done' numbers.

We have done a distributed variant of Nagios as well - our
non-distributed pollers (Compaq 380s with 8 GB RAM + RAID 10) poll 2k
host checks (every 10 minutes) and 11k service checks (avg interval 5
minutes), all checks send performance data through a NEB module as
well to our performance data processing tier - with our distributed
code in place that falls to around 1.5k host checks and 8-9k service
checks per poller.

Average non-distributed host and service check latency around 1.2
seconds, distributed around 2.4 seconds.

Our new hardware consists of Dell R710s - dual 8 core processors, wow
do those rock - with our distributed code we are getting around 2x
those numbers per poller even with the overhead of the distribution
mechanism in place.

We will be releasing our distributed variant as open source software
in the next month or so - I suspect that our methodology is org
specific enough that it will not work for many places, but for higher
volume polling it might be worthwhile to adopt, and we hope some of the
concepts and methodologies in it will spark ideas in others for better
ways to do distributed Nagios.

We also take the approach of pushing out configs to remote pollers -
we have a redundant UI tier where we stage a configuration - after the
configuration is staged, we have code (will allow for manual operator
adjustment in a dot release) that will equally distribute checks among
pollers designated as being available for use - that code then builds
out a common retention.dat file for all pollers along with
objects.pre-cache files for each poller - those files are pushed out
to each poller and the pollers are restarted (yes, we have thought
through and worked out all the synchronization issues involved).

Our UI then lets users take the actions the Nagios UI does and knows
where to send the commands to affect the real poller instances.

Working well so far, and as with all the alternate Nagios UIs, we are
able to make a much more intuitive and flexible UI.

Code should be available in early October.

- Max


Re: [Nagios-users] Distributed Setup of Nagios

2010-08-17 Thread Max

On Tue, Aug 17, 2010 at 10:26 AM, Bradley Radjoo
bradley.rad...@is.co.za wrote:
 Hello All,
 Please can you assist me, as I am a first-timer to Nagios itself and I need
 assistance with a distributed monitoring setup of Nagios,
 where one server manages other instances of Nagios servers in different
 locations.
 Any help or documentation on installation and configuration would be of
 great help ;-)

DNX might work for you:

http://dnx.sourceforge.net/

- Max


Re: [Nagios-users] Applying a multiplier to check_snmp results

2010-08-12 Thread Max

I have written and maintain a perl library that extends Nagios::Plugin
- it is called Nagios::Plugin::SNMP.

It is available on CPAN and while it does not take the complexity out
of SNMP data it does simplify writing plugins that retrieve SNMP data.

The team I am on at my job uses it extensively - you might find it
helpful for creating custom plugins that need to not only retrieve but
also cook SNMP data before doing thresholding on it.
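
The "cook, then threshold" idea can be sketched as below. This is a
minimal Python illustration and does not use or mirror the
Nagios::Plugin::SNMP API; the function name and the threshold values are
assumptions, and the raw value would in practice come from an SNMP GET
rather than a literal.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}


def cook_and_check(raw, multiplier, warn, crit):
    """Scale a raw SNMP integer, then threshold the scaled value."""
    value = raw * multiplier
    if value >= crit:
        state = CRITICAL
    elif value >= warn:
        state = WARNING
    else:
        state = OK
    # Plugin output: human-readable text, then perfdata after the pipe.
    text = f"SNMP {LABELS[state]} - {value:.2f} | value={value:.2f};{warn};{crit}"
    return state, text


# 20717 and 0.01 are the raw reading and multiplier quoted in this thread;
# the warn/crit thresholds are made up for the example.
state, text = cook_and_check(20717, 0.01, warn=250, crit=300)
print(text)  # a real plugin would also exit with `state` as its exit code
```

Thresholding after scaling means the warning and critical levels can be
written in the real unit rather than in the device's raw integers.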

- Max

On 8/12/10, Hall, JC jh...@iodatacenters.com wrote:
 I certainly wish I could do that.  I'm just a novice.

 I can basically accomplish what I want (in regards to multiplying the
 output) with snmpget.

 printf "scale=2;%d*0.01\n" $(snmpget -v 1 -c public 192.168.104.37 .enterprises.6347.1.1.17 -O Qve) | bc

 208.72

 Any thoughts surrounding how I could run this as an external command and
 pipe it back into nagios?

 -Original Message-
 From: Matthew J. Salerno [mailto:vagabond_k...@yahoo.com]
 Sent: Thursday, August 12, 2010 2:17 PM
 To: Nagios Users List
 Subject: Re: [Nagios-users] Applying a multiplier to check_snmp results

 I ran into the same thing, I ended up writing my own plugins.  Just look for a
 non binary plugin (perl/python etc) and then hack away.



 - Original Message 
 From: Jim Avery j...@jimavery.me.uk
 To: Nagios Users List nagios-users@lists.sourceforge.net
 Sent: Thu, August 12, 2010 5:05:42 PM
 Subject: Re: [Nagios-users] Applying a multiplier to check_snmp results

 On 12 August 2010 21:27, Hall, JC jh...@iodatacenters.com wrote:
 I have a situation where I would like to apply a multiplier to the results
 of a check_snmp output.

 The situation is this:

 The value received from my check_snmp = 20717.  The integer multiplier for
 this particular OID is 0.01.

 The output I want reported to Nagios is 20717 x 0.01 = 207.17.

 Looking through the options of the check_snmp plugin, I can’t determine how
 I could accomplish this.



 Any thoughts are appreciated.


 +1 to that!

 I was doing some monitoring on a UPS yesterday and found the current
 is given as an integer which is 10x the Amps, so it would be great to
 be able to divide by 10 (or multiply by whatever) in check_snmp.


Re: [Nagios-users] checks per host

2010-07-12 Thread Max

On Mon, Jul 12, 2010 at 1:39 PM, Joel Brooks jbro...@oddelement.com wrote:
 hey gang,

 I'm trying to get a sense of what's normal for the number of checks per
 host.

 I'm pushing nagios to a number of servers and the list of things I want to
 monitor keeps growing.

 For some servers, I've got  30 checks - some  50.

 what is normal out there?

 is there a practical limit?

The limit is what your HW can handle and what your people can handle
seeing per host.

Our teams try to make checks that check multiple elements at once to
reduce the numbers of checks per host without loss of alarm
granularity - for example, our disk checks check all partitions on a
host and allow for rich thresholding - user can specify many
thresholds to check per service.  Same for critical processes - one
check to look for all critical processes and report any that have
failed.
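
A rough sketch of the "one check, many elements" approach for disks,
in Python. It is not the plugin described above: the function name, the
default thresholds and the sample usage figures are assumptions, and the
usage numbers are passed in rather than read from the host.

```python
def check_partitions(usage, thresholds, default=(80, 90)):
    """Check every partition in one pass.

    usage:      {mount point: percent used}
    thresholds: {mount point: (warn, crit)}, falling back to `default`
    Returns (exit code, output line); the exit code is the worst state seen.
    """
    worst = 0
    problems = []
    for mount, pct in sorted(usage.items()):
        warn, crit = thresholds.get(mount, default)
        if pct >= crit:
            state = 2
        elif pct >= warn:
            state = 1
        else:
            continue  # this partition is fine, nothing to report
        worst = max(worst, state)
        label = "CRITICAL" if state == 2 else "WARNING"
        problems.append(f"{mount} {pct}% ({label})")
    summary = ("OK", "WARNING", "CRITICAL")[worst]
    detail = ", ".join(problems) if problems else f"all {len(usage)} partitions within thresholds"
    return worst, f"DISK {summary} - {detail}"


# Made-up usage figures; /data gets its own, looser thresholds.
print(check_partitions({"/": 45, "/var": 85, "/data": 97}, {"/data": (95, 98)})[1])
```

Because each offending partition is named in the output, a single service
still tells the operator exactly which element tripped which threshold.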

- Max


Re: [Nagios-users] Running Nagios on Vmware

2010-07-08 Thread Max Hetrick

Kent Saunders wrote:

 
 We run a large part of our environment on VMware, however there are 
 certain things that we deem worthy of hardware, Nagios being one of 
 them.  Since we rely on Nagios to inform us of problems across our 
 environment, we need it to be highly reliable, with performance being a 
 lesser consideration.  We've experienced issues with VMware that cause 
 large numbers of guests to lock up, and if this happened to Nagios we 
 wouldn't know, or have as deep a level of alerting.
 
 However, if you really like the idea of running it in a vm, I'd suggest 
 putting a simple "is Nagios alive" type test outside VMware somewhere, 
 to give you an extra layer of protection should the guest have troubles.

Same here. We take the philosophy of wanting Nagios on separate 
hardware, running nothing else but monitoring and graphing/trending.

For the exact same reasons listed, we want to know if there's a problem 
with one of our virtualization servers, and if Nagios was a VM on the 
same box that's having a problem, we wouldn't get those notifications.

My Nagios instance is on an older server with a RAID 1, and of course 
backed up, so it could easily be restored should the machine die.

Of course running it as a VM allows the use of snapshots and very very 
easy backup and restores, but I still think having it isolated is important.

Regards,
Max


Re: [Nagios-users] wiki down?

2010-06-30 Thread Max

On Wed, Jun 30, 2010 at 8:45 AM, Roy Sigurd Karlsbakk r...@karlsbakk.net 
wrote:
 This is getting silly - can someone please fix that wiki, or should we all 
 move over to Icinga and do some real work for once?

Why would you assume that any of us are not doing real work?


Re: [Nagios-users] IP and hostname mapping control

2010-06-29 Thread Max

On Tue, Jun 29, 2010 at 1:40 AM, Network Operation Center FMC
Luxemburg n...@eurofmc.com wrote:
 Hi everybody,

 I'm looking for a way to check out the mapping between a hostname and IP
 address.

 Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not
 associated with this IP, I would have an alarm.

 Indeed the script below returns no alarm :

Take a look at check_dns.
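
The comparison being asked for (alarm when a hostname does not map to the
expected IP) can be sketched in Python as below. This is only an
illustration of the logic, not how check_dns is implemented; the function
name and the injectable `resolve` parameter are assumptions, added so the
example runs without a network. As far as I know, check_dns can do the
same comparison itself when given an expected address.

```python
import socket


def check_mapping(hostname, expected_ip, resolve=socket.gethostbyname_ex):
    """Return (exit code, message): 0 if hostname resolves to expected_ip, else 2."""
    try:
        # gethostbyname_ex returns (canonical name, aliases, list of IPv4 addresses).
        _, _, addresses = resolve(hostname)
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        return 2, f"DNS CRITICAL - {hostname} did not resolve ({exc})"
    if expected_ip in addresses:
        return 0, f"DNS OK - {hostname} resolves to {expected_ip}"
    found = ", ".join(addresses) or "nothing"
    return 2, f"DNS CRITICAL - {hostname} resolves to {found}, expected {expected_ip}"


# Stub resolver so the example does not depend on real DNS.
def stub(name):
    return name, [], ["192.168.0.1"]


print(check_mapping("foo.mylan.com", "192.168.0.1", resolve=stub)[1])
print(check_mapping("foo.mylan.com", "192.168.0.2", resolve=stub)[1])
```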

- Max


Re: [Nagios-users] how to fix excessive latency

2010-06-29 Thread Max

Clock skew can be an issue as well depending on the virtualization platform.

On 6/29/10, Giorgio Zarrelli zarre...@linux.it wrote:
 I agree, better not to run Nagios on a virtual machine. The I/O layer of VMs
 has poor performance.

 Ciao,

 Giorgio

 On 29 Jun 2010, at 14:23, Andreas Ericsson a...@op5.se wrote:

 On 06/29/2010 03:57 AM, wwanghongrui wrote:
 Thanks for your reply. We are writing to a MySQL database via ndoutils. We don't
 use nsca. As for external_command_buffer_slots, we have not set it.
 status_update_interval=15

 I use vmstat to capture system performance, as shown below. Maybe the
 bottleneck is not in the system.


 Endeavour to not run Nagios on a virtual server. If you must use a virtual
 server, make very sure that your checkresult spooldirectory and status data
 files are on a ramdisk, or you will certainly run into trouble.

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.


Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage

2010-06-28 Thread Max Williams

 is Ready
  OK |  0:0 | Connector 0 [SAS] on controller 0 is Ready
  OK |  0:1 | Connector 1 [SAS] on controller 0 is Ready
  OK |  1:0 | Connector 0 [SAS] on controller 1 is Ready
  OK |  1:1 | Connector 1 [SAS] on controller 1 is Ready
  OK |0:0:0 | Enclosure 0:0:0 [Backplane] on controller 0 is Ready
  OK |1:0:1 | Enclosure 1:0:1 [MD1000] on controller 1 is Ready
  OK |1:0:0 | Enclosure 1:0:0 [MD1000] on controller 1 is Ready
  OK |1:1:0 | Enclosure 1:1:0 [MD1000] on controller 1 is Ready
  OK |  1:0:1:1 | Fan 1 in enclosure 1 [MD1000] is Ready (speed=slow)
  OK |  1:0:1:2 | Fan 2 in enclosure 1 [MD1000] is Ready (speed=slow)
  OK |  1:0:1:3 | Fan 3 in enclosure 1 [MD1000] is Ready (speed=slow)
  OK |  1:0:1:4 | Fan 4 in enclosure 1 [MD1000] is Ready (speed=slow)
  OK |  1:0:0:1 | Fan 1 in enclosure 2 [MD1000] is Ready (speed=slow)
  OK |  1:0:0:2 | Fan 2 in enclosure 2 [MD1000] is Ready (speed=slow)
  OK |  1:0:0:3 | Fan 3 in enclosure 2 [MD1000] is Ready (speed=slow)
  OK |  1:0:0:4 | Fan 4 in enclosure 2 [MD1000] is Ready (speed=slow)
  OK |  1:1:0:1 | Fan 1 in enclosure 3 [MD1000] is Ready (speed=slow)
  OK |  1:1:0:2 | Fan 2 in enclosure 3 [MD1000] is Ready (speed=slow)
  OK |  1:1:0:3 | Fan 3 in enclosure 3 [MD1000] is Ready (speed=slow)
  OK |  1:1:0:4 | Fan 4 in enclosure 3 [MD1000] is Ready (speed=slow)
  OK |  1:0:1:1 | Power Supply 1 in enclosure 1 [MD1000] is Ready
  OK |  1:0:1:2 | Power Supply 2 in enclosure 1 [MD1000] is Ready
  OK |  1:0:0:1 | Power Supply 1 in enclosure 2 [MD1000] is Ready
  OK |  1:0:0:2 | Power Supply 2 in enclosure 2 [MD1000] is Ready
  OK |  1:1:0:1 | Power Supply 1 in enclosure 3 [MD1000] is Ready
  OK |  1:1:0:2 | Power Supply 2 in enclosure 3 [MD1000] is Ready
  OK |  1:0:1:0 | Temperature Probe 0 in enclosure 1 [MD1000]: 31 C (65 max)
  OK |  1:0:1:1 | Temperature Probe 1 in enclosure 1 [MD1000]: 32 C (65 max)
  OK |  1:0:1:2 | Temperature Probe 2 in enclosure 1 [MD1000]: 24 C (55 max)
  OK |  1:0:1:3 | Temperature Probe 3 in enclosure 1 [MD1000]: 24 C (55 max)
  OK |  1:0:0:0 | Temperature Probe 0 in enclosure 2 [MD1000]: 31 C (65 max)
  OK |  1:0:0:1 | Temperature Probe 1 in enclosure 2 [MD1000]: 33 C (65 max)
  OK |  1:0:0:2 | Temperature Probe 2 in enclosure 2 [MD1000]: 25 C (55 max)
  OK |  1:0:0:3 | Temperature Probe 3 in enclosure 2 [MD1000]: 25 C (55 max)
  OK |  1:1:0:0 | Temperature Probe 0 in enclosure 3 [MD1000]: 31 C (65 max)
  OK |  1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive
  OK |  1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]:  C ( max)
  OK |  1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]:  C ( max)
  OK |  1:0:1:0 | EMM 0 in enclosure 1 [MD1000] is Ready
  OK |  1:0:1:1 | EMM 1 in enclosure 1 [MD1000] is Ready
  OK |  1:0:0:0 | EMM 0 in enclosure 2 [MD1000] is Ready
  OK |  1:0:0:1 | EMM 1 in enclosure 2 [MD1000] is Ready
  OK |  1:1:0:0 | EMM 0 in enclosure 3 [MD1000] is Ready
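CRITICAL |  1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed
-----------------------------------------------------------------------------
   Chassis Components
=============================================================================
  STATE  |  ID  |  MESSAGE TEXT
---------+------+------------------------------------------------------------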
  OK |1 | Memory module 1 [DIMM1, 4096 MB] is Ok
  OK |2 | Memory module 2 [DIMM2, 4096 MB] is Ok
  OK |3 | Memory module 3 [DIMM3, 4096 MB] is Ok
  OK |4 | Memory module 4 [DIMM4, 4096 MB] is Ok
  OK |5 | Memory module 5 [DIMM5, 4096 MB] is Ok
  OK |6 | Memory module 6 [DIMM6, 4096 MB] is Ok
  OK |7 | Memory module 7 [DIMM7, 4096 MB] is Ok
  OK |8 | Memory module 8 [DIMM8, 4096 MB] is Ok
  OK |1 | Chassis fan 1 [System Board FAN 1 RPM]: 8700
  OK |2 | Chassis fan 2 [System Board FAN 2 RPM]: 8850
  OK |3 | Chassis fan 3 [System Board FAN 3 RPM]: 8775
  OK |4 | Chassis fan 4 [System Board FAN 4 RPM]: 8400
  OK |0 | Power Supply 0 [AC]: Presence detected
  OK |1 | Power Supply 1 [AC]: Presence detected
  OK |0 | Temperature Probe 0 [System Board Ambient Temp] reads 16 C (min=8/3, max=42/47)
  OK |0 | Processor 0 [Intel Xeon E5420 2.50GHz] is Present
  OK |0 | Voltage sensor 0 [CPU1 VCORE] is Good
  OK |1 | Voltage sensor 1 [System Board CPU VTT] is Good
  OK |2 | Voltage sensor 2 [System Board 1.5V PG] is Good
  OK |3 | Voltage sensor 3 [System Board 1.8V PG] is Good
  OK |4 | Voltage sensor 4 [System Board 3.3V PG] is Good
  OK |5 | Voltage sensor 5 [System Board 5V PG] is Good
  OK |6 | Voltage sensor 6 [Riser 1.5V PXH PG] is Good
  OK |7 | Voltage sensor 7 [Riser 5V Riser PG

Re: [Nagios-users] wiki down?

2010-06-28 Thread Max

On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons
standalone.sysad...@gmail.com wrote:
 If only there were some kind of software available to let us know when
 websites were down...

Or people to respond to alerts from the software :)


Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage

2010-06-28 Thread Max Williams

Excellent, sorted, everything reports as OK now. 
Thanks so much Trond, amazing support and an amazingly useful plugin!
Best Regards,
Max Williams

-Original Message-
From: Trond Hasle Amundsen [mailto:t.h.amund...@usit.uio.no] 
Sent: 28 June 2010 15:21
To: Nagios Users List
Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in 
sprintf at /usr/lib64/nagios/plugins/check_openmanage

Max Williams max.willi...@mflow.com writes:

 Here is the output, the inactive temperature probe is sorted but the
 missing EMM still produces an alert:

   OK |  1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive

This one works as expected :)

   OK |  1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]:  C ( max)
   OK |  1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]:  C ( max)

Hmm... something strange going on here. I wonder why this happens, in
the SNMP output you attached previously the values are there. Anyway,
I've added some extra checking in the code to make it report better if
the reading is unavailable for some reason. It should now report simply:

  Temperature Probe 0 in enclosure 2:0:0 [MD1000] is Ready

if the temp reading is not an integer and OMSA reports the status as OK.
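
The fallback described above can be illustrated with a short Python
sketch. The plugin itself is written in Perl and this is not its code;
the function name and parameters are assumptions. The point is simply
that a missing reading is handled before formatting, instead of being
passed to a format string.

```python
def probe_message(name, status, reading=None, maximum=None):
    """Build the message text for a temperature probe (hypothetical helper).

    Falls back to "<name> is <status>" when the reading or its maximum
    is missing or not an integer.
    """
    def is_int(value):
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, int) and not isinstance(value, bool)

    if is_int(reading) and is_int(maximum):
        return f"{name}: {reading} C ({maximum} max)"
    return f"{name} is {status}"


probe = "Temperature Probe 0 in enclosure 2:0:0 [MD1000]"
print(probe_message(probe, "Ready", 31, 65))
print(probe_message(probe, "Ready"))  # no reading available
```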

 CRITICAL |  1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed

Ah.. I misread the SNMP output.. The status is "Unknown" when reported
by omreport, but "Other" when reported with SNMP. One little annoying
difference between the two.. The output should be:

  EMM 0 in enclosure 2:0:0 [MD1000] is Not Installed

with an OK state.

I've created a second test version:

  http://folk.uio.no/trondham/software/beta/check_openmanage

Please give this one a try and see if it performs better.

Cheers,
-- 
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo


[Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage

2010-06-25 Thread Max Williams

Hi,
After adding more storage to a couple of our servers we are getting this error:

[r...@host  ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b 
ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2
Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at  ( max)
EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312.
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312.
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.
INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.
[r...@host  ~]#

We didn't get this error before adding a new cabinet of disks which now brings 
the total up to 47 (2x internal disk and 3x full MD1000s).
Has anyone else come across this error? I am not Perl-literate so not sure how 
to debug or fix this.
Cheers,
Max

Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage

2010-06-25 Thread Max Williams

SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.4 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.5 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.6 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.1 = STRING: A.04
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.2 = STRING: A.04
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.3 = STRING: A.04
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.4 = STRING: A.04
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.5 = STRING: A.04
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.1 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.2 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.3 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.4 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.5 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.6 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.1 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.2 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.3 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.4 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.5 = INTEGER: 3
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.6 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.1 = STRING: \\1\\0\\1\\0
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.2 = STRING: \\1\\0\\1\\1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.3 = STRING: \\1\\0\\0\\0
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.4 = STRING: \\1\\0\\0\\1
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.5 = STRING: \\1\\1\\0\\0
SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.6 = STRING: \\1\\1\\0\\1

Let me know if you need anything else.
Best Regards,
Max Williams




-Original Message-
From: Trond Hasle Amundsen [mailto:t.h.amund...@usit.uio.no]
Sent: 25 June 2010 16:02
To: Nagios Users List
Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in 
sprintf at /usr/lib64/nagios/plugins/check_openmanage

Max Williams max.willi...@mflow.com writes:

 Hi,

 After adding more storage to a couple of our servers we are getting this 
 error:



 [r...@host  ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b
 ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2

 Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at  ( max)

 EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312.

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312.

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.

 INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318.

 [r...@host  ~]#



 We didn't get this error before adding a new cabinet of disks which now brings
 the total up to 47 (2x internal disk and 3x full MD1000s).

 Has anyone else come across this error? I am not Perl-literate so not sure how
 to debug or fix this.

Hi Max,

This is interesting. I've never seen Inactive temperature sensors in
external enclosures. Also, that the plugin reports missing EMMs seems
like a misfeature. Can you post the output from the following commands:

On the monitored host:

  omreport storage enclosure controller=id enclosure=id info=temps
  omreport storage enclosure controller=id enclosure=id info=emms

Replace id with controller/enclosure pairs. You'll get the
enclosure and controller IDs with commands

  omreport storage controller
  omreport storage enclosure

Also, since you're checking with SNMP, I'll need the output from an
snmpwalk of the enclosures wrt. temperatures and EMMs. From the Nagios
server:

  snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.11
  snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.13

If you are uncomfortable with posting this information on the
mailing list, feel free to email me directly.

Debug output from the plugin could also be useful:

  check_openmanage -H hostname -C community -d

Cheers,
--
Trond H. Amundsen t.h.amund...@usit.uio.no
Center for Information Technology Services, University of Oslo


Re: [Nagios-users] Help - I just blew away my configs

2010-06-24 Thread Max

config.cgi shows the running settings.

objects.cache contains a flattened version of all the configuration
objects your configuration tree had; the primary loss is that there
are no templates in objects.cache, so you will have to recreate those.

I save a copy of objects.cache in a very safe place off server and save
a copy on server - Nagios has a command line option that will let you
start it using an objects.cache formatted file (search for pre-cached
configs).
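
To give an idea of what can be recovered from objects.cache, here is a
minimal Python sketch that reads the flattened "define ... { ... }"
blocks back into dictionaries. The parser, its function name and the
sample text are assumptions for illustration; check them against your own
objects.cache before relying on this, since the exact layout may vary by
Nagios version.

```python
def parse_objects(text):
    """Parse flattened 'define <type> { ... }' blocks into a list of dicts."""
    objects = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("define ") and line.endswith("{"):
            current = {"_type": line[len("define "):-1].strip()}
        elif line == "}" and current is not None:
            objects.append(current)
            current = None
        elif current is not None:
            # Directive name, whitespace, then the rest of the line as value.
            parts = line.split(None, 1)
            current[parts[0]] = parts[1] if len(parts) > 1 else ""
    return objects


# Hypothetical sample in the flattened style.
sample = """\
define host {
\thost_name\tweb01
\taddress\t192.168.0.10
\t}
define service {
\thost_name\tweb01
\tservice_description\tPING
\t}
"""
for obj in parse_objects(sample):
    print(obj["_type"], obj.get("host_name"))
```

From a structure like this one can regenerate per-object config files;
the templates themselves still have to be rebuilt by hand, as noted above.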

If you had plugins in your config tree those are lost, cgi.cfg
settings are lost, nagios.cfg settings are lost, and resource.cfg
settings are lost - apart from resource.cfg, these are pretty easy to
redo (resource.cfg might have had passwords or other unique info).

Definitely put your configs in svn, cvs, git or another version
control system as you re-create your config templates - and make sure
you back up your VCS repository daily.

Max

On 6/24/10, Sean Carley scar...@gmi-mr.com wrote:
 You can also try /nagios/cgi-bin/config.cgi, not sure if that shows the
 running or on-disk configs.

 -s



 -Original Message-
 From: Gius, Mark [mailto:mg...@createspace.com]
 Sent: Thursday, June 24, 2010 4:11 PM
 To: Nagios Users List
 Subject: Re: [Nagios-users] Help - I just blew away my configs

 A pretty significant portion of the configurations are stored in the
 objects cache (/var/log/Nagios/objects.cache for me).  This won't be as
 clean as your configs (and I'm not sure Nagios can use this file as a
 config directly), but you should be able to recover a pretty good amount
 of your running configs from there, and start reconstructing the config
 files.

 May I be the first to suggest placing your configuration in revision
 control, which will help mitigate this problem in the future.  I would
 suggest Subversion (http://subversion.tigris.org/) or git
 (http://git-scm.com/).

 Good luck!

 -Gius

 -Original Message-
 From: Litwin, Matthew [mailto:mlit...@stubhub.com]
 Sent: Thursday, June 24, 2010 3:17 PM
 To: Nagios Users List
 Subject: [Nagios-users] Help - I just blew away my configs

 I just blew away all my nagios config files. Nagios is still running.
 Is there any way I can make nagios spit up the configs that were loaded
 from the command line or is all hope lost?


Re: [Nagios-users] Nagios Postemsg

2010-06-11 Thread Max

The main things you will not get from Nagios that you almost always get with
Tivoli:
* High recurring licensing fees
* On-site Tivoli consultants

Nagios does not give you out of the box the visualization dashboards that
Tivoli has, but with Nagviz you can make very nice graphical dashboards
at a much lower cost to your company.

Nagios also does not do auto-discovery out of the box but there are projects
that give you that capability - again at a much lower cost.

Distributed Nagios - there are a few choices, you will need to take the time
to evaluate them all and choose the right one for you, but again, cost will
be lower than Tivoli.

The team I am on is building out a distributed architecture for Nagios based
on our unique requirements - self service model where many SAs can all
change configs on their schedule without our intervention, clustering, fast
redistribution of hosts/services across pollers, centralized transparent (to
the end user) command and control across all pollers.

We are using some existing tools (Nagios and Merlin) and 4 developers, and
even then the TCO and maintenance cost will be orders of magnitude lower
than Tivoli, with much more functionality than most Tivoli shops offer.

A polling model always has some challenges when it comes to scaling big, but
compared to Tivoli I think you will find Nagios to be a lot more fun, a
lot more flexible, and a lot better fit, and, if politics don't interfere, your
management should be much happier with a fixed-cost development price tag
than the high $$ open-ended maintenance costs of a commercial product like
Tivoli.

- Max

On Fri, Jun 11, 2010 at 10:19 AM, steve f a31mod...@hotmail.com wrote:

Hello All,

I am currently looking for an alternative to using Tivoli , TEC postemsg
for a rather large ( 6000 + ) remote environment.

I have had great success with Nagios in my small local/remote test
environment and the obvious cost savings without having TEC anymore is huge.

Can I use the existing postemsg tests that are running on the boxes and via
I guess External Commands have Nagios process the messages?

For those familiar with both Tivoli and Nagios, is there anything that Tivoli
gives me that I can't do with Nagios? I don't see it if there is.

Thanks for the help,

Steve


Re: [Nagios-users] Large Installation

2010-06-11 Thread Max

I can attest  / confirm what Andreas states about the merlin daemon.

BTW, Andreas, I just patched our code base to contain your 0.6.7 changes and
I will be posting that on Github for you and anyone else interested to check
out over the weekend.

Our tests so far are showing that with the Merlin NEB module and daemon on a
poller we lose less than 10% capacity compared to the same poller without
the NEB module and merlind - our test poller is running 10k active service
checks and 1k active host checks in less than 5 minutes with polling
headroom to spare.

- Max

Re: [Nagios-users] Large Installation

2010-06-11 Thread Max

Our changes to Merlin allow N pollers to all write to the same database
without conflicts.

Re: [Nagios-users] Can an entity have more than one parent?

2010-05-28 Thread Max

On Fri, May 28, 2010 at 2:22 PM, shadih rahman shadhi...@gmail.com wrote:
 1)  Can a host have two parents?

http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host

 2)  What happen if only one of those parents go down?

http://nagios.sourceforge.net/docs/3_0/networkreachability.html


Re: [Nagios-users] Sending NAGIOS alerts to Gmail addresses can get your mail server listed as suspect.

2010-05-27 Thread Max Hetrick

Drew Weaver wrote:

 I noticed that a lot of our mail was ending up in users' junk/spam 
 folder at GMail and it seems that if you send Nagios warning messages to 
 Gmail they somehow assume that your server is malicious and spamming. Is 
 it SOP to use a different SMTP server to deliver Nagios messages?

I'd assume you'd have to take that up with GMail. Or check the spam
filtering settings on the user accounts. I don't use GMail, but I assume
there are junk/spam filtering rules you can set up to classify mail in
the settings, like all other e-mail clients and services.

Regards,
Max


Re: [Nagios-users] newbie -- checking log files

2010-05-27 Thread Max

On Thu, May 27, 2010 at 12:00 PM, Jay G. Scott g...@arlut.utexas.edu wrote:

 my boss says nagios (dunno which version he used) doesn't
 monitor log files.

 is that no longer true, i hope?

http://www.google.com/#hl=en&q=Nagios+monitor+log+files


Re: [Nagios-users] Full Throttle Nagios

2010-05-19 Thread Max

Really look forward to seeing this code - this is the process pool code, yes?

On 5/19/10, Andreas Ericsson a...@op5.se wrote:
 There are several tweaks one can do to make Nagios more or less saturate the
 CPU, but none of them are enough if the number of checks rises above a
 certain level. We've noticed that we can run about 60K service-checks so
 long as we don't have many state-changes in the network. That's not nearly
 good enough, so it will be improved.

 Work is under way to revamp the Nagios check handling though. It will get
 better, but it's a series of large patches that need to be thoroughly tested
 before they get released.

 I'll holler when I have them in a testable state, since I guess a lot of
 people would be quite interested in trying out the new checking engine I'm
 working on. If it works as good as it seems to, it should alleviate a lot
 of performance issues.

 --
 Andreas Ericsson   andreas.erics...@op5.se
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 Considering the successes of the wars on alcohol, poverty, drugs and
 terror, I think we should give some serious thought to declaring war
 on peace.


Re: [Nagios-users] Full Throttle Nagios

2010-05-18 Thread Max

Patch can be found here:

http://github.com/netoisstools/Nagios-3-Patches/blob/master/events.c.patch

Download it to your Nagios source directory, then run

patch -p0 < events.c.patch

If you use the patch, make sure to set use_large_installation_tweaks
to 1, as it will only skip the sleep time if large installation tweaks
are enabled.

Regards,
Max


Re: [Nagios-users] how to restart httpd service on remote server

2010-05-03 Thread Max

As can Puppet, Chef, Monit, Net-SNMP, and many others :) ... It is always a
balance deciding where process restarts should happen - the
larger the network, the more benefit there is to having autonomous
agents handle restarts and having Nagios focus purely on polling /
asynchronous event handling.


Re: [Nagios-users] Plugin fails with epn

2010-04-11 Thread Max

If you look at p1.pl, which is in the Nagios bin directory, you can
enable ePN logging (EPN_LEAVE_MESSAGES or something similar for debug
level) and have ePN log each command execution to a log file. This
often will make the source of an ePN error more obvious.

The other thing to do is add

use warnings;

to your script or run it from the command line with

perl -w

as any output to STDERR will cause ePN to halt execution.

That said, 'not execute properly' errors are often permissions or path
errors, so when you test from the CLI make sure you run the script as
the same user your Nagios daemon runs as.

Max

On 4/11/10, Joseph L. Casale jcas...@activenetwerx.com wrote:
 I've written a plugin that very closely resembles other contrib based
 Perl plugins and runs as expected at the cli. It fails with a Service
 check did not exit properly unless I force it to run w/o the epn.

 I have read the guidelines, but not really knowing much about Perl, I
 can't see where its tanking.

 Is there a way to run it at the cli with the epn in hopes to see more
 output to glean a clue as to what's going wrong? Or is there a better
 way to glean more useful info?

 Thanks,
 jlc


Re: [Nagios-users] Plugin fails with epn

2010-04-11 Thread Max

Meant exit not execute :) ... Good luck.

On 4/11/10, Joseph L. Casale jcas...@activenetwerx.com wrote:
If you look at p1.pl, which is in the Nagios bin directory you can
enable ePn logging (EPN_LEAVE_MESSAGES or something similar for debug
level) and have epn log each command execution to a log file ... This
often will make the source of an epn error more obvious.

 Ok, looks like there are some factors surrounding when the output even
 gets dumped? Plugin was touched, now I am waiting...

The other thing to do is add

Use warnings;

 Got that in the plugin as per the guidelines.

As any output to STDERR will cause ePN to halt execution.

 Works fine:/

That said, 'not execute properly' errors are often permissions or path
errors, so when you test from the CLI make sure you run the script as
the same user your Nagios damon runs as.

 Actually, it is 'exit' not execute, it has perms granting execution for all.

 We'll see what happens after I come back to it.
 Thanks for the help!
 jlc


Re: [Nagios-users] Plugin fails with epn

2010-04-11 Thread Max

The 'will not stay shared' warning is the one warning that ePN will not die
on at run time, so you can safely continue with that warning being emitted.

- Max

On Sun, Apr 11, 2010 at 4:48 PM, Joseph L. Casale
jcas...@activenetwerx.com wrote:
Meant exit not execute :) ... Good luck.

I guess the epn isn't reread on a 'reload', needed a 'restart', nuances :)

Ok, so it complains:
Variable $opt_hostname will not stay shared at (eval 1) line 23,.

The plugin dump shows:
23 my ($opt_hostname,$opt_port,$opt_password,$opt_command,$opt_regex,$opt_help,$opt_version,$opt_verbose);

Missed #4 at http://nagios.sourceforge.net/docs/3_0/epnplugins.html

Not being fluent in Perl, I can't say I fully comprehend what that's
about, but it works now:)

Thanks!
jlc


Re: [Nagios-users] Inteface traffic

2010-04-08 Thread Max

IF-MIB
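
The objects to poll there are ifInOctets and ifOutOctets (or the 64-bit
ifHCInOctets and ifHCOutOctets on fast links). They are cumulative counters,
so a traffic rate comes from the difference between two samples. A minimal
Python sketch of that arithmetic, assuming you already have two counter
readings and their timestamps; the function name and the sample values are
made up for illustration:

```python
def octet_rate_bps(prev_count, curr_count, prev_time, curr_time, counter_bits=32):
    """Return average bits per second between two IF-MIB octet counter samples."""
    elapsed = curr_time - prev_time
    if elapsed <= 0:
        raise ValueError("samples must be taken at increasing times")
    delta = curr_count - prev_count
    if delta < 0:
        # Counter wrapped past its maximum between the two samples.
        delta += 2 ** counter_bits
    # Octets are bytes, so multiply by 8 to get bits.
    return delta * 8 / elapsed


if __name__ == "__main__":
    # Two hypothetical ifInOctets readings taken 300 seconds apart.
    print(octet_rate_bps(1_000_000, 4_750_000, 0, 300))
```

Note that a negative delta can also mean the agent restarted and the counter
reset, which this sketch cannot tell apart from a wrap.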

On 4/8/10, Paras pradhan pradhanpa...@gmail.com wrote:
 Which is the correct SNMP MIB to see Data IN and Data out from interfaces?

 Thanks
 Paras.


 On Wed, Apr 7, 2010 at 12:14 PM, Paras pradhan
 pradhanpa...@gmail.com wrote:

 I have a few Linux and Solaris hosts that I need to grab the total in and
 out traffic from their nics. Which plugin should i use? Do i need to configure
 snmp on all of my hosts or are there some non snmp plugins?

 Thanks!
 Paras.





Re: [Nagios-users] Plugin to check file for string?

2010-04-06 Thread Max Hetrick

Guy Waugh wrote:

 Does anyone know of a nagios plugin that will grep a file for a string?
 
 I have a cronjob that logs output to a file and I basically want to grep 
 for 'cronjob successful' in this file, alerting Critical if the string 
 isn't found and OK if it is.

The check_file plugin that is part of OpenFusion's plugin pack can check 
a file for a regular expression. I've used it for this purpose.

http://www.openfusion.com.au/labs/nagios/
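
If you would rather not install a plugin pack, the check itself is small.
Below is a rough Python sketch of the idea, not the OpenFusion check_file
plugin: it does a plain substring match (no regular expressions) and follows
the standard plugin exit codes (0 OK, 2 CRITICAL, 3 UNKNOWN). The function
names are my own.

```python
import sys

OK, CRITICAL, UNKNOWN = 0, 2, 3


def check_file_for_string(path, needle):
    """Return (exit_code, message) following Nagios plugin conventions."""
    try:
        with open(path, "r", errors="replace") as handle:
            for line in handle:
                if needle in line:
                    return OK, f"OK - found '{needle}' in {path}"
    except OSError as exc:
        # An unreadable or missing file is reported as UNKNOWN, not CRITICAL.
        return UNKNOWN, f"UNKNOWN - cannot read {path}: {exc}"
    return CRITICAL, f"CRITICAL - '{needle}' not found in {path}"


def main(argv):
    """Plugin entry point: print one status line and return the exit code."""
    if len(argv) != 3:
        print("UNKNOWN - usage: check_string.py <file> <string>")
        return UNKNOWN
    code, message = check_file_for_string(argv[1], argv[2])
    print(message)
    return code
```

To use it as a plugin, end the script with sys.exit(main(sys.argv)). The
sketch does not look at the file's age, so a stale 'cronjob successful' line
from an earlier run would still return OK.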

Regards,
Max


Re: [Nagios-users] Nagios 3.2.1 - browser refresh

2010-03-31 Thread Max Hetrick

Trisha Hoang wrote:
 I have tried both and restarted apache.
 1) rename file $prefix/share/index.php to index.html, and
 2) placing this line <?php header("Cache-Control: max-age=7200, public"); ?>
 on the first line of index.php.

Same here, neither changing the filename to .html nor adding the php line
as the first line works for me either. CentOS 5.4 and Nagios 3.2.1.

Regards,
Max


Re: [Nagios-users] Nagios 3.2.1 - browser refresh

2010-03-31 Thread Max Hetrick

Marc Powell wrote:

 Looking at CVS, there have been no changes to index.php in 14 months (several 
 releases). Whatever's changed, it's not this.

Smells like CentOS to me...either Apache or PHP. I know there was an 
httpd update this past week for CentOS 5, which I loaded.

Is everyone else who is experiencing this on an updated CentOS?

Regards,
Max


Re: [Nagios-users] How to monitor a log file on a windows box?

2010-03-31 Thread Max Hetrick

Mirza Dedic wrote:

 I have a few Windows boxes where I want to monitor specific log files for 
 matching errors, and if they are there, I want Nagios to warn the appropriate 
 users.
 
 I have accomplished this on our AIX box using consol's check_logfile plugin, 
 however I need this function for Windows. My windows boxes are monitored by 
 NSClient++ (latest build).
 
 Any idea on how I could accomplish this? Examples?
 
 Nagios 3.2
 NSClient 0.3.8 (running NRPE and NSCA)

I have not used these, but check these links out and see if they can do 
what you're looking to accomplish.

http://exchange.nagios.org/directory/Addons/Monitoring-Agents/Windows-Eventlog-Addon/details

http://exchange.nagios.org/directory/Addons/Log-File-Management

Regards,
Max


Re: [Nagios-users] Pre Flight Check -Errors.

2010-03-30 Thread Max

On Wed, Mar 31, 2010 at 1:29 AM,  ravishankar.gundlap...@wipro.com wrote:
 Hi,

 Request someone to provide the solution...

Please take some time to re-read the diagnostic message Nagios
provided you, it tells you exactly what is wrong.

 Error: Contact group 'admins' specified in service 'C:\ Drive Space' for
host 'LTCPDCERP01' is not defined anywhere!

You need to define the contact group, as Giorgio stated in his email response.

If you do not understand how to correct it, the usual suggestions are:
1)  Read the manual and learn how to correct this -
http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#contactgroup
2)  If you are not comfortable with learning on your own or find you
are not able to fix this yourself, nagios.com offers paid service
contracts and people who will be happy to provide the solution to you
for a fee - http://nagios.com/services/support

Regards,
Max


Re: [Nagios-users] check_cluster configuration.

2010-03-29 Thread Max

On Mon, Mar 29, 2010 at 7:23 AM, HyunSung Jang hsj...@gmail.com wrote:
 then what should be the host_name for service definition ?
 my server hostname start from www1 ~ www100
 and at this time I just added one more service under www1 definition. but
 this doesn't look good.
 Can i make it like a standalone ?
 I mean, i don't want put cluster service under one of my www server.
 it looks like a service that belongs to just one host.
 any idea about this~?

One common pattern for this is to give the clustered host the name of
the URL it represents, e.g. if the cluster represented the web service
http://www.example.com I would use example.com as the name.

If the cluster is behind a VIP, even better, you can then use the VIP
IP as the 'virtual host' object IP.

- Max


Re: [Nagios-users] Two IP for the Same server

2010-03-12 Thread Max

On Fri, Mar 12, 2010 at 11:52 AM, Digital Edge reachta...@hotmail.com wrote:
 Dear List,

 I have a server which has 2 NICs with IPs assigned. I want to add those
 two IPs in Nagios with the same hostname and relevant service checks.

 Will it be possible?

Not with a single host definition; however this can be modeled with 3
host definitions:
* One virtual parent host object that has as a host check ping checks
for both IPs
* One virtual host object that represents NIC1 on the host
* One virtual host object that represents NIC2 on the host

Parent --+-- A - NIC 1
         |
         +-- B - NIC 2

The two hosts representing the NICs would have the parent host as
their parent so that if the ping checks fail for the two hosts you
only get notified that the parent host object is down.
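
As a rough illustration of the three definitions, here is a small Python
sketch that renders them as Nagios host blocks. The host names, addresses
and the check_both_nics command are placeholders I made up (you would have
to write a command that pings both IPs yourself), and real definitions would
also need the usual required directives or a 'use' template:

```python
def host_block(host_name, address, check_command, parents=None):
    """Render one Nagios 'define host' block as text."""
    lines = [
        "define host{",
        f"    host_name      {host_name}",
        f"    address        {address}",
        f"    check_command  {check_command}",
    ]
    if parents:
        lines.append(f"    parents        {parents}")
    lines.append("}")
    return "\n".join(lines)


def dual_nic_hosts(name, ip1, ip2):
    """Build the virtual parent plus one host object per NIC."""
    # Hypothetical command that pings both addresses in a single host check.
    parent = host_block(name, ip1, f"check_both_nics!{ip1}!{ip2}")
    nic1 = host_block(f"{name}-nic1", ip1, "check-host-alive", parents=name)
    nic2 = host_block(f"{name}-nic2", ip2, "check-host-alive", parents=name)
    return "\n\n".join([parent, nic1, nic2])


if __name__ == "__main__":
    print(dual_nic_hosts("server1", "192.0.2.10", "192.0.2.11"))
```

Service checks for each NIC would then be attached to the matching per-NIC
host object.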


Re: [Nagios-users] Monitor Java Process

2010-03-02 Thread Max

You could use check_procs via NRPE, or a custom SNMP-based check against an
SNMP agent (the HOST-RESOURCES-MIB or an agent-specific MIB). A number of
SNMP agents can be configured to send SNMP traps when monitored processes exit.

The Sun JVM also has a built-in SNMP agent that exposes thread count,
heap size and some other base JVM metrics.  This agent can easily be
proxied through an agent like Net-SNMP so you don't have multiple
agent ports to work with.

If you are using WebLogic, its built-in SNMP agent can be configured
to send a variety of SNMP traps, including starting up and shutting
down traps.

Max

On 3/2/10, Jatin Davey jasho...@cisco.com wrote:
 Hi

 I want to know how i can monitor the java process running on remote box
 , basically i want to keep monitoring it and raise an email alert if it
 has re-started or stopped functioning. Please let me know how this can
 be achieved using nagios.

 Thanks
 Jatin


Re: [Nagios-users] NRPE/NSCA replacement thoughts?

2010-02-19 Thread Max

On Fri, Feb 19, 2010 at 1:11 PM, Kevin Keane subscript...@kkeane.com wrote:
 Personally, I'm not sure if there is a single best way. That is why I keep 
 promoting separating the protocol from the transport. Basically, Nagios 
 protocol data fits into Layer 6 or 7 of the OSI model, while the encryption 
 and authentication really should be handled by layer 4 or 5.

We have written, in addition to our other agents, an HTTP-based agent
that uses either RESTful or non-RESTful style URLs for requests.

Creating an HTTP RESTful style interface spec would be a pretty clean
way to define a standard without getting into implementation
discussions and would provide a common ground for plugins.

As with SNMP or SSH, HTTP comes with a performance price, but more and
more with Nagios I am seeing (because it is so flexible) that defining
standards for a user group / org that uses Nagios is much harder than
performance tuning is.


Re: [Nagios-users] NRPE compilation on HPUX

2010-02-10 Thread Max

On Wed, Feb 10, 2010 at 5:30 PM, Ciro Iriarte cyru...@gmail.com wrote:
 Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you know what
 missing library can cause this?

Colleague of mine used the NRPE package here:

http://mayoxide.com/naghpux/

and said it works well.

- max


Re: [Nagios-users] Three tier alert system, and how to invoke an external script?

2010-01-13 Thread Max Hetrick

Mr Gabriel wrote:
 Is it possible to do the following;
 
 There are five people, three groups of people who should be notified in 
 case of any issues. the three groups of people each have three ways of 
 being notified.
 
 Group One - Help Desk Team
 Group Two - IT Support Team
 Group Three - Management
 
 First notification is email, the second is text message, and the third 
 one is by phone. Email is already configured. We have a perl script that 
 can be run to send messages, and also a perl script that can connect to 
 our asterisk boxes to make calls.
 
 On the first alert, notify group one by email, and ignore the second and 
 third group.
 If no acknowledgment is received, run the perl script for SMS for group one, 
 and notify group two by email.
 If again no acknowledgment is received, run script for phone calls, and 
 script for SMS/email to group three.
 
 
 I would like to know if such a setup is possible, and any hints and tips 
 as to where I should be looking to figure it out. Before I forget, how 
 can I invoke an external script, which is stored on the same server as 
 nagios?

You should be able to do all that you want to do by using escalations.

http://nagios.sourceforge.net/docs/3_0/escalations.html

I'm not certain how to invoke external commands using escalations, but 
you can send the first notification to group 1 by e-mail, wait however 
many minutes for the second notification and have it go to group 2, and 
then wait another x minutes before notifying group 3.

Escalations should be the ticket for what you want.
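
To make the idea concrete, here is a small Python model of how escalation ranges pick the contact groups for each notification. It is only a sketch of the first_notification / last_notification logic, not Nagios code, and the group names are made up for this example. In a real setup each group would hold contacts whose notification command runs the e-mail, SMS, or phone script.

```python
from collections import namedtuple

# "first" and "last" mirror first_notification / last_notification.
# last == 0 means the escalation never stops applying.
Escalation = namedtuple("Escalation", "first last contact_groups")

# Hypothetical group names, one per team and delivery method.
DEFAULT_GROUPS = ["helpdesk-email"]
ESCALATIONS = [
    Escalation(2, 2, ["helpdesk-sms", "itsupport-email"]),
    Escalation(3, 0, ["helpdesk-phone", "management-sms"]),
]


def groups_for(notification_number, escalations=ESCALATIONS,
               default=DEFAULT_GROUPS):
    """Return the contact groups notified for the Nth notification."""
    matched = []
    for esc in escalations:
        in_range = esc.first <= notification_number and (
            esc.last == 0 or notification_number <= esc.last)
        if in_range:
            matched.extend(esc.contact_groups)
    # Matching escalations replace the service's normal contact groups;
    # with no match, the normal groups are used.
    return matched or list(default)
```

With these values, notification 1 goes to the default group, notification 2 to the second tier, and notification 3 and later to the third tier. Acknowledging the problem stops further notifications, so the later tiers are only reached when nobody responds.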

Regards,
Max



Re: [Nagios-users] Nagios Integration with IBM TEC ( Tivoli )

2010-01-11 Thread Max

SNMP traps are the easiest way to integrate Nagios with other tools
that use SNMP.

With a little leg work you can also use SNMPTT to receive traps from
other SNMP sources and submit them to Nagios as passive checks.

We currently use SNMP traps to integrate Nagios with Spectrum, works
quite nicely.
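
For the passive check side, a result is submitted by writing a PROCESS_SERVICE_CHECK_RESULT line to the Nagios external command file. The Python below is a minimal sketch of that step, not what we run in production. The function names are made up for this example, and the command file path is whatever command_file points to in your nagios.cfg.

```python
import time


def passive_result_line(host, service, return_code, output, now=None):
    """Format one PROCESS_SERVICE_CHECK_RESULT external command line."""
    timestamp = int(time.time() if now is None else now)
    # Format: [time] PROCESS_SERVICE_CHECK_RESULT;host;service;code;output
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s\n" % (
        timestamp, host, service, return_code, output)


def submit_passive_result(command_file, host, service, return_code, output):
    """Write one passive service check result to the Nagios command file.

    command_file is expected to be the named pipe Nagios reads external
    commands from; the host and service must already be defined in Nagios.
    """
    with open(command_file, "w") as pipe:
        pipe.write(passive_result_line(host, service, return_code, output))
```

A trap handler would call submit_passive_result() with the host and service description the trap maps to, a return code of 0, 1, 2 or 3 (OK, WARNING, CRITICAL, UNKNOWN), and the trap text as the output.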

Max

On 1/11/10, steve f a31mod...@hotmail.com wrote:

 Hello,

 We have IBM TEC (Tivoli Enterprise Console) for a lot of our current
 monitoring.

 Does anyone have any experience / history integrating Nagios with IBM TEC?

 We use postd messages to get the info from the clients back to TEC.  Can
 Nagios read the postd messages?  I guess I could somehow use a passive check
 to do this, correct?

 Would Nagios be considered a viable replacement for TEC?  I have not been
 able to find any Marketing type of info as far as Nagios as a replacement
 for other monitoring solutions.

 Thanks,
 Steve
   

Re: [Nagios-users] Nagios 30 seconds initial delay

2009-12-15 Thread Max

On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas ampra...@gmail.com wrote:
 When I click a page I get the error "Error: Could not read host and
 service status information!". After 30 seconds, all pages appear
 again!

 - I don't have a duplicate nagios service running
 - This delay didn't show up in older versions
 - My Linux is Debian 5.0.3 (stable)
 - This error appears every time I restart nagios

We typically see about the same start up time, both on 3.0.3 and 3.2.0,
on instances with 1-2k hosts and ~10k services where we have retention.dat
files that are weeks or months old and we are using regular
expressions and service -> hostgroup mappings extensively.

- Max


Re: [Nagios-users] Nagios list of devices/IP addresses

2009-11-02 Thread Max

On Mon, Nov 2, 2009 at 5:05 PM, Gareth Fletcher
gareth.fletc...@onenet.co.nz wrote:
 As is always the case in IT now we want to extend this and insert the IP
 addresses into this table (so we can build a DNS zone with our Nagios
 hosts).



 Does anyone know a way I can easily grab the IP address of a given hostname?
 Or else an easier way to get all the hostnames/IPs?

* You can parse objects.cache
* You could switch to NDO or Merlin
* You could install and use Nagios2JSON as that exposes host and
service configuration information.
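
For the first option, a rough Python sketch of pulling host names and addresses out of objects.cache is below. It assumes the usual "define host { ... }" layout with one "name value" pair per line. The function name parse_hosts is made up for this example, and the file location is whatever object_cache_file is set to in your nagios.cfg.

```python
import re

# Matches "define host {" ... "}" blocks; "define hostgroup {" and
# similar object types do not match because "{" must follow "host".
HOST_BLOCK = re.compile(r"define host\s*\{(.*?)\n\s*\}", re.DOTALL)


def parse_hosts(text):
    """Return a dict mapping host_name to address from objects.cache text."""
    hosts = {}
    for block in HOST_BLOCK.finditer(text):
        attrs = {}
        for line in block.group(1).splitlines():
            # Each attribute line is "name<whitespace>value".
            parts = line.strip().split(None, 1)
            if len(parts) == 2:
                attrs[parts[0]] = parts[1]
        if "host_name" in attrs and "address" in attrs:
            hosts[attrs["host_name"]] = attrs["address"]
    return hosts
```

Read the file with open(path).read(), pass the text to parse_hosts(), and the resulting dict can be written out as rows for your table or as zone file records.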

- Max


Re: [Nagios-users] pnp4nagios over nrpe

2009-10-02 Thread Max

On Fri, Oct 2, 2009 at 10:49 AM, Leonardo Carneiro
lscarne...@veltrac.com.br wrote:
 I'm using pnp4nagios with some local services and it's running just fine.
 Now I'm trying to get perfdata from a service running on a remote host,
 but doing so via nrpe. The command is a simple shell script that
 checks how many files exist in a specified folder.

 When I try to see the graph, the last item shows me an error:

 RRD Database
 /usr/local/nagios/share/perfdata/gwserver01/MSG_para_o_banco.rrd
 not found.

 I know it could be one of a thousand problems. What I want to know is
 whether it is possible to gather this kind of data via nrpe.

 Thanks in advance, and forgive me for my poor English.

Yes, absolutely, NRPE can be used to collect performance data. Does
your shell script return performance data as part of its output?  You
should see a pipe symbol | and then metrics in the output from the
script, e.g.

MY-CHECK - OK - Blah blah | 'metric1'=10;90;100 'metric2'=4;80;100
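
As a rough illustration, a file-count check that emits output in that shape could look like the Python below. This is a sketch, not your shell script: the function name check_file_count, the 'files' label, and the thresholds are all made up for this example.

```python
import os

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2
STATES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}


def check_file_count(path, warn, crit):
    """Return (exit_code, output_line) for the number of files in path."""
    count = sum(1 for entry in os.scandir(path) if entry.is_file())
    if count >= crit:
        code = CRITICAL
    elif count >= warn:
        code = WARNING
    else:
        code = OK
    # Everything after the pipe is perfdata: 'label'=value;warn;crit
    line = "FILE-COUNT - %s - %d files in %s | 'files'=%d;%d;%d" % (
        STATES[code], count, path, count, warn, crit)
    return code, line
```

The plugin wrapper would print the returned line and exit with the returned code. NRPE passes both back to Nagios unchanged, so the perfdata reaches pnp4nagios the same way it does for a local check.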

- Max


Re: [Nagios-users] SNMP for External IP Address

2009-09-24 Thread Max

On Thu, Sep 24, 2009 at 12:46 PM, Kevin Holleran kdaw...@gmail.com wrote:
 Anyway, one of my tasks is to obtain the WAN address for a series of routers
 at remote locations (~100 locations).  I have been trying to find the
 appropriate SNMP query for this but have come up short.  When I do an
 snmpwalk, the only thing I can find that has the Internet IP is this:
 .1.3.6.1.2.1.4.20.1.1.THE.INTERNET.IP.ADDRESS = IpAddress:
 THE.INTERNET.IP.ADDRESS

 where THE.INTERNET.IP.ADDRESS is the actual address I am looking for.  This
 is in the IP-MIB::ipAdEntAddr table.  However, I cannot query this since
 what I am looking for is part of the OID that I am trying to query.

Are you looking for the IP addresses associated with one or more WAN
ports on the router?
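
If so, one possible approach is to walk the neighbouring ipAdEntIfIndex column (.1.3.6.1.2.1.4.20.1.2) in the same table, where each row's value is the interface index and the address is the tail of the OID. The Python below is only a sketch: it assumes numeric output such as "snmpwalk -On" prints, that you already know the ifIndex of the WAN port, and the function name is made up for this example.

```python
# Numeric OID prefix of IP-MIB::ipAdEntIfIndex, with trailing dot.
IP_AD_ENT_IF_INDEX = ".1.3.6.1.2.1.4.20.1.2."


def addresses_by_ifindex(walk_output):
    """Map ifIndex -> list of IP addresses from a numeric snmpwalk."""
    result = {}
    for line in walk_output.splitlines():
        oid, sep, value = line.partition(" = ")
        if not sep or not oid.startswith(IP_AD_ENT_IF_INDEX):
            continue
        # The table index (the OID suffix) is the IP address itself.
        address = oid[len(IP_AD_ENT_IF_INDEX):]
        # The value looks like "INTEGER: 2"; the last token is the ifIndex.
        if_index = int(value.split()[-1])
        result.setdefault(if_index, []).append(address)
    return result
```

Looking up the WAN port's ifIndex in the returned dict then gives its address without needing to know the address in advance.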

- Max


Re: [Nagios-users] Using Nagios with SNMP traps to monitor storage

2009-09-24 Thread Max

On Thu, Sep 24, 2009 at 11:51 AM, Mikey Sklar
sklarm-nag...@screwdecaf.cx wrote:

 Is anyone currently using nagios snmp traps to monitor this type of storage
 equipment?

 - EMC CX500

We are doing trap receipt for this, your SNMPTT rules will have to be
pretty detailed as the EMC CX* series only sends out 3 traps, with
specific error and recovery information embedded in each trap.

 I've tried setting up nagios + snmptrapd + snmptt as detailed here.

 http://xavier.dusart.free.fr/joomla/index.php/en/nagios/47-traps-snmp-dans-nagios

 I also converted two MIBs with snmptt for EMC / Hitachi devices.
 The MIBs were not for the exact hardware models we have. Should
 I just be doing an snmpwalk, since I cannot locate the proper MIBs?

You will not be able to get this information from an SNMP walk as
traps are send-only events :) so if they are codified it is only in
the vendor-provided MIB(s).

 Also, I continue to see this message on the nagios console for the SAN
 devices. Is that what I should expect to see until a successful
 trap occurs?

 Status: PENDING
 Status Information: Service is not scheduled to be checked.

Yes, that means no passive check has been received for the service.
If you specify

initial_state    o

in your service config Nagios should set the initial state for the
service to the specified state, which is o == OK in the above config
snippet.

I say "should" because currently with our Nagios 3.0.3 instance we see
cases where passive services stay in pending instead of being
initialized to OK per our configs.

- Max


Re: [Nagios-users] Plugin to monitor a file

2009-09-16 Thread Max Hetrick

Selvakumar A wrote:
 
 Dear All,
  I would like to monitor a file on the server, to see whether it is 
 opened or modified. Is there any plugin available for that?

The best place to check for these things is NagiosExchange.

http://www.monitoringexchange.org/cgi-bin/search.cgi?query=check_fileGo=Go

Regards,
Max


Re: [Nagios-users] How to make 1 exception for 1 host when mapping services to hostgroups?

2009-09-09 Thread Max

On Wed, Sep 9, 2009 at 2:42 AM, Janet Sullivan jsulli...@mndigital.com wrote:
 What kind of hardware are you on?

We have 2 machines:
* 1 does polling, the Nagios GUI, and trap ingest (SNMPTT)
* 1 does PNP graphing and storage, MySQL storage for SNMPTT,
notifications, and acts as a relay to send performance data off to our
in house data warehouse.

Each machine has:
* Dual quad-core Intel CPUs
* 16 GB RAM
* SCSI disks (no RAID)
* Gigabit NIC

We have a new set of hosts we are setting up now that have RAID 10.
We are hoping to see some improvement in RRD database performance with
RAID 10 in place on our reporting and notification server (load is
very low on the PNP servers, though we do see some I/O wait, which we
hope will decrease with RAID 10).  We are planning on doing HA between
a redundant pair of PNP/notifications hosts using GlusterFS (still
testing that out in our dev/test env).

