Re: [SLUG] System admin graphing tools
Aleksey Tsalolikhin atsaloli.t...@gmail.com writes: Ken Foskey wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I know you got lots of replies, Ken. What did you end up using? You might want to also check out Splunk - it's pretty nifty and quite powerful and will show you how your system loads are going and a whole lot of other stuff too. Personally, I use Nagios for monitoring/alerting and Zenoss to get the pretty graphs of system loads, etc. Best, -at -- ✣ Daniel Pittman✉ dan...@rimspace.net☎ +61 401 155 707 ♽ made with 100 percent post-consumer electrons -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
$quoted_author = Ken Foskey ; We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: In terms of graphing + alerting all-in-one have a look at: www.opsview.org ...which combines a lot of the tools already mentioned. I was also going to mention www.groundworkopensource.com but they seem to have gone out of their way to hide the actual open source bit. Download link? cheers Marty -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
[SLUG] System admin graphing tools
We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: http://flapjack-project.com/ It is local so goes first :-) Flapjack is a scalable and distributed monitoring system. It natively talks the Nagios plugin format. http://www.cacti.net/ (Language PHP) Cacti is a complete network graphing solution... http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. http://oss.oetiker.ch/mrtg/ The Multi Router Traffic Grapher http://support.nagios.com/knowledgebase Cannot find a simple 'what is nagios' on website. 'Nagios is a host and service monitor designed to inform you of network problems.' From whitepaper. http://oss.oetiker.ch/smokeping/ SmokePing keeps track of your network latency Any comments on the above and any others to add to the list? Other reading: http://wiki.nagios.org/index.php/White_Papers Implementation of Cacti, Smokeping, Nagios (2004) Based on a quick read, munin looks pretty good. Ta Ken -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
On Fri, Feb 26, 2010 at 9:55 AM, Ken Foskey kfos...@tpg.com.au wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: [...] Any comments on the above and any others to add to the list? JFFNMS is pretty good - I've used it in a couple of installations, and it'll do stuff cacti (my other default favourite) won't do, like alert on excessive link utilisation etc. Works with routers/switches (Cisco mainly) and servers with SNMP MIB's (disk, CPU etc). http://www.jffnms.org DaZZa -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
Have a look at zenoss It's nagios and munin in one. It does the alerting, threshholds, recovery actions and graphing all in one. Can monitor windo$e, vmware and talks nagios plugin format as well. Grant Ken Foskey wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: http://flapjack-project.com/ It is local so goes first :-) Flapjack is a scalable and distributed monitoring system. It natively talks the Nagios plugin format. http://www.cacti.net/ (Language PHP) Cacti is a complete network graphing solution... http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. http://oss.oetiker.ch/mrtg/ The Multi Router Traffic Grapher http://support.nagios.com/knowledgebase Cannot find a simple 'what is nagios' on website. 'Nagios is a host and service monitor designed to inform you of network problems.' From whitepaper. http://oss.oetiker.ch/smokeping/ SmokePing keeps track of your network latency Any comments on the above and any others to add to the list? Other reading: http://wiki.nagios.org/index.php/White_Papers Implementation of Cacti, Smokeping, Nagios (2004) Based on a quick read, munin looks pretty good. Ta Ken -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. I actively use munin and it is a great historical analysis tool. Great availability of plugins as well. -- Simon Males -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
On Fri, 26 Feb 2010, Ken Foskey wrote: Hi, just did a deploy of NAGIOS/Munin/RRDtool and friends to Solaris 10 SPARC. This was for a company who will remain nameless who we outsourced monitoring to, who insisted on the above rather than using the shiny SunMC infrastructure we already had for the purpose. Porting this was a case of dependency, dependency.. Munin wants a lot of stuff - about 20 perl modules and then it wants RRDTool which in turn wants a whole bunch of obscure shared libraries such as cairo, pixman and pango, which is fine if you are a web 2.0 monkey, but in sysadmin world it is annoying. And then RRDtool wants you to have practically the latest of everything. And then it wants something called pkgconfig which is great if you are developer, but for sysadmin, very annoying.And then finally you can have the RRDtool perl module.. And then don't start me on some of the undocumented problems that caused gcc to break the compile because some stupid Linux hacker didn't understand POSIX compliance... So, in the end, I got the whole stack working.. My point. (oblinux) If you have an older, broken or improperly installed system, you will find Munin a pain to install. It wants practically the latest of everything, even touch libglib. If you have all your ducks in a row, it is still a pain but possible. Porting to Solaris 10 was easy for me, but if it had been a broken or older system, I doubt it would have got there. Munin looks OK, but it was obviously created by a bunch of anal-retentive module hackers with a mandate on making sure their install occcupies as much sysadmin brain power as possible. Approach only if you have a reasonably shiny system.. rachel -- Rachel Polanskis Kingswood, Greater Western Sydney, Australia gr...@zeta.org.auhttp://www.zeta.org.au/~grove/grove.html The perversity of the Universe tends towards a maximum. - Finagle's Law -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
On 25 February 2010 17:55, Ken Foskey kfos...@tpg.com.au wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: http://flapjack-project.com/ It is local so goes first :-) Flapjack is a scalable and distributed monitoring system. It natively talks the Nagios plugin format. Heh, thanks for the mention. :-) I wouldn't recommend using Flapjack right now unless you want to be testing bleeding edge stuff that is guaranteed to break, or you're a Ruby hacker with an inkling for sysadmin. I'd argue that you're conflating two types of software: statistic collectors (with graphs), and alerters/notifiers. For statistic collection, you cannot go past collectd[0]. collectd is very lightweight (it's written in C), has a plugin architecture (and a boatload of plugins to boot), and is network aware (you can collect stats from all your servers and aggregate them in one place). collectd has a few options for graphing: collection.cgi, collection3.cgi, and Visage[1]. collection*.cgi are CGI scripts (duh) that use RRDtool to generate graphs. Visage draws stats in the browser using JavaScript + SVG. collectd also has a Nagios bridge, so you can plug it into pretty much any alerting/notification system out there. Hope that helps! Lindsay [0] http://collectd.org [1] http://auxesis.github.com/visage (disclaimer: I wrote it) -- w: http://holmwood.id.au/~lindsay/ t: @auxesis -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
I wouldn't bother wasting time on any of these tools. Perhaps http://www.zabbix.com might be universal monitoring solution. Regards, Dmitry. On 26 February 2010 09:55, Ken Foskey kfos...@tpg.com.au wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: http://flapjack-project.com/ It is local so goes first :-) Flapjack is a scalable and distributed monitoring system. It natively talks the Nagios plugin format. http://www.cacti.net/ (Language PHP) Cacti is a complete network graphing solution... http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. http://oss.oetiker.ch/mrtg/ The Multi Router Traffic Grapher http://support.nagios.com/knowledgebase Cannot find a simple 'what is nagios' on website. 'Nagios is a host and service monitor designed to inform you of network problems.' From whitepaper. http://oss.oetiker.ch/smokeping/ SmokePing keeps track of your network latency Any comments on the above and any others to add to the list? Other reading: http://wiki.nagios.org/index.php/White_Papers Implementation of Cacti, Smokeping, Nagios (2004) Based on a quick read, munin looks pretty good. Ta Ken -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
torrus is awesome www.torrus.org Dean Ken Foskey wrote: We all know we should do it. Provide a monitoring system to see how our system loads are going. I have a couple of links that look interesting: http://flapjack-project.com/ It is local so goes first :-) Flapjack is a scalable and distributed monitoring system. It natively talks the Nagios plugin format. http://www.cacti.net/ (Language PHP) Cacti is a complete network graphing solution... http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. http://oss.oetiker.ch/mrtg/ The Multi Router Traffic Grapher http://support.nagios.com/knowledgebase Cannot find a simple 'what is nagios' on website. 'Nagios is a host and service monitor designed to inform you of network problems.' From whitepaper. http://oss.oetiker.ch/smokeping/ SmokePing keeps track of your network latency Any comments on the above and any others to add to the list? Other reading: http://wiki.nagios.org/index.php/White_Papers Implementation of Cacti, Smokeping, Nagios (2004) Based on a quick read, munin looks pretty good. Ta Ken -- http://fragfest.com.au -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
Re: [SLUG] System admin graphing tools
On Fri, 2010-02-26 at 09:55 +1100, Ken Foskey wrote: http://www.cacti.net/ (Language PHP) Cacti is a complete network graphing solution... I found cacti to have a surprisingly steep learning curve, figuring out how the data sources and input methods and queries work. But it's a very very capable tool, and the interface is getting very nice and Web 2.0 in recent versions. Extending what you're graphing with cacti requires a reasonable understanding of SNMP and how net-snmpd works. But once you've overcome that you can monitor anything you can write a script for. It's pretty neat. http://munin.projects.linpro.no/ (Language Perl) Munin is a networked resource monitoring tool that can help analyze resource trends and what just happened to kill our performance? problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. The plugin architecture for munin is very flexible - spend half an hour going through the tutorial and it's very easy to start churning out graphs for anything you can think of. The fact that it relies on agent software running on the monitored device is a bit of a pain. I'm still having stability issues with the Windows agent, and pulling stats from devices that can't run an agent is a pain - munin's SNMP support is very weak. I've managed to get it to poll one of my ciscos, but it was a battle. Maybe I just haven't spent enough time working with it, but the web interface for munin is very sparse compared to the features available in cacti. http://oss.oetiker.ch/mrtg/ The Multi Router Traffic Grapher Are people still using MRTG in new installations? I had the feeling it was kind of supplanted by other tools mentioned above. http://support.nagios.com/knowledgebase Cannot find a simple 'what is nagios' on website. 'Nagios is a host and service monitor designed to inform you of network problems.' From whitepaper. Everything else you've mentioned covers service performance. Nagios handles service availability and notifications only, and I consider it best-of-breed for this. http://oss.oetiker.ch/smokeping/ SmokePing keeps track of your network latency Also a big fan of this one. Does one thing only, and does it incredibly well. Based on a quick read, munin looks pretty good. I would have serious reservations about deploying munin if you have more than a couple of SNMP-only devices, or more than a couple of critical Windows boxes. -- Pete -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html