I need to monitor a cluster. I hear very good things about Ganglia, but it
seems to be missing a couple of key components. I'd like to find out if I'm
missing something, or how other ganglia users handle this. The two
components that I don't see are log monitoring and notification.
Ganglia provides a great deal of information - when I look at it. However,
I want to know when a node goes down, or has a problem. I want my
monitoring system to do something to notify me.
Also, ganglia gathers quite a bit of information, but does not appear to
monitor syslog files. I also need to know when my software logs a problem.
I know - I can have all the nodes forward syslogs to a single server, and
run a logwatch program there. There's a trade-off, between running the
analysis software on the individual nodes (takes CPU on compute nodes) and
sending it to a server (takes network, and storage at the server).
I've used BigBrother (with larrd and a few other plugins) in the past for
server monitoring. For a moderate to small cluster (with reliable
equipment), it seems to do a decent job. It has easily configurable
notification,and it tracks my log files.
So - does ganglia handle log files and notification? If not, what are you
using for those?
Thanks!
David
-------------------------------------------------------------------------
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general