Greg,
We are monitoring our ~47 Podcast Producer (PcP) capture agents with
Nagios - with Ping/SSH for host-check (there is no great way to check
for and, more importantly, recover/restart PcP service via Nagios, so we
actually do the service check via a custom script from the PcP server).
Our plan going forward with Matterhorn is to:
a) use Nagios with Ping/SSH for host check and HTTP for service check
b) use Cacti for resource monitoring (including storage) and alerts (for
storage, we will get alerts if storage is 90% full)
There are obviously quite a few different monitoring systems that can be
used, but we stuck with the most well known ones, which we will also be
using to monitor MH servers as well.
Kevin Chan
Operations Team
Educational Technology Services
UC Berkeley
On 7/21/11 9:07 AM, Brunner Armin wrote:
Greg,
We are monitoring our 20+ capture agents (Replay not yet Matterhorn) with
Nagios too. Ping for host-check and http for service-check.
This worked very good for some years now. I have no need for closer monitoring
yet.
With MCA the situation may be different. The internal storage is very limited
and the possibility to watch it may be helpful.
Regards
Armin
BTW.
I'm just testing the MCA and I'm quite impressed. With some bug and feature
fixing this product will become very useful.
-----Ursprüngliche Nachricht-----
Von: [email protected]
[mailto:[email protected]] Im Auftrag von Greg Logan
Gesendet: Donnerstag, 21. Juli 2011 17:42
An: Opencast Community; Opencast Matterhorn; Matterhorn Users
Betreff: [Matterhorn-users] Monitoring Capture Agents
Hi Folks,
Cross posting to ensure everyone hears this, sorry about the spam!
I'm the guy doing the Matterhorn integration work for the new Epiphan
Matterhorn Capture Appliances (MCAs), as well as a developer for the normal
Matterhorn capture agent (CA). We have about 10 CAs deployed on our campus
currently, and one of our concerns was monitoring their status outside of
Matterhorn itself. To do this we use Nagios to check if they're up by sshing
into them, and we're working on the next step of checking the system health
(smart, disk space, etc) as well as making sure they capture when they should.
Does anyone else monitor their CAs in a similar way? Are you using Nagios as
well? Munin? I ask, because we are at a stage in the MCA development where
adding something like Nagios would be easy (memory space permitting!), so I'd
like some community feedback in terms of what people are looking for in an SNMP
monitor.
Thanks,
G
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users