Greg,

We are monitoring our ~47 Podcast Producer (PcP) capture agents with Nagios - with Ping/SSH for host-check (there is no great way to check for and, more importantly, recover/restart PcP service via Nagios, so we actually do the service check via a custom script from the PcP server).

Our plan going forward with Matterhorn is to:

a) use Nagios with Ping/SSH for host check and HTTP for service check
b) use Cacti for resource monitoring (including storage) and alerts (for storage, we will get alerts if storage is 90% full)

There are obviously quite a few different monitoring systems that can be used, but we stuck with the most well known ones, which we will also be using to monitor MH servers as well.

  Kevin Chan

  Operations Team
  Educational Technology Services
  UC Berkeley


On 7/21/11 9:07 AM, Brunner Armin wrote:
Greg,

We are monitoring our 20+ capture agents (Replay not yet Matterhorn) with 
Nagios too. Ping for host-check and http for service-check.
This worked very good for some years now. I have no need for closer monitoring 
yet.
With MCA the situation may be different. The internal storage is very limited 
and the possibility to watch it may be helpful.

Regards
Armin

BTW.
I'm just testing the MCA and I'm quite impressed. With some bug and feature 
fixing this product will become very useful.


-----Ursprüngliche Nachricht-----
Von: [email protected] 
[mailto:[email protected]] Im Auftrag von Greg Logan
Gesendet: Donnerstag, 21. Juli 2011 17:42
An: Opencast Community; Opencast Matterhorn; Matterhorn Users
Betreff: [Matterhorn-users] Monitoring Capture Agents

Hi Folks,

Cross posting to ensure everyone hears this, sorry about the spam!

I'm the guy doing the Matterhorn integration work for the new Epiphan 
Matterhorn Capture Appliances (MCAs), as well as a developer for the normal 
Matterhorn capture agent (CA).  We have about 10 CAs deployed on our campus 
currently, and one of our concerns was monitoring their status outside of 
Matterhorn itself.  To do this we use Nagios to check if they're up by sshing 
into them, and we're working on the next step of checking the system health 
(smart, disk space, etc) as well as making sure they capture when they should.

Does anyone else monitor their CAs in a similar way?  Are you using Nagios as 
well?  Munin?  I ask, because we are at a stage in the MCA development where 
adding something like Nagios would be easy (memory space permitting!), so I'd 
like some community feedback in terms of what people are looking for in an SNMP 
monitor.

Thanks,
G

_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

Reply via email to