Hello all.

I am working on a cluster that will have two scheduling nodes: a primary and a
failover. We want to run MON on the head node and have it monitor services on the
scheduling nodes, which is easy enough. However, the scheduler runs a service that
can only run on one node at a time. If the service stops, I need to try to restart
it. If it doesn't restart, I need to try to start it on the failover node and, if it
starts there, force a failover (bring down the eth card). Basically, I need MON to
check for the service on both machines, but if it's running on one, it shouldn't
bother checking the other (or at least shouldn't fire off an alert when it's not
running on the failover, since it shouldn't be).
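
To make the check concrete, here is a rough sketch of the logic as a custom
monitor script. It assumes, as I understand MON's convention, that a monitor
receives the hosts of the watched group as trailing command-line arguments and
signals failure with a non-zero exit status. The port number, the TCP probe, and
the function names are placeholders I made up for illustration; the real service
may need a different kind of check.

```python
#!/usr/bin/env python3
"""Sketch: succeed if the service answers on ANY of the given hosts."""
import socket
import sys

SERVICE_PORT = 6500  # hypothetical port; replace with the real service's port


def tcp_check(host, port=SERVICE_PORT, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def evaluate(hosts, check=tcp_check):
    """Return (exit_code, summary) for a list of hosts.

    Hosts are probed in order and probing stops at the first one that
    answers, so the failover node is not checked (and cannot trigger an
    alert) while the primary is healthy.
    """
    for host in hosts:
        if check(host):
            return 0, f"service running on {host}"
    return 1, "service not running on: " + " ".join(hosts)


# Only act as a monitor when hosts were actually passed on the command line.
if __name__ == "__main__" and sys.argv[1:]:
    code, summary = evaluate(sys.argv[1:])
    print(summary)  # one-line summary for the alert
    sys.exit(code)
```

With both scheduling nodes in a single host group, this would alert only when
the service is down on both. It does not cover the restart or forced failover
steps; those would still need to be driven by whatever alert gets fired.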

Does that make any sense?  Any ideas?

Thanks,

Mike
_______________________________________________
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon
