Nicolas KOWALSKI <[EMAIL PROTECTED]> writes:

> For information, after modifying the scheduler as described above,
> our monitoring server did not have any fork problem anymore: last
> week it launched 2.5M forks (monitors and alerts) happily.

We tracked down the source of the problem. In our logfiles, we
sometimes see the following:

2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-stat.alert: Argument list too long
2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-sendtrap.alert: Argument list too long
2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-kpi.alert: Argument list too long

This explains why we ended up with several mon processes: when the
exec fails, the forked child does not exit and keeps running as
another copy of mon.

With our patch (exit if exec fails during call_alert), this does not
happen anymore.
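
For readers who want to see the idea behind the patch, here is a
minimal sketch of the fork/exec pattern with an explicit exit when the
exec fails. It is written in Python for illustration only and is not
the actual mon code; the function names call_alert and run_alert and
the exit code 127 are assumptions made for this example.

```python
import os


def call_alert(path, args):
    """Fork and exec an alert script; return the child's PID to the parent.

    Hypothetical stand-in for the call_alert step, not mon's real code.
    """
    pid = os.fork()
    if pid != 0:
        # Parent: carry on scheduling and reap the child later.
        return pid

    # Child process from here on.
    try:
        os.execv(path, [path] + list(args))
    except OSError as err:
        # exec failed, e.g. with E2BIG ("Argument list too long").
        # Without this exit the child would return to the caller and
        # keep running as a second copy of the parent program.
        msg = "could not exec alert %s: %s\n" % (path, err.strerror)
        os.write(2, msg.encode())
        os._exit(127)  # assumed exit code, chosen for this sketch


def run_alert(path, args):
    """Run one alert, wait for it, and return its exit status."""
    pid = call_alert(path, args)
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status)
```

The child uses os._exit rather than a normal exit so that it leaves
immediately, without running cleanup handlers inherited from the
parent.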

-- 
Nicolas

_______________________________________________
mon mailing list
mon@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/mon