Nicolas KOWALSKI <[EMAIL PROTECTED]> writes: > For information, after modifying the scheduler as described above, > our monitoring server did not have any fork problem anymore: last > week it launched 2.5M forks (monitors and alerts) happily.
We tracked down the source of the problem. In our logfiles, we sometimes see the following: 2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-stat.alert: Argument list too long 2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-sendtrap.alert: Argument list too long 2008-02-19 03:56:56 err: call_alert: could not exec alert /apps/Minotaure/lib/alert.d/wh-kpi.alert: Argument list too long This explains why we ended up with several mon processes. With our patch (exit if exec fails during call_alert), this does not happen anymore. -- Nicolas _______________________________________________ mon mailing list mon@linux.kernel.org http://linux.kernel.org/mailman/listinfo/mon