On 20 May 2010, at 18:50, Rafael Carneiro wrote:

> Since upgrading to 3.7 (Ubuntu 10.04 x86_64, build 3.7.0.4272) I've seen it happen a couple of times. It seems that Nagios crashes right after you cancel a downtime. The last line in nagios.log after the crash is: [1274376186] EXTERNAL COMMAND: DEL_HOST_SVC_DOWNTIME;CVH-VMS-001. I remember doing the same thing from the Opsview interface the previous time it crashed, so I believe DEL_HOST_SVC_DOWNTIME is causing it.

I've just tested on etch, sol10, lenny and rhel5: I can set a downtime for a host group and then cancel it without the daemon dying.

I've also tried a current downtime plus a future downtime on Ubuntu lucid, and that works too.

Is this on a distributed master or slave?

> Anyone else seeing anything like that? Where could I look for clues?

Is there a core dump file? If not, enable Nagios core dumps in System Preferences. You may also want to increase Nagios debugging in nagios.cfg (for example the debug_level and debug_verbosity settings).

An strace on the process while you deliver the downtime might give some clues too.
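
When matching nagios.log against a core file or strace output, it can help to turn the epoch timestamps into readable times. Below is a minimal sketch in Python, assuming the usual "[epoch] message" log line format; the function name last_external_command and the sample list are hypothetical helpers for illustration, not part of Opsview or Nagios.

```python
import re
from datetime import datetime, timezone

# Assumed nagios.log line shape: "[<epoch seconds>] <message>"
LOG_LINE = re.compile(r"^\[(\d+)\]\s+(.*)$")
PREFIX = "EXTERNAL COMMAND:"


def last_external_command(lines):
    """Return (utc_datetime, command_text) for the last EXTERNAL COMMAND
    entry in the given log lines, or None if there is none."""
    last = None
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if not match:
            continue  # skip anything that is not a timestamped log line
        epoch, message = match.groups()
        if message.startswith(PREFIX):
            when = datetime.fromtimestamp(int(epoch), tz=timezone.utc)
            last = (when, message[len(PREFIX):].strip())
    return last


# Sample taken from the log line quoted earlier in this thread.
sample = ["[1274376186] EXTERNAL COMMAND: DEL_HOST_SVC_DOWNTIME;CVH-VMS-001"]
when, command = last_external_command(sample)
print(when.isoformat(), command)
# prints: 2010-05-20T17:23:06+00:00 DEL_HOST_SVC_DOWNTIME;CVH-VMS-001
```

To run it against a real log, pass an open file object instead of the sample list; the path to nagios.log depends on the installation.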

> Were there any changes to the way that's handled by Opsview?

There were additions to downtime handling in 3.5.2, but nothing specifically springs to mind for 3.7.0. We did upgrade Nagios to 3.2.1, but I don't think there was anything particular there.

Ton

_______________________________________________
Opsview-users mailing list
Opsview-users@lists.opsview.org
http://lists.opsview.org/lists/listinfo/opsview-users
