It's a distributed environment in which all but 20 boxes are monitored
by slaves (2 clusters of 2 slaves each, about 600 hosts being monitored).

I seem to be able to reproduce the crash by scheduling downtime for a host
group and then deleting it.

I've set debug_level=-1, but this is still the only thing I see in
nagios.log before it crashes:

[1274383762] EXTERNAL COMMAND: DEL_HOSTGROUP_SVC_DOWNTIME;hostgroup_name
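
For anyone trying to line the crash up against other logs, here is a minimal sketch that pulls the timestamp and command out of a nagios.log line like the one above. It assumes the "[epoch] EXTERNAL COMMAND: NAME;args" layout seen in this thread; the function name parse_external_command is made up for illustration and is not part of Nagios or Opsview.

```python
import re
from datetime import datetime, timezone

# Assumed layout, taken from the log line quoted in this thread:
# [1274383762] EXTERNAL COMMAND: DEL_HOSTGROUP_SVC_DOWNTIME;hostgroup_name
LINE_RE = re.compile(r"^\[(\d+)\] EXTERNAL COMMAND: ([A-Z0-9_]+)(?:;(.*))?$")


def parse_external_command(line):
    """Return (utc_datetime, command, args) or None if the line does not match.

    Hypothetical helper for illustration only.
    """
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    epoch, command, raw_args = match.groups()
    # nagios.log timestamps are seconds since the Unix epoch.
    when = datetime.fromtimestamp(int(epoch), tz=timezone.utc)
    args = raw_args.split(";") if raw_args else []
    return (when, command, args)


if __name__ == "__main__":
    sample = "[1274383762] EXTERNAL COMMAND: DEL_HOSTGROUP_SVC_DOWNTIME;hostgroup_name"
    print(parse_external_command(sample))
```

Having the time in UTC makes it easier to match the last logged command against syslog or kernel messages from around the moment the daemon died.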

I had core dumps enabled, but I don't know where to look for them (and I'm
not sure they're being created).
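
One way to narrow this down on Linux is to check the core size limit and the kernel's core_pattern setting. The sketch below is a rough illustration under those assumptions: core_dump_info is a hypothetical helper name, and the pid argument is meant to be the PID of the running nagios daemon, which you would have to look up yourself.

```python
import resource
from pathlib import Path


def core_dump_info(pid=None):
    """Return (core_limit_description, core_pattern) on Linux.

    Hypothetical helper for illustration. With pid=None it reports the
    limit of the current process; with a pid it reads /proc/<pid>/limits,
    which is what matters for an already running daemon.
    """
    pattern_file = Path("/proc/sys/kernel/core_pattern")
    pattern = pattern_file.read_text().strip() if pattern_file.exists() else None

    if pid is None:
        soft, hard = resource.getrlimit(resource.RLIMIT_CORE)

        def show(value):
            return "unlimited" if value == resource.RLIM_INFINITY else str(value)

        limit = f"soft={show(soft)} hard={show(hard)}"
    else:
        limit = None
        for line in Path(f"/proc/{pid}/limits").read_text().splitlines():
            if line.startswith("Max core file size"):
                limit = line.strip()
                break
    return (limit, pattern)


if __name__ == "__main__":
    limit, pattern = core_dump_info()
    print("core size limit:", limit)
    print("core_pattern:   ", pattern)
```

A soft limit of 0 means no core file will be written. If core_pattern is a relative name such as "core", the dump lands in the working directory of the crashed process; if it starts with "|", the kernel pipes the dump to a helper program instead of writing a file.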

Rafael


On Thu, May 20, 2010 at 3:10 PM, Ton Voon <ton.v...@opsera.com> wrote:

>
> On 20 May 2010, at 18:50, Rafael Carneiro wrote:
>
>  Since upgrading to 3.7 (on Ubuntu 10.04 x86_64, Build: 3.7.0.4272) I've
>> seen it happen a couple of times.
>> It seems to me that, after you cancel the downtime, nagios crashes (this
>> is the last line after the crash - nagios.log: [1274376186] EXTERNAL
>> COMMAND: DEL_HOST_SVC_DOWNTIME;CVH-VMS-001).
>> I do remember doing the same thing from the Opsview interface when it
>> crashed the time before that, so I believe that the DEL_HOST_SVC_DOWNTIME is
>> causing it.
>>
>
> I've just done a test on etch, sol10, lenny, rhel5 and I can set a downtime
> for a host group, and then cancel it without the daemon dying.
>
> I've also done a current downtime + a future downtime on Ubuntu lucid and
> that is okay too.
>
> Is this on a distributed master or slave?
>
>
>  Anyone else seeing anything like that? Where could I look for clues?
>>
>
> Is there a core dump file? Enable nagios core dumps in System Preferences.
> You may want to increase nagios debugging in nagios.cfg.
>
> An strace on the process while you deliver the downtime might give some
> clues too.
>
>
>  Were there any changes to the way that's handled by Opsview?
>>
>
> There were additions to downtime handling in 3.5.2, but nothing
> specifically springs to mind for 3.7.0. We did upgrade Nagios to 3.2.1, but
> I don't think there was anything particular there.
>
> Ton
>
> _______________________________________________
> Opsview-users mailing list
> Opsview-users@lists.opsview.org
> http://lists.opsview.org/lists/listinfo/opsview-users
>



-- 
Rafael Carneiro
http://ca.linkedin.com/in/rcarneiro