Re: [opsview-users] Opsview daemon occasionally dies when applying "Cancel all hostgroup downtime"

Ton Voon Mon, 10 Aug 2009 14:15:36 -0700


On 30 Jul 2009, at 02:48, Kang wrote:

>Please send the coredump file, the nagios executable and the strace output, tarred and gzipped.
Sorry, the coredump file contains too much information about our infrastructure to send.
The following is the gdb bt log.
> sudo gdb /usr/local/nagios/bin/nagios core.6419
.
.
Core was generated by `/usr/local/nagios/bin/nagios -uxd /usr/local/nagios/etc/nagios.cfg'.
Program terminated with signal 11, Segmentation fault.
[New process 6419]
[New process 6421]
#0  0x000000000042a96d in cmd_delete_downtime ()
(gdb) bt
#0  0x000000000042a96d in cmd_delete_downtime ()
#1  0x00000000004277bf in process_external_command2 ()
#2  0x000000000042741c in process_external_command1 ()
#3  0x0000000000425ce9 in check_for_external_commands ()
#4  0x00000000004319df in event_execution_loop ()
#5  0x0000000000413d31 in main ()
(gdb)

We've patched cmd_delete_downtime to handle the DEL_HOSTGROUP_HOST_DOWNTIME and DEL_HOSTGROUP_SVC_DOWNTIME commands.

I've had a look through that part of the code and it looks fine. We use various pointers set up by Nagios to get lists of hosts and services. It could be that these pointers are either corrupted or not set up correctly. As we can't recreate it, there must be something about your specific configuration that triggers it.

If you can use gdb to list the variables and see if the pointers are set up correctly, that may narrow down where the problem is, but we can't do anything else without a configuration that reproduces the problem.
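
As a rough sketch of what to try in the same gdb session: the backtrace shows no function arguments, so I'm assuming the binary has no debug symbols and only raw memory can be examined. I'm also assuming the global list heads are named host_list, hostgroup_list and scheduled_downtime_list, as in the stock Nagios 3 source; the names in your build may differ. The x/gx format assumes a 64-bit binary, which the addresses in the backtrace suggest.

    (gdb) info registers
    (gdb) x/i $pc
    (gdb) x/gx &host_list
    (gdb) x/gx &hostgroup_list
    (gdb) x/gx &scheduled_downtime_list

The first two commands show the faulting instruction and the register values it was using. The last three print the current value of each list head pointer. A zero or obviously invalid address there would point to a list that was not set up or has been corrupted.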


Ton

