On 30 Jul 2009, at 02:48, Kang wrote:
>Please send the coredump file, the nagios executable and the strace
output, tarred and gzipped.
Sorry, Coredump file contains too much our infra's information to
send.
the following is gdb bt log.
> sudo gdb /usr/local/nagios/bin/nagios core.6419
.
.
Core was generated by `/usr/local/nagios/bin/nagios -uxd /usr/local/
nagios/etc/nagios.cfg'.
Program terminated with signal 11, Segmentation fault.
[New process 6419]
[New process 6421]
#0 0x000000000042a96d in cmd_delete_downtime ()
(gdb) bt
#0 0x000000000042a96d in cmd_delete_downtime ()
#1 0x00000000004277bf in process_external_command2 ()
#2 0x000000000042741c in process_external_command1 ()
#3 0x0000000000425ce9 in check_for_external_commands ()
#4 0x00000000004319df in event_execution_loop ()
#5 0x0000000000413d31 in main ()
(gdb)
We've patched cmd_delete_downtime to handle the
DEL_HOSTGROUP_HOST_DOWNTIME and DEL_HOSTGROUP_SVC_DOWNTIME commands.
I've had a look through that part of the code and it looks fine. We
use various pointers setup by Nagios to get lists of hosts and
services. It could be that either these pointers are corrupted or not
setup correctly. As we can't recreate it, there must be something
about your specific configuration.
If you can use gdb to list the variables and see if the pointers are
setup correctly, that may narrow down where the problem is, but we
can't do anything else without a problem configuration.
Ton
_______________________________________________
Opsview-users mailing list
Opsview-users@lists.opsview.org
http://lists.opsview.org/lists/listinfo/opsview-users