On 2012-11-23 10:02, jan.co...@gmail.com wrote:

Hi,

We installed Icinga 1.7.0 on a Debian 6.0 virtual machine. After a while we got a segfault, so we installed Icinga 1.8.1 from the debmon repository, but we got a segfault again.

What can we check, or where might the problem be? Here is the information from the logs:

From kern.log:

Nov 16 11:56:24 deu-s0244 kernel: [74631.877751] check_snmp_ibm_[31048]: segfault at 38 ip 0000000000000038 sp 00007fffb5dd9b70 error 14 in bash[400000+d9000]

Nov 16 11:56:52 deu-s0244 kernel: [74659.792505] icinga[31355]: segfault at 38 ip 0000000000000038 sp 00007fff6101be20 error 14 in icinga[400000+a4000]

Nov 16 12:10:18 deu-s0244 kernel: [75465.734324] sh[4711]: segfault at 38 ip 0000000000000038 sp 00007fff88f2c960 error 14 in dash[400000+19000]

Nov 16 12:20:59 deu-s0244 kernel: [76106.825141] icinga[9252]: segfault at 38 ip 0000000000000038 sp 00007fff61013d50 error 14 in icinga[400000+a4000]

Nov 16 12:53:07 deu-s0244 kernel: [78034.792423] icinga[30323]: segfault at 38 ip 0000000000000038 sp 00007fff6101be20 error 14 in icinga[400000+a4000]

Nov 17 13:55:02 deu-s0244 kernel: [168149.650410] check_snmp_ibm_[22135]: segfault at 38 ip 0000000000000038 sp 00007fffb1b061d0 error 14 in bash[400000+d9000]

Nov 17 17:46:39 deu-s0244 kernel: [182046.762417] icinga[20803]: segfault at 38 ip 0000000000000038 sp 00007fffae2445a0 error 14 in icinga[400000+a4000]

...

Nov 20 13:50:41 xxx kernel: [ 2880.448776] python[22380]: segfault at 38 ip 0000000000000038 sp 00007fff8a0068f0 error 14 in python2.6[400000+21d000]

Nov 20 13:54:26 xxx kernel: [ 3104.548646] icinga[1383]: segfault at 38 ip 0000000000000038 sp 00007fffa0b1eb00 error 14 in icinga[400000+a7000]

Nov 20 15:55:58 xxx kernel: [10397.302069] python[9209]: segfault at 38 ip 0000000000000038 sp 00007fffffffbb60 error 14 in python2.6[400000+21d000]

From icinga.debug:

[1353498301.206147] [2048.1] [pid=11777] **** END MACRO PROCESSING *************

[1353498301.206199] [001.0] [pid=11777] process_macros_r()

[1353498301.206251] [001.0] [pid=11777] process_macros_r()

[1353498301.206738] [001.0] [pid=11777] process_macros_r()

[1353498301.206793] [001.0] [pid=11777] process_macros_r()

[1353498301.206846] [001.0] [pid=11777] process_macros_r()

[1353498301.211711] [016.0] [pid=11778] running command /usr/lib/nagios/plugins/check_snmp_storage.pl -H xx.xx.xx.xx -C GET01SRVLAN -m ^[C] -w 80% -c 90% 0 -f via popen

[1353498301.409464] [016.2] [pid=11778] Moving temp check result file '/tmp/check8jCbM7' to queue file '/var/lib/icinga/spool/checkresults/cbQhIIT'...

I tried gdb:

[Thread debugging using libthread_db enabled]

[New Thread 0x7ffff61e8700 (LWP 11946)]

Program received signal SIGSEGV, Segmentation fault.

#0  0x0000000000000038 in ?? ()

No symbol table info available.

#1  0x00007ffff73cd748 in ?? () from /lib/libc.so.6

No symbol table info available.

#2  0x00007fffffffd440 in ?? ()

No symbol table info available.

#3  0x0000000000000000 in ?? ()

No symbol table info available.

rax            0x2e01   11777

rbx            0x7fffffffd420   140737488344096

rcx            0x38     56

rdx            0x246    582

rsi            0x7ffff7feb9d0   140737354054096

rdi            0x2ea5   11941

rbp            0x7fffffffd480   0x7fffffffd480

rsp            0x7fffffffd420   0x7fffffffd420

r8             0x0      0

r9             0x7fffffffd440   140737488344128

r10            0x0      0

r11            0x124ace0        19180768

r12            0x0      0

r13            0x7fffffffd440   140737488344128

r14            0x0      0

r15            0x124ace0        19180768

rip            0x38     0x38

eflags         0x252cc2 [ ZF SF DF OF #13 RF AC ID ]

cs             0x33     51

ss             0x2b     43

ds             0x0      0

es             0x0      0

fs             0x0      0

gs             0x0      0

fctrl          0x37f    895

fstat          0x0      0

ftag           0xffff   65535

fiseg          0x0      0

fioff          0x0      0

foseg          0x0      0

fooff          0x0      0

fop            0x0      0

mxcsr          0x1fa0   [ PE IM DM ZM OM UM PM ]

0x38:   Cannot access memory at address 0x38

Thread 2 (Thread 0x7ffff61e8700 (LWP 11946)):

#0  0x00007ffff7132c13 in poll () from /lib/libc.so.6

#1  0x00000000004457d6 in command_file_worker_thread (arg=<value optimized out>) at utils.c:3784

#2  0x00007ffff73d68ca in start_thread () from /lib/libpthread.so.0

#3  0x00007ffff713db6d in clone () from /lib/libc.so.6

#4  0x0000000000000000 in ?? ()

Thread 1 (Thread 0x7ffff7feb700 (LWP 11941)):

#0  0x0000000000000038 in ?? ()

#1  0x00007ffff73cd748 in ?? () from /lib/libc.so.6

#2  0x00007fffffffd440 in ?? ()

#3  0x0000000000000000 in ?? ()

A debugging session is active.

Inferior 1 [process 11941] will be killed.

Quit anyway? (y or n) [answered Y; input not from terminal]


I'd like to see "bt full" directly after the SIGSEGV happened. Please post the entire output.
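
One way to capture that without typing into an interactive session is to let gdb run in batch mode. The following is only a rough sketch: the binary path /usr/sbin/icinga and the config path /etc/icinga/icinga.cfg are assumptions about a typical package install, and the helper name gdb_bt_full_command is made up for illustration. It only builds and prints the command line. Run the printed command yourself and redirect its output to a file. Without debug symbols for icinga and libc installed, the frames will probably still show up as "??".

```python
import shlex


def gdb_bt_full_command(binary, args):
    """Build a gdb batch command that runs `binary` and dumps backtraces.

    gdb executes the -ex commands in order: "run" returns when the program
    stops (for example on SIGSEGV), then the backtrace commands are executed.
    """
    return [
        "gdb", "-batch",
        "-ex", "run",
        "-ex", "bt full",                   # full backtrace of the crashing thread
        "-ex", "thread apply all bt full",  # the same for every thread
        "-ex", "info registers",
        "--args", binary, *args,            # everything after --args goes to the program
    ]


if __name__ == "__main__":
    # Assumed paths; adjust them to the actual installation.
    cmd = gdb_bt_full_command("/usr/sbin/icinga", ["/etc/icinga/icinga.cfg"])
    print(" ".join(shlex.quote(part) for part in cmd))
```

Redirecting the output of the printed command into a file gives one complete log that can be attached to a reply.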


Let me know if you need more information.

Thanks,
jan




------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov


_______________________________________________
icinga-users mailing list
icinga-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/icinga-users


--
DI (FH) Michael Friedrich

mail:     michael.friedr...@gmail.com
twitter:  https://twitter.com/dnsmichi
jabber:   dnsmi...@jabber.ccc.de
irc:      irc.freenode.net/icinga dnsmichi

icinga open source monitoring
position: lead core developer
url:      https://www.icinga.org
